[ML] Improve performance of closing files before spawning #2424

droberts195 · 2022-11-29T12:01:47Z

Before spawning new processes from the controller we close all open file descriptors except for stdin, stdout and stderr.

Previously this was done by checking every possible file descriptor to see if it was open, but this is very expensive if the file descriptor limit is high. During a lookback a large number of normalize processes get started, and this could lead to significant CPU usage by the controller process.

This change makes the file closure code learn the highest open file descriptor each time it is used, and work on the basis that no more than 10 files will be opened in between calls to it. This significantly reduces controller CPU usage on machines that have high file descriptor limits and run a lot of normalize processes.

Before spawning new processes from the `controller` we close all open file descriptors except for stdin, stdout and stderr. Previously this was done by checking every possible file descriptor to see if it was open, but this is very expensive if the file descriptor limit is high. During a lookback a large number of `normalize` processes get started, and this could lead to significant CPU usage by the `controller` process. This change makes the file closure code learn the highest open file descriptor each time it is used, and work on the basis that no more than 10 files will be opened in between calls to it. This significantly reduces `controller` CPU usage on machines that have high file descriptor limits and run a lot of `normalize` processes.

edsavage

LGTM

Before spawning new processes from the `controller` we close all open file descriptors except for stdin, stdout and stderr. Previously this was done by checking every possible file descriptor to see if it was open, but this is very expensive if the file descriptor limit is high. During a lookback a large number of `normalize` processes get started, and this could lead to significant CPU usage by the `controller` process. This change makes the file closure code learn the highest open file descriptor each time it is used, and work on the basis that no more than 10 files will be opened in between calls to it. This significantly reduces `controller` CPU usage on machines that have high file descriptor limits and run a lot of `normalize` processes. Backport of elastic#2424

Before spawning new processes from the `controller` we close all open file descriptors except for stdin, stdout and stderr. Previously this was done by checking every possible file descriptor to see if it was open, but this is very expensive if the file descriptor limit is high. During a lookback a large number of `normalize` processes get started, and this could lead to significant CPU usage by the `controller` process. This change makes the file closure code learn the highest open file descriptor each time it is used, and work on the basis that no more than 10 files will be opened in between calls to it. This significantly reduces `controller` CPU usage on machines that have high file descriptor limits and run a lot of `normalize` processes. Backport of #2424

droberts195 added >bug :ml v8.6.0 v7.17.9 v8.7.0 labels Nov 29, 2022

droberts195 requested a review from edsavage November 30, 2022 13:40

edsavage approved these changes Dec 5, 2022

View reviewed changes

droberts195 merged commit 698dd92 into elastic:main Dec 5, 2022

droberts195 deleted the more_efficient_file_closure_before_spawn branch December 5, 2022 09:51

droberts195 mentioned this pull request Dec 5, 2022

[8.6] [ML] Improve performance of closing files before spawning #2426

Merged

droberts195 mentioned this pull request Dec 5, 2022

[7.17] [ML] Improve performance of closing files before spawning #2427

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML] Improve performance of closing files before spawning #2424

[ML] Improve performance of closing files before spawning #2424

Uh oh!

droberts195 commented Nov 29, 2022

Uh oh!

edsavage left a comment

Uh oh!

Uh oh!

[ML] Improve performance of closing files before spawning #2424

[ML] Improve performance of closing files before spawning #2424

Uh oh!

Conversation

droberts195 commented Nov 29, 2022

Uh oh!

edsavage left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!