Refactor perf collector to better handle failures and warn on opens by hodgesds · Pull Request #2190 · prometheus/node_exporter

hodgesds · 2021-11-01T00:08:37Z

This changes the perf collector to handle partial failures even better by checking the HasProfilers method on the various profilers interfaces. The configuration of the profiler interfaces have changed so that an optional bitmask of profilers can be set. In theory this could make for much finer control of which profilers are enabled, but that is beyond the scope of this change. The profiler interfaces are also changed so that in theory object pools could be used to reduce memory allocations.

Tested running locally and then:

curl localhost:9100/metrics | grep perf | grep 'cpu="0"'
node_perf_branch_instructions_total{cpu="0"} 8.20418e+06
node_perf_branch_misses_total{cpu="0"} 162365
node_perf_cache_bpu_read_hits_total{cpu="0"} 9.985934e+06
node_perf_cache_bpu_read_misses_total{cpu="0"} 221845
node_perf_cache_l1_instr_read_misses_total{cpu="0"} 181514
node_perf_cache_l1d_read_hits_total{cpu="0"} 1.4483013e+07
node_perf_cache_l1d_read_misses_total{cpu="0"} 685931
node_perf_cache_misses_total{cpu="0"} 141926
node_perf_cache_refs_total{cpu="0"} 2.173699e+06
node_perf_cache_tlb_instr_read_hits_total{cpu="0"} 849
node_perf_cache_tlb_instr_read_misses_total{cpu="0"} 603
node_perf_context_switches_total{cpu="0"} 3930
node_perf_cpu_migrations_total{cpu="0"} 3
node_perf_cpucycles_total{cpu="0"} 1.87280477e+08
node_perf_instructions_total{cpu="0"} 4.4194532e+07
node_perf_major_faults_total{cpu="0"} 0
node_perf_minor_faults_total{cpu="0"} 1352
node_perf_page_faults_total{cpu="0"} 1352

Signed-off-by: Daniel Hodges hodges.daniel.scott@gmail.com

hodgesds · 2021-11-01T00:10:06Z

collector/perf_linux.go

I may consider using AllCacheProfilers for clearer code but that may enable some profilers that aren't used and cause more multiplexing of perf events on the CPU.

hodgesds · 2021-11-01T00:24:00Z

collector/perf_linux.go

This function is just a wrapper around unix.Getrlimit.

hodgesds · 2021-11-01T00:32:42Z

collector/perf_linux.go

In theory these could come from an object pool so extra allocations aren't required.

hodgesds · 2021-11-01T00:37:31Z

After this diff I'd like to add stalled frontend/backend cycles as it can be used to give a better picture of instruction performance. Here's a post that has some background info as well as some of the ways that the perf subsystem exposes those metrics. They are already available in the perf-utils library, but haven't been added yet.

discordianfish · 2021-11-01T10:28:57Z

Is it possible to just attempt opening the profiler handles and fail with a meaningful error? Not sure how I feel about checking open files. All(?) other collectors just attempt opening stuff and fail if they can't.

hodgesds · 2021-11-01T13:13:06Z

Is it possible to just attempt opening the profiler handles and fail with a meaningful error?

Potentially, the problem is some of the profilers could be unavailable due to kernel configurations or even physical hardware (ex: CPU doesn't implement all hardware counters). The other problem is there is a large combination of different profiler handle types so having one flag to enable them all is a little problematic.

Not sure how I feel about checking open files. All(?) other collectors just attempt opening stuff and fail if they can't.

That's reasonable if setting up the profiler handles fail fast. I was almost thinking that check would be better as a info check during start up to let users know if they are using default configs that may need tuned (IIRC RLIMIT_NOFILE is 1024).

discordianfish · 2021-11-03T10:59:44Z

Potentially, the problem is some of the profilers could be unavailable due to kernel configurations or even physical hardware (ex: CPU doesn't implement all hardware counters).

But then the call can fail and we return a meaningful error to the user, right?

The other problem is there is a large combination of different profiler handle types so having one flag to enable them all is a little problematic
But this is orthogonal to the problem here, right? Maybe we should have a flag for each of them?

That's reasonable if setting up the profiler handles fail fast.
Why wouldn't they fail fast?

All in all, I might still miss something but I don't understand why we can't just open the handlers and return an error that is meaningful if it fails (like 'out of fds' vs 'cpu doesn't support hw counters' vs 'perf not enabled in kernel'). That'd be ideal I think?

hodgesds · 2021-11-10T01:04:59Z

All in all, I might still miss something but I don't understand why we can't just open the handlers and return an error that is meaningful if it fails (like 'out of fds' vs 'cpu doesn't support hw counters' vs 'perf not enabled in kernel'). That'd be ideal I think?

Here's a simple example, if running in a virtual you may not have access to the underlying hardware counters, but still may have access to software counters from the kernel. We can make it fail fast, but that makes the perf collector basically worthless for use inside a VM. Alternatively, it could do something like a best effort of setting up all the relevant profilers and fail if none are available. The other tricky part with errors return from perf_event_open are most likely EOPNOTSUPP, EINVAL and EACCES for failures. I'm guessing handling EOPNOTSUPP is the tricky part to get right as that is returned if the desired counters are available.

discordianfish · 2021-11-12T10:29:31Z

Alternatively, it could do something like a best effort of setting up all the relevant profilers and fail if none are available

Yeah, that, plus logging the ones that can't be setup, sounds reasonable. I'd get rid of the file handlers check here though and return an error if out of fds.

@SuperQ wdyt?

discordianfish · 2021-11-15T09:48:48Z

I think this needs rebasing to pick up the macos fix.

…le limits Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>

…le limits Origin: prometheus#2190 Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>

SuperQ

This needs a rebase.

hodgesds commented Nov 1, 2021

View reviewed changes

hodgesds force-pushed the perf-refactor branch 2 times, most recently from 94a0c8d to a18a787 Compare November 1, 2021 00:19

hodgesds commented Nov 1, 2021

View reviewed changes

collector/perf_linux.go Outdated

Copy link

Contributor Author

hodgesds Nov 1, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This function is just a wrapper around unix.Getrlimit.

hodgesds force-pushed the perf-refactor branch 2 times, most recently from 360ae91 to e78e5d7 Compare November 1, 2021 00:31

hodgesds commented Nov 1, 2021

View reviewed changes

collector/perf_linux.go Outdated

Copy link

Contributor Author

hodgesds Nov 1, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In theory these could come from an object pool so extra allocations aren't required.

hodgesds mentioned this pull request Nov 1, 2021

Release 1.3.0 #2168

Merged

hodgesds force-pushed the perf-refactor branch 2 times, most recently from 843f661 to dcd2665 Compare November 3, 2021 00:24

hodgesds force-pushed the perf-refactor branch from dcd2665 to 0827fab Compare November 12, 2021 14:29

Refactor perf collector to better handle failures and warn on open fi…

f9ee4de

…le limits Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>

hodgesds force-pushed the perf-refactor branch from 0827fab to f9ee4de Compare November 15, 2021 17:58

bdrung pushed a commit to bdrung/node_exporter that referenced this pull request Jan 10, 2022

Refactor perf collector to better handle failures and warn on open fi…

93651c6

…le limits Origin: prometheus#2190 Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>

bdrung pushed a commit to bdrung/node_exporter that referenced this pull request Jan 15, 2022

Refactor perf collector to better handle failures and warn on open fi…

b846085

…le limits Origin: prometheus#2190 Signed-off-by: Daniel Hodges <hodges.daniel.scott@gmail.com>

hodgesds mentioned this pull request Sep 14, 2022

perf collector not working on EPYC CPU #2469

Open

SuperQ requested changes May 24, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Refactor perf collector to better handle failures and warn on opens#2190

Refactor perf collector to better handle failures and warn on opens#2190
hodgesds wants to merge 1 commit intoprometheus:masterfrom
hodgesds:perf-refactor

hodgesds commented Nov 1, 2021

Uh oh!

hodgesds Nov 1, 2021

Uh oh!

hodgesds Nov 1, 2021

Uh oh!

hodgesds Nov 1, 2021

Uh oh!

hodgesds commented Nov 1, 2021

Uh oh!

discordianfish commented Nov 1, 2021

Uh oh!

hodgesds commented Nov 1, 2021

Uh oh!

discordianfish commented Nov 3, 2021

Uh oh!

hodgesds commented Nov 10, 2021

Uh oh!

discordianfish commented Nov 12, 2021

Uh oh!

discordianfish commented Nov 15, 2021

Uh oh!

SuperQ left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

hodgesds commented Nov 1, 2021

Uh oh!

hodgesds Nov 1, 2021

Choose a reason for hiding this comment

Uh oh!

hodgesds Nov 1, 2021

Choose a reason for hiding this comment

Uh oh!

hodgesds Nov 1, 2021

Choose a reason for hiding this comment

Uh oh!

hodgesds commented Nov 1, 2021

Uh oh!

discordianfish commented Nov 1, 2021

Uh oh!

hodgesds commented Nov 1, 2021

Uh oh!

discordianfish commented Nov 3, 2021

Uh oh!

hodgesds commented Nov 10, 2021

Uh oh!

discordianfish commented Nov 12, 2021

Uh oh!

discordianfish commented Nov 15, 2021

Uh oh!

SuperQ left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants