@naoki9911 naoki9911 commented May 29, 2025

The link status of PCIe devices can change at runtime: a link may be downgraded in speed or width, or a device may disappear entirely. This patch adds a collector that gathers PCIe devices' link information so that such failures can be detected.

As a first step, this collector exports the following for each PCIe device:

  • Device information (vendor_id, device_id, etc.)
  • Parent PCIe device (e.g. PCIe bridge, PCIe switch)
  • Link status (max_link_{speed|width}, current_link_{speed|width})

https://groups.google.com/g/prometheus-developers/c/0GJTs2OjvCs

Signed-off-by: Naoki MATSUMOTO <m.naoki9911@gmail.com>
@SuperQ SuperQ left a comment

Very nice, thanks!

@SuperQ SuperQ merged commit 6176818 into prometheus:master Jul 3, 2025
6 checks passed