Show HN: Deterministic PCIe Diagnostics for GPUs on Linux
Posted by gpu_systems 9 hours ago
I built a small Linux tool to deterministically verify GPU PCIe link health and bandwidth.
It reports: - Negotiated PCIe generation and width - Peak Host→Device and Device→Host memcpy bandwidth - Sustained PCIe TX/RX utilization via NVML - A rule-based verdict derived from observable hardware data only
This exists because PCIe issues (Gen downgrades, reduced lane width, risers, bifurcation) are often invisible at the application layer and can’t be fixed by kernel tuning or async overlap.
Linux-only: it relies on sysfs and PCIe AER exposure that Windows does not provide.
Comments
Comment by AuthAuth 7 hours ago
Comment by wtallis 8 hours ago
Comment by kimixa 5 hours ago
Though I don't think there's anything particularly device-specific they're measuring, they're using the private nvidia interfaces to do so.
Comment by cr125rider 2 hours ago