ROCm/ROCmValidationSuite
A system validation and diagnostics tool for monitoring and stress testing AMD GPUs, and for detecting and troubleshooting issues that affect them in high-performance computing environments.
GitHub repository with 103 stars and 44 forks.
Language: C++