Mattral/SIMD-Microkernels-for-ML-Workloads
Lightweight C++ implementations of SIMD-optimized microkernels for ML primitives, with Python bindings, benchmark automation, and optional OpenMP support.
GitHub repository with 11 stars and 4 forks.
Language: Makefile
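
To illustrate what a SIMD-friendly microkernel for an ML primitive can look like, here is a minimal sketch of a dot product in portable C++. It is not taken from the repository: the function name `dot_unrolled` and the choice of four accumulators are assumptions made for illustration. It uses no intrinsics and instead keeps four independent partial sums, a layout that an optimizing compiler may map onto vector lanes.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical example, not code from the repository.
// Dot product with four independent accumulators. Splitting the sum this way
// removes the serial dependency on a single accumulator, which gives the
// compiler the freedom to keep the partial sums in one SIMD register.
float dot_unrolled(const std::vector<float>& a, const std::vector<float>& b) {
    assert(a.size() == b.size());
    const std::size_t n = a.size();

    float acc[4] = {0.0f, 0.0f, 0.0f, 0.0f};
    std::size_t i = 0;

    // Main loop: process four elements per iteration, one per accumulator.
    for (; i + 4 <= n; i += 4) {
        for (std::size_t lane = 0; lane < 4; ++lane) {
            acc[lane] += a[i + lane] * b[i + lane];
        }
    }

    // Horizontal reduction of the four partial sums.
    float sum = (acc[0] + acc[1]) + (acc[2] + acc[3]);

    // Scalar tail for the remaining n % 4 elements.
    for (; i < n; ++i) {
        sum += a[i] * b[i];
    }
    return sum;
}
```

Because floating-point addition is not associative, the result can differ in the last bits from a plain left-to-right sum. Whether the loop is actually vectorized depends on the compiler and its optimization flags.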