NVIDIA/cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

GitHub repository with 9,893 stars and 1,906 forks.

Language: C++

Topics: cuda, deep-learning, deep-learning-library, cpp, nvidia, gpu, python

Open provider repository

Latest metric snapshot

2026-06-14: 9,893 stars and 1,906 forks.

Similar repositories

1. Luce-Org/lucebox-hub

Fast LLM speculative inference server for consumer hardware.

GitHub repository with 2,491 stars and 229 forks.

Trending score: 2.88; stars gained: +27; forks gained: +6.

Language: C++

Topics: kernel, llama-cpp, local-ai, nvidia-cuda, qwen, rtx3090
2. MrNeRF/LichtFeld-Studio

Train, inspect, edit, automate, and export 3D Gaussian Splatting scenes from a single native application.

GitHub repository with 3,230 stars and 359 forks.

Trending score: 2.12; stars gained: +6; forks gained: +0.

Language: C++

Topics: cuda, gaussian-splatting, optimization, computer-graphics, computer-vision
3. shader-slang/slang

Making it easier to work with shaders

GitHub repository with 5,371 stars and 455 forks.

Trending score: 1.68; stars gained: +4; forks gained: +0.

Language: C++

Topics: cuda, d3d12, glsl, hlsl, shaders, vulkan
4. ROCm/hip

HIP: C++ Heterogeneous-Compute Interface for Portability

GitHub repository with 4,347 stars and 587 forks.

Trending score: 1.33; stars gained: +3; forks gained: +5.

Language: C++

Topics: hip, hip-runtime, hip-portability, hip-kernel-language, hipify, cuda
5. NVIDIA/cccl

CUDA Core Compute Libraries

GitHub repository with 2,380 stars and 410 forks.

Trending score: 1.21; stars gained: +1; forks gained: +2.

Language: C++

Topics: accelerated-computing, cpp, cpp-programming, cuda, cuda-cpp, cuda-kernels
6. LuisaGroup/LuisaCompute

High-Performance Rendering Framework on Stream Architectures

GitHub repository with 1,023 stars and 101 forks.

Trending score: 0.90; stars gained: +5; forks gained: +0.

Language: C++

Topics: cpu, gpu, high-performance, cross-platform, cuda, directx

Trending in C++

1. ggml-org/llama.cpp

LLM inference in C/C++

GitHub repository with 116,599 stars and 19,593 forks.

Trending score: 4.92; stars gained: +285; forks gained: +59.

Language: C++

Topics: ggml
2. opencv/opencv

Open Source Computer Vision Library

GitHub repository with 89,158 stars and 56,658 forks.

Trending score: 4.35; stars gained: +147; forks gained: +16.

Language: C++

Topics: c-plus-plus, computer-vision, deep-learning, image-processing, opencv
3. alibaba/zvec

A lightweight, lightning-fast, in-process vector database

GitHub repository with 10,064 stars and 583 forks.

Trending score: 4.23; stars gained: +283; forks gained: +17.

Language: C++

Topics: agent-skills, db, embedded, faiss, hnsw, llm-memory
4. ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

GitHub repository with 50,727 stars and 5,661 forks.

Trending score: 3.92; stars gained: +155; forks gained: +26.

Language: C++

Topics: inference, openai, speech-recognition, speech-to-text, transformer, whisper
5. noctalia-dev/noctalia

A sleek and minimal desktop shell thoughtfully crafted for Wayland.

GitHub repository with 7,750 stars and 545 forks.

Trending score: 3.82; stars gained: +91; forks gained: +10.

Language: C++

Topics: dotfiles, hyprland, linux, niri, noctalia, quickshell
6. ml-explore/mlx

MLX: An array framework for Apple silicon

GitHub repository with 27,010 stars and 1,908 forks.

Trending score: 3.68; stars gained: +58; forks gained: +10.

Language: C++

Topics: mlx

NVIDIA/cutlass

Latest metric snapshot

Similar repositories

1. Luce-Org/lucebox-hub

2. MrNeRF/LichtFeld-Studio

3. shader-slang/slang

4. ROCm/hip

5. NVIDIA/cccl

6. LuisaGroup/LuisaCompute

Trending in C++

1. ggml-org/llama.cpp

2. opencv/opencv

3. alibaba/zvec

4. ggml-org/whisper.cpp

5. noctalia-dev/noctalia

6. ml-explore/mlx

Trending topic: cuda

1. LMCache/LMCache

2. vllm-project/vllm

3. sgl-project/sglang

4. Luce-Org/lucebox-hub

5. NVlabs/cuda-oxide

6. unilabsim/UniLab