NVIDIA/cuopt
GPU accelerated decision optimization
GitHub repository with 943 stars and 196 forks.
Language: Cuda
Topics: gpu, linear-programming, optimization, cuda
2026-06-13: 943 stars and 196 forks.
Graphics Processing Units Molecular Dynamics
GitHub repository with 786 stars and 187 forks.
Trending score: 0.83; stars gained: +1; forks gained: -1.
Language: Cuda
Topics: molecular-dynamics-simulation, heat-transport, cuda, molecular-dynamics, gpumd, phonon
Header-only CUDA runtime library for automatic dependency-aware kernel scheduling based on memory access semantics.
GitHub repository with 6 stars and 0 forks.
Trending score: 0.55; stars gained: +1; forks gained: +0.
Language: Cuda
Topics: cpp, cpp17, cuda, cuda-driver-api, gpu, header-only
Personal CUDA learning repo, built step by step from scratch.
GitHub repository with 39 stars and 5 forks.
Trending score: 0.41; stars gained: +1; forks gained: +0.
Language: Cuda
Topics: cpp, cuda, gpu, gpu-computing, gpu-optimization, gpu-programming
A plugin to use Nvidia GPU in PySCF package
GitHub repository with 317 stars and 63 forks.
Trending score: 0.09; stars gained: +0; forks gained: +1.
Language: Cuda
Topics: gpu
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
GitHub repository with 1,223 stars and 214 forks.
Trending score: 2.00; stars gained: +6; forks gained: +4.
Language: Cuda
Topics: gpt, inference, llama, llm, llm-serving, llmops
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
GitHub repository with 2,313 stars and 220 forks.
Trending score: 1.87; stars gained: +6; forks gained: +1.
Language: Cuda
Suckless no dependencies CUDA/C++ port of NVIDIA's DVLT, feed it images, get a 3D point cloud + camera poses. NO python. One fast 5MB binary.
GitHub repository with 53 stars and 8 forks.
Trending score: 0.93; stars gained: +1; forks gained: +0.
Language: Cuda
Topics: 3d, 3d-models, 3d-reconstruction, cuda, cuda-cpp, cuda-kernels
GPT inference in pure CUDA and C++
GitHub repository with 17 stars and 3 forks.
Trending score: 0.63; stars gained: +0; forks gained: +0.
Language: Cuda
Find the local LLM that actually runs and performs best on your hardware. Ranked by real, recency-aware benchmarks, not parameter count. One command, run it instantly.
GitHub repository with 4,787 stars and 263 forks.
Trending score: 4.13; stars gained: +62; forks gained: +5.
Language: Python
Topics: ai, cli, llm, local-llm, command-line-tool, gguf
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).
GitHub repository with 10,163 stars and 1,100 forks.
Trending score: 3.24; stars gained: +63; forks gained: +6.
Language: Python
Topics: cloud-computing, deep-learning, gpu, hyperparameter-tuning, machine-learning, tpu
Tensors and Dynamic neural networks in Python with strong GPU acceleration
GitHub repository with 100,772 stars and 28,025 forks.
Trending score: 3.14; stars gained: +24; forks gained: +6.
Language: Python
Topics: autograd, deep-learning, gpu, machine-learning, neural-network, numpy
cuda-oxide is an experimental Rust-to-CUDA compiler that lets you write (SIMT) GPU kernels in safe(ish), idiomatic Rust. It compiles standard Rust code directly to PTX — no DSLs, no foreign language bindings, just Rust.
GitHub repository with 2,756 stars and 182 forks.
Trending score: 2.83; stars gained: +25; forks gained: +4.
Language: Rust
Topics: async, compiler-backend, cuda, gpu, heterogeneous-computing, high-performance-computing
A cross-platform, safe, pure-Rust graphics API.
GitHub repository with 17,374 stars and 1,329 forks.
Trending score: 2.82; stars gained: +29; forks gained: +8.
Language: Rust
Topics: d3d12, gpu, hacktoberfest, metal, opengl, rust
Estimate whether a Hugging Face model fits and fine-tunes on your local GPU.
GitHub repository with 495 stars and 71 forks.
Trending score: 2.82; stars gained: +20; forks gained: +2.
Language: Python
Topics: bitsandbytes, fine-tuning, gpu, hugging-face, llm, lora