jiajun-c/AI-infra-LearningNote
GitHub repository with 5 stars and 0 forks.
Language: Cuda
2026-06-05: 5 stars and 0 forks.
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
GitHub repository with 1,181 stars and 204 forks.
Trending score: 1.09; stars gained: +9; forks gained: +0.
Language: Cuda
Topics: gpt, inference, llama, llm, llm-serving, llmops
Minimal FlashAttention in CUDA C++/CuTe: readable WMMA/CuTe kernels, no NxN workspace, up to 4.5x faster than naive PyTorch
GitHub repository with 21 stars and 1 fork.
Trending score: 1.02; stars gained: +9; forks gained: +1.
Language: Cuda
Topics: attention, cuda, cute, cutlass, flash-attention, flashattention
CUDA Library Samples
GitHub repository with 2,424 stars and 459 forks.
Trending score: 0.79; stars gained: +5; forks gained: +1.
Language: Cuda
Topics: cufft, curand, cusolver, cusparse, nvjpeg, cudss
Graphics Processing Units Molecular Dynamics
GitHub repository with 782 stars and 186 forks.
Trending score: 0.69; stars gained: +4; forks gained: +2.
Language: Cuda
Topics: molecular-dynamics-simulation, heat-transport, cuda, molecular-dynamics, gpumd, phonon
cuGraph - RAPIDS Graph Analytics Library
GitHub repository with 2,189 stars and 357 forks.
Trending score: 0.49; stars gained: +2; forks gained: +0.
Language: Cuda
Topics: rapids, nvidia, gpu, cuda, graph, graph-algorithms
Zero-knowledge template library
GitHub repository with 219 stars and 97 forks.
Trending score: 0.18; stars gained: +0; forks gained: +1.
Language: Cuda
Topics: cuda, bls12-377, bls12-381, pasta-curves, zero-knowledge, zero-knowledge-proofs