Cuda repositories

Discover trending Cuda repositories ranked by recent growth and activity.

  1. 1. alibaba/rtp-llm

    RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

    GitHub repository with 1,181 stars and 204 forks.

    Trending score: 1.09; stars gained: +9; forks gained: +0.

    Language: Cuda

    Topics: gpt, inference, llama, llm, llm-serving, llmops

  2. 2. lavawolfiee/mini-flash-attention

    Minimal FlashAttention in CUDA C++/CuTe: readable WMMA/CuTe kernels, no NxN workspace, up to 4.5x faster than naive PyTorch

    GitHub repository with 21 stars and 1 forks.

    Trending score: 1.02; stars gained: +9; forks gained: +1.

    Language: Cuda

    Topics: attention, cuda, cute, cutlass, flash-attention, flashattention

  3. 3. NVIDIA/CUDALibrarySamples

    CUDA Library Samples

    GitHub repository with 2,424 stars and 459 forks.

    Trending score: 0.79; stars gained: +5; forks gained: +1.

    Language: Cuda

    Topics: cufft, curand, cusolver, cusparse, nvjpeg, cudss

  4. 4. brucefan1983/GPUMD

    Graphics Processing Units Molecular Dynamics

    GitHub repository with 782 stars and 186 forks.

    Trending score: 0.69; stars gained: +4; forks gained: +2.

    Language: Cuda

    Topics: molecular-dynamics-simulation, heat-transport, cuda, molecular-dynamics, gpumd, phonon

  5. 5. rapidsai/cugraph

    cuGraph - RAPIDS Graph Analytics Library

    GitHub repository with 2,189 stars and 357 forks.

    Trending score: 0.49; stars gained: +2; forks gained: +0.

    Language: Cuda

    Topics: rapids, nvidia, gpu, cuda, graph, graph-algorithms

  6. 6. supranational/sppark

    Zero-knowledge template library

    GitHub repository with 219 stars and 97 forks.

    Trending score: 0.18; stars gained: +0; forks gained: +1.

    Language: Cuda

    Topics: cuda, bls12-377, bls12-381, pasta-curves, zero-knowledge, zero-knowledge-proofs

  7. 7. MetaX-MACA/mcoplib

    A high perfromance op kernel lib running on metax HD platform

    GitHub repository with 6 stars and 3 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Cuda

  8. 8. XopMC/brainflayer-CUDA

    "brainflayer" CUDA & private key recovery tool

    GitHub repository with 6 stars and 4 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Cuda

  9. 9. gau-nernst/learn-cuda

    Learn CUDA with PyTorch

    GitHub repository with 313 stars and 49 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Cuda

  10. 10. BoundlessWindMoon/minivllm

    A light, transparent, and modular inference & quantization engine for studying LLMs.

    GitHub repository with 9 stars and 0 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Cuda

    Topics: awq, cuda-graph, framework, megakernel, multi-backends, quantum-kernel

  11. 12. AmrMSharafeldin/semanticfoam

    This repository contains the official implementation of Semantic Foam: Unifying Spatial and Semantic Scene Decomposition

    GitHub repository with 10 stars and 0 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Cuda