alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

GitHub repository with 1,181 stars and 204 forks.

Language: Cuda

Topics: gpt, inference, llama, llm, llm-serving, llmops, model-serving

Open provider repository

24h trend summary

Trending score 1.09, activity score 0.85, stars gained +9, forks gained +0.

Latest metric snapshot

2026-06-05: 1,181 stars and 204 forks.

Similar repositories

  1. 1. alibaba/rtp-llm

    RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

    GitHub repository with 1,181 stars and 204 forks.

    Trending score: 1.09; stars gained: +9; forks gained: +0.

    Language: Cuda

    Topics: gpt, inference, llama, llm, llm-serving, llmops

Trending in Cuda

  1. 1. alibaba/rtp-llm

    RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

    GitHub repository with 1,181 stars and 204 forks.

    Trending score: 1.09; stars gained: +9; forks gained: +0.

    Language: Cuda

    Topics: gpt, inference, llama, llm, llm-serving, llmops

  2. 2. lavawolfiee/mini-flash-attention

    Minimal FlashAttention in CUDA C++/CuTe: readable WMMA/CuTe kernels, no NxN workspace, up to 4.5x faster than naive PyTorch

    GitHub repository with 21 stars and 1 forks.

    Trending score: 1.02; stars gained: +9; forks gained: +1.

    Language: Cuda

    Topics: attention, cuda, cute, cutlass, flash-attention, flashattention

  3. 3. NVIDIA/CUDALibrarySamples

    CUDA Library Samples

    GitHub repository with 2,424 stars and 459 forks.

    Trending score: 0.79; stars gained: +5; forks gained: +1.

    Language: Cuda

    Topics: cufft, curand, cusolver, cusparse, nvjpeg, cudss

  4. 4. brucefan1983/GPUMD

    Graphics Processing Units Molecular Dynamics

    GitHub repository with 782 stars and 186 forks.

    Trending score: 0.69; stars gained: +4; forks gained: +2.

    Language: Cuda

    Topics: molecular-dynamics-simulation, heat-transport, cuda, molecular-dynamics, gpumd, phonon

  5. 5. rapidsai/cugraph

    cuGraph - RAPIDS Graph Analytics Library

    GitHub repository with 2,189 stars and 357 forks.

    Trending score: 0.49; stars gained: +2; forks gained: +0.

    Language: Cuda

    Topics: rapids, nvidia, gpu, cuda, graph, graph-algorithms

  6. 6. supranational/sppark

    Zero-knowledge template library

    GitHub repository with 219 stars and 97 forks.

    Trending score: 0.18; stars gained: +0; forks gained: +1.

    Language: Cuda

    Topics: cuda, bls12-377, bls12-381, pasta-curves, zero-knowledge, zero-knowledge-proofs

Trending topic: gpt

  1. 1. langgenius/dify

    Production-ready platform for agentic workflow development.

    GitHub repository with 143,961 stars and 22,652 forks.

    Trending score: 4.60; stars gained: +322; forks gained: +54.

    Language: TypeScript

    Topics: agent, agentic-ai, agentic-framework, agentic-workflow, ai, automation

  2. 2. lobehub/lobehub

    🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire AI team.

    GitHub repository with 78,213 stars and 15,373 forks.

    Trending score: 4.03; stars gained: +79; forks gained: +2.

    Language: TypeScript

    Topics: agent, agent-collaboration, agent-harness, ai, cao, chatgpt

  3. 3. OpenHands/OpenHands

    🙌 OpenHands: AI-Driven Development

    GitHub repository with 75,883 stars and 9,631 forks.

    Trending score: 3.82; stars gained: +164; forks gained: +28.

    Language: Python

    Topics: agent, artificial-intelligence, llm, chatgpt, claude-ai, cli

  4. 4. vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    GitHub repository with 81,986 stars and 17,669 forks.

    Trending score: 3.75; stars gained: +79; forks gained: +46.

    Language: Python

    Topics: amd, blackwell, cuda, deepseek, deepseek-v3, gpt

  5. 5. Ontos-AI/knowhere

    Knowhere extracts, parses, and outputs structured chunks ready for AI Agents and RAG.

    GitHub repository with 956 stars and 96 forks.

    Trending score: 3.67; stars gained: +90; forks gained: +8.

    Language: Python

    Topics: agent, ai-agents, chromadb, claude, claude-code, cursor

  6. 6. alistaitsacle/free-llm-api-keys

    Free LLM API keys for GPT-5.5, Claude, DeepSeek, Gemini, Grok — copy, paste, use. Updated 3-5x daily. No credit card needed.

    GitHub repository with 1,539 stars and 157 forks.

    Trending score: 3.38; stars gained: +67; forks gained: +6.

    Language: Python

    Topics: ai, api-key, api-keys, chatgpt, claude, deepseek