Sphere-AI-Lab/orbit

Stable and Efficient Reinforcement Learning for Trillion-Parameter LLMs

GitHub repository with 127 stars and 6 forks.

Language: Python

Topics: cuda, low-precision, peft, reinforcement-learning, transformers

Open provider repository

24h trend summary

Trending score 0.30, activity score 0.01, stars gained +1, forks gained +0.

Latest metric snapshot

2026-06-05: 127 stars and 6 forks.

Similar repositories

1. vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

GitHub repository with 81,948 stars and 17,658 forks.

Trending score: 3.75; stars gained: +79; forks gained: +46.

Language: Python

Topics: amd, blackwell, cuda, deepseek, deepseek-v3, gpt
2. gpustack/gpustack

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

GitHub repository with 5,102 stars and 541 forks.

Trending score: 2.51; stars gained: +11; forks gained: +1.

Language: Python

Topics: ascend, cuda, deepseek, distributed-inference, genai, inference
3. sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

GitHub repository with 28,875 stars and 6,336 forks.

Trending score: 1.72; stars gained: -55; forks gained: +18.

Language: Python

Topics: attention, blackwell, cuda, deepseek, diffusion, glm
4. NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

GitHub repository with 13,805 stars and 2,437 forks.

Trending score: 1.18; stars gained: +16; forks gained: +7.

Language: Python

Topics: blackwell, cuda, llm-serving, moe, pytorch
5. flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

GitHub repository with 5,744 stars and 1,026 forks.

Trending score: 1.16; stars gained: +15; forks gained: +8.

Language: Python

Topics: attention, cuda, distributed-inference, gpu, jit, large-large-models
6. NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

GitHub repository with 3,378 stars and 738 forks.

Trending score: 1.10; stars gained: +1; forks gained: +3.

Language: Python

Topics: cuda, deep-learning, fp4, fp8, gpu, jax

Trending in Python

1. NousResearch/hermes-agent

The agent that grows with you

GitHub repository with 180,988 stars and 31,047 forks.

Trending score: 5.95; stars gained: +1,867; forks gained: +361.

Language: Python

Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
2. chopratejas/headroom

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

GitHub repository with 12,420 stars and 807 forks.

Trending score: 5.69; stars gained: +2,829; forks gained: +175.

Language: Python

Topics: agent, ai, anthropic, claude-code, compression, context-engineering
3. Imbad0202/academic-research-skills

Academic Research Skills for Claude Code: research → write → review → revise → finalize

GitHub repository with 27,211 stars and 2,239 forks.

Trending score: 5.52; stars gained: +1,079; forks gained: +89.

Language: Python

Topics: academic-writing, ai-research, claude, claude-code, literature-review, peer-review
4. open-webui/open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

GitHub repository with 140,042 stars and 20,111 forks.

Trending score: 5.04; stars gained: +317; forks gained: +58.

Language: Python

Topics: ollama, ollama-webui, llm, webui, self-hosted, llm-ui
5. ZhuLinsen/daily_stock_analysis

LLM驱动的 A/H/美股智能分析：多数据源行情 + 实时新闻 + LLM决策仪表盘 + 多渠道推送，零成本定时运行，纯白嫖. LLM-powered stock analysis system for A/H/US markets.

GitHub repository with 40,750 stars and 38,939 forks.

Trending score: 4.88; stars gained: +836; forks gained: +443.

Language: Python

Topics: a-stock, ai-agent, aigc, llm, quant, quantitative-finance
6. anthropics/financial-services

GitHub repository with 29,960 stars and 4,217 forks.

Trending score: 4.88; stars gained: +688; forks gained: +114.

Language: Python

Sphere-AI-Lab/orbit

24h trend summary

Latest metric snapshot

Similar repositories

1. vllm-project/vllm

2. gpustack/gpustack

3. sgl-project/sglang

4. NVIDIA/TensorRT-LLM

5. flashinfer-ai/flashinfer

6. NVIDIA/TransformerEngine

Trending in Python

1. NousResearch/hermes-agent

2. chopratejas/headroom

3. Imbad0202/academic-research-skills

4. open-webui/open-webui

5. ZhuLinsen/daily_stock_analysis

6. anthropics/financial-services

Trending topic: cuda

1. vllm-project/vllm

2. gpustack/gpustack

3. Luce-Org/lucebox-hub

4. shader-slang/slang

5. tenstorrent/tt-metal

6. AmmarkoV/SAM3DBody-cpp