datawhalechina/all-in-rag

🔍大模型应用开发实战一:RAG 技术全栈指南,在线阅读地址:https://datawhalechina.github.io/all-in-rag/

GitHub repository with 8,583 stars and 4,285 forks.

Language: Python

Topics: embedding, kimi-k2, langchain, llama-index, llm, milvus, multimodal, rag, ai, neo4j

Open provider repository

Latest metric snapshot

2026-06-15: 8,583 stars and 4,285 forks.

Similar repositories

  1. 1. modelscope/ms-swift

    Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

    GitHub repository with 14,512 stars and 1,480 forks.

    Trending score: 2.22; stars gained: +6; forks gained: +0.

    Language: Python

    Topics: deepseek-r1, embedding, grpo, internvl, liger, llama

  2. 2. memtomem/memtomem

    Markdown-first, long-term memory infrastructure for AI agents. Hybrid BM25 + semantic search across markdown/code files via MCP.

    GitHub repository with 7 stars and 24 forks.

    Trending score: 0.56; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: agent, agent-harness, agent-memory, ai, bm25, claude

  3. 3. EngramMemory/engram-memory

    The highest-scoring AI memory system ever benchmarked that isn't reliant on LLM reranking. And it's free & burns less tokens.

    GitHub repository with 32 stars and 4 forks.

    Trending score: 0.49; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: ai, ai-tools, embedded-systems, embedding, embedding-vectors, mcp

  4. 4. arun1729/cog

    Embedded Graph Database for Python. Lives inside your Python process. Quick setup. No server. Runs in notebooks, apps, even your browser.

    GitHub repository with 364 stars and 35 forks.

    Trending score: 0.40; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: nosql, python, graph-database, graph, linkeddata, network-graph

  5. 5. jaylfc/taosmd

    Local-first AI memory — runs offline on any machine with 8 GB+ RAM (SBC, mini PC, laptop, workstation). Zero-loss verbatim archive, knowledge graph, hybrid retrieval. Framework-agnostic, no cloud.

    GitHub repository with 45 stars and 2 forks.

    Trending score: 0.24; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: ai-memory, arm, edge-ai, embedding, framework-agnostic, knowledge-graph

  6. 6. PlateerLab/synaptic-memory

    Brain-inspired knowledge graph: spreading activation, Hebbian learning, memory consolidation.

    GitHub repository with 29 stars and 1 forks.

    Trending score: 0.07; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: ai-agent, embedding, graph-database, hebbian-learning, knowledge-graph, llm

Trending in Python

  1. 1. harry0703/MoneyPrinterTurbo

    利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

    GitHub repository with 88,031 stars and 12,625 forks.

    Trending score: 6.02; stars gained: +1,097; forks gained: +218.

    Language: Python

    Topics: ai, automation, chatgpt, moviepy, python, shortvideo

  2. 2. pewdiepie-archdaemon/odysseus

    Self-hosted AI workspace.

    GitHub repository with 71,459 stars and 9,112 forks.

    Trending score: 5.98; stars gained: +834; forks gained: +140.

    Language: Python

  3. 3. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 194,120 stars and 33,992 forks.

    Trending score: 5.92; stars gained: +753; forks gained: +209.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  4. 4. NVIDIA/SkillSpector

    Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.

    GitHub repository with 5,962 stars and 441 forks.

    Trending score: 5.61; stars gained: +874; forks gained: +76.

    Language: Python

  5. 5. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 32,676 stars and 5,366 forks.

    Trending score: 5.59; stars gained: +762; forks gained: +135.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  6. 6. Agents365-ai/drawio-skill

    Generate draw.io diagrams from natural language — 6 presets, vision self-check + up to 5-round refinement, codebase-to-diagram, 10,000+ official shapes & 321 AI/LLM brand logos. Exports PNG/SVG/PDF/JPG.

    GitHub repository with 3,445 stars and 240 forks.

    Trending score: 5.51; stars gained: +1,369; forks gained: +113.

    Language: Python

    Topics: agent-skill, agent-skills, architecture-diagram, claude-code, claude-code-skill, claude-skills

Trending topic: embedding

  1. 1. TencentCloud/TencentDB-Agent-Memory

    TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.

    GitHub repository with 5,756 stars and 495 forks.

    Trending score: 4.80; stars gained: +386; forks gained: +29.

    Language: TypeScript

    Topics: agent, llm, memory, openclaw-plugin, ai-agent, embedding

  2. 2. CodeBendKit/codeseek

    Rust-powered code intelligence CLI for AI coding agents. Builds call graphs and hybrid semantic search indexes (Dense + Sparse + RRF + Reranker) across 7 languages. Ships as native MCP tools for Claude Code and Codex CLI.

    GitHub repository with 135 stars and 11 forks.

    Trending score: 2.65; stars gained: +19; forks gained: +2.

    Language: Rust

    Topics: bm25, c-li, call-graph, claude-code, cli, code-analysis

  3. 3. modelscope/ms-swift

    Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

    GitHub repository with 14,512 stars and 1,480 forks.

    Trending score: 2.22; stars gained: +6; forks gained: +0.

    Language: Python

    Topics: deepseek-r1, embedding, grpo, internvl, liger, llama

  4. 4. infiniflow/infinity

    The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

    GitHub repository with 4,568 stars and 425 forks.

    Trending score: 1.33; stars gained: +3; forks gained: +0.

    Language: C++

    Topics: ai-native, approximate-nearest-neighbor-search, bm25, cpp20, cpp20-modules, embedding

  5. 5. nirholas/three.ws

    Open-source 3D AI agent framework — GLB/glTF avatars with LLM brains, memory, emotions, and autonomous payments. MCP server · x402 · Solana/EVM · Three.js. Embed anywhere as a web component. Character studio, animation gallery, OAuth 2.1. Browser-native.

    GitHub repository with 60 stars and 16 forks.

    Trending score: 0.98; stars gained: +1; forks gained: +0.

    Language: JavaScript

    Topics: 3d, ai-agent, animation, avatar, blockchain, character-studio

  6. 6. opensolon/solon-ai

    Java AI application development framework (supports LLM-tool,skill; RAG; MCP; Agent-ReAct,Team-Agent). Compatible with java8 ~ java25. It can also be embedded in SpringBoot, jFinal, Vert.x, Quarkus, and other frameworks.

    GitHub repository with 403 stars and 60 forks.

    Trending score: 0.97; stars gained: +2; forks gained: +1.

    Language: Java

    Topics: ai, chat, deepseek, embedding, function-call, java