liangdabiao/Multimodal-RAG

基于多模态 Embedding + Zilliz + Qwen 视觉理解的多模态 RAG 系统。支持 **Cohere / DashScope Embedding** 和 **DashScope / OpenRouter LLM** 双引擎切换。上传 PDF,用自然语言提问,系统自动检索最相关的页面并由 AI 生成回答。 与传统 RAG 不同,本系统**不做文本提取和 OCR**,而是直接将 PDF 页面当作图片处理,通过视觉 Embedding 模型编码,完整保留表格、图表、排版、手写批注等所有视觉信息。

GitHub repository with 33 stars and 7 forks.

Language: Python

Open provider repository

Latest metric snapshot

2026-06-04: 33 stars and 7 forks.

Similar repositories

  1. 1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 180,863 stars and 31,019 forks.

    Trending score: 5.79; stars gained: +1,360; forks gained: +322.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. 2. microsoft/SkillOpt

    SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

    GitHub repository with 4,890 stars and 487 forks.

    Trending score: 4.55; stars gained: +340; forks gained: +27.

    Language: Python

    Topics: agent-skills, self-evolving-agents

  3. 3. mukul975/Anthropic-Cybersecurity-Skills

    754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0

    GitHub repository with 13,233 stars and 1,551 forks.

    Trending score: 4.53; stars gained: +301; forks gained: +38.

    Language: Python

    Topics: ai-agents, claude-code, cybersecurity, incident-response, mitre-attack, penetration-testing

  4. 4. virgiliojr94/book-to-skill

    Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

    GitHub repository with 4,166 stars and 523 forks.

    Trending score: 4.43; stars gained: +415; forks gained: +37.

    Language: Python

  5. 5. anthropics/claude-code

    Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.

    GitHub repository with 130,153 stars and 21,149 forks.

    Trending score: 4.42; stars gained: +277; forks gained: +38.

    Language: Python

  6. 6. CloakHQ/CloakBrowser

    Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

    GitHub repository with 23,119 stars and 1,836 forks.

    Trending score: 4.24; stars gained: +250; forks gained: +17.

    Language: Python

    Topics: anti-detect, bot-detection, browser-automation, chromium, cloudflare, fingerprint

Trending in Python

  1. 1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 180,863 stars and 31,019 forks.

    Trending score: 5.79; stars gained: +1,360; forks gained: +322.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. 2. microsoft/SkillOpt

    SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

    GitHub repository with 4,890 stars and 487 forks.

    Trending score: 4.55; stars gained: +340; forks gained: +27.

    Language: Python

    Topics: agent-skills, self-evolving-agents

  3. 3. mukul975/Anthropic-Cybersecurity-Skills

    754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0

    GitHub repository with 13,233 stars and 1,551 forks.

    Trending score: 4.53; stars gained: +301; forks gained: +38.

    Language: Python

    Topics: ai-agents, claude-code, cybersecurity, incident-response, mitre-attack, penetration-testing

  4. 4. virgiliojr94/book-to-skill

    Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

    GitHub repository with 4,166 stars and 523 forks.

    Trending score: 4.43; stars gained: +415; forks gained: +37.

    Language: Python

  5. 5. anthropics/claude-code

    Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.

    GitHub repository with 130,153 stars and 21,149 forks.

    Trending score: 4.42; stars gained: +277; forks gained: +38.

    Language: Python

  6. 6. CloakHQ/CloakBrowser

    Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

    GitHub repository with 23,119 stars and 1,836 forks.

    Trending score: 4.24; stars gained: +250; forks gained: +17.

    Language: Python

    Topics: anti-detect, bot-detection, browser-automation, chromium, cloudflare, fingerprint