Droidtown/ArticutAPI

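API of Articut, a Chinese word segmentation tool with semantic part-of-speech tagging. Word segmentation (斷詞, also called 分詞) is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it achieves an F1-measure above 94% and a recall above 96% on SIGHAN 2005.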

GitHub repository with 415 stars and 37 forks.

Language: Python

Topics: nlp, cws, nlu, pos-tagger, pos-tagging, part-of-speech-tagger, part-of-speech-embdding, artificial-intelligence, natural-language-processing, natural-language-understanding

24h trend summary

Trending score 0.04, activity score 0.04; stars gained and forks gained are unavailable (not enough history).

Latest metric snapshot

2026-06-05: 415 stars and 37 forks.

Similar repositories

  1. huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    GitHub repository with 161,286 stars and 33,414 forks.

    Trending score: 3.69; stars gained: +78; forks gained: +27.

    Language: Python

    Topics: nlp, natural-language-processing, pytorch, pytorch-transformers, transformer, model-hub

  2. fancyboi999/ai-engineering-from-scratch-zh

    Master AI engineering from scratch · 20 stages, 468 lessons · full Chinese translation plus companion site · a guide to becoming an Agent engineer

    GitHub repository with 34 stars and 12 forks.

    Trending score: 0.95; stars gained: +8; forks gained: +1.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, chinese, chinese-translation

  3. aaronsb/knowledge-graph-system

    Kappa Graph — κ(G). A semantic knowledge graph where knowledge has weight. Extracts concepts, measures grounding strength, preserves disagreement, traces everything to source.

    GitHub repository with 103 stars and 15 forks.

    Trending score: 0.81; stars gained: +5; forks gained: +2.

    Language: Python

    Topics: apache-age, concept-extraction, docker, fastapi, graph-database, knowledge-graph

  4. adbar/trafilatura

    Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

    GitHub repository with 6,047 stars and 379 forks.

    Trending score: 0.76; stars gained: +4; forks gained: +0.

    Language: Python

    Topics: web-scraping, text-extraction, nlp, html2text, text-mining, crawler

  5. xr843/fojin

    Buddhist Digital Text Platform — 9,200+ texts, 500+ sources, 8 UI languages, AI Q&A (RAG), knowledge graph, full-text search

    GitHub repository with 308 stars and 53 forks.

    Trending score: 0.66; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: ai, buddhism, chinese-classics, digital-humanities, docker, elasticsearch

  6. ATOM00blue/machine-learning-library

    A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.

    GitHub repository with 120 stars and 13 forks.

    Trending score: 0.58; stars gained: +3; forks gained: +1.

    Language: Python

    Topics: arxiv, corpus, dataset, deep-learning, education, llm

Trending in Python

  1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 181,098 stars and 31,072 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 12,594 stars and 817 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, claude-code, compression, context-engineering

  3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,254 stars and 2,241 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. open-webui/open-webui

    User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

    GitHub repository with 140,059 stars and 20,110 forks.

    Trending score: 5.04; stars gained: +317; forks gained: +58.

    Language: Python

    Topics: ollama, ollama-webui, llm, webui, self-hosted, llm-ui

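  5. ZhuLinsen/daily_stock_analysis

    LLM-driven intelligent analysis of A-share, Hong Kong, and US stocks: multi-source market data, real-time news, an LLM decision dashboard, and multi-channel push notifications, running on a schedule at zero cost. LLM-powered stock analysis system for A/H/US markets.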

    GitHub repository with 40,774 stars and 38,952 forks.

    Trending score: 4.88; stars gained: +836; forks gained: +443.

    Language: Python

    Topics: a-stock, ai-agent, aigc, llm, quant, quantitative-finance

  6. anthropics/financial-services

    GitHub repository with 29,960 stars and 4,217 forks.

    Trending score: 4.88; stars gained: +688; forks gained: +114.

    Language: Python

Trending topic: nlp

  1. huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    GitHub repository with 161,286 stars and 33,414 forks.

    Trending score: 3.69; stars gained: +78; forks gained: +27.

    Language: Python

    Topics: nlp, natural-language-processing, pytorch, pytorch-transformers, transformer, model-hub

  2. asgeirtj/system_prompts_leaks

    Extracted system prompts from Anthropic - Claude Code, Claude Design, Opus 4.8, Sonnet 4.6. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex, Google - Gemini - 3.5 Flash, 3.1 Pro, Antigravity, xAI - Grok, Cursor, Copilot, VS Code, Perplexity, and more. Updated regularly.

    GitHub repository with 41,256 stars and 6,839 forks.

    Trending score: 3.20; stars gained: +54; forks gained: +13.

    Language: JavaScript

    Topics: ai, anthropic, awesome, chatbot, chatgpt, claude

  3. deepset-ai/haystack

    Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

    GitHub repository with 25,459 stars and 2,829 forks.

    Trending score: 2.66; stars gained: +20; forks gained: +5.

    Language: MDX

    Topics: nlp, question-answering, pytorch, semantic-search, information-retrieval, summarization

  4. Ammaar-Alam/minebench

    Minecraft-style voxel benchmark for comparing AI models (Arena + Sandbox)

    GitHub repository with 244 stars and 17 forks.

    Trending score: 1.14; stars gained: +13; forks gained: +0.

    Language: TypeScript

    Topics: ai, benchmark, llm, nlp, voxel, comparison-benchmarks

  5. chrisvdweth/selene

    An open, large-scale, interactive textbook.

    GitHub repository with 161 stars and 16 forks.

    Trending score: 1.13; stars gained: +14; forks gained: +0.

    Language: Jupyter Notebook

    Topics: ai, computer-science, edcuational, learning-resources, data-science, deep-learning

  6. fancyboi999/ai-engineering-from-scratch-zh

    Master AI engineering from scratch · 20 stages, 468 lessons · full Chinese translation plus companion site · a guide to becoming an Agent engineer

    GitHub repository with 34 stars and 12 forks.

    Trending score: 0.95; stars gained: +8; forks gained: +1.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, chinese, chinese-translation