preligens-lab/textnoisr

Add random noise to a text dataset while precisely controlling the quality of the result

GitHub repository with 20 stars and 3 forks.

Language: Python

Topics: nlp, noise, ocr

Latest metric snapshot

2026-06-05: 20 stars and 3 forks.

Similar repositories

  1. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 28,711 stars and 4,695 forks.

    Trending score: 5.32; stars gained: +1,261; forks gained: +238.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  2. huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    GitHub repository with 161,319 stars and 33,423 forks.

    Trending score: 3.69; stars gained: +78; forks gained: +27.

    Language: Python

    Topics: audio, deep-learning, deepseek, gemma, glm, hacktoberfest

  3. fancyboi999/ai-engineering-from-scratch-zh

    Master AI engineering from scratch · 20 stages, 468 lessons · Full Chinese translation + companion site · A guide to becoming an Agent engineer

    GitHub repository with 39 stars and 12 forks.

    Trending score: 0.95; stars gained: +8; forks gained: +1.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, chinese, chinese-translation

  4. huggingface/datasets

    🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

    GitHub repository with 21,583 stars and 3,237 forks.

    Trending score: 0.88; stars gained: +7; forks gained: +5.

    Language: Python

    Topics: ai, artificial-intelligence, computer-vision, dataset-hub, datasets, deep-learning

  5. aaronsb/knowledge-graph-system

    Kappa Graph — κ(G). A semantic knowledge graph where knowledge has weight. Extracts concepts, measures grounding strength, preserves disagreement, traces everything to source.

    GitHub repository with 103 stars and 15 forks.

    Trending score: 0.81; stars gained: +5; forks gained: +2.

    Language: Python

    Topics: apache-age, concept-extraction, docker, fastapi, graph-database, knowledge-graph

  6. RaguTeam/RAGU

    Codebase for the implementation of a graph-RAG system

    GitHub repository with 36 stars and 12 forks.

    Trending score: 0.78; stars gained: +4; forks gained: +0.

    Language: Python

    Topics: graph, knowledge-graph, llm, rag, nlp

Trending in Python

  1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 182,353 stars and 31,271 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 14,053 stars and 885 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,616 stars and 2,272 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 28,711 stars and 4,695 forks.

    Trending score: 5.32; stars gained: +1,261; forks gained: +238.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  5. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,433 stars and 28,046 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, collections, python, python-frameworks, python-libraries, python-tools

  6. Alishahryar1/free-claude-code

    Use claude-code for free in the terminal, the VSCode extension, or Discord, like OpenClaw (voice supported)

    GitHub repository with 32,540 stars and 4,942 forks.

    Trending score: 4.56; stars gained: +467; forks gained: +82.

    Language: Python

Trending topic: nlp

  1. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 28,711 stars and 4,695 forks.

    Trending score: 5.32; stars gained: +1,261; forks gained: +238.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  2. huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    GitHub repository with 161,319 stars and 33,423 forks.

    Trending score: 3.69; stars gained: +78; forks gained: +27.

    Language: Python

    Topics: audio, deep-learning, deepseek, gemma, glm, hacktoberfest

  3. Ammaar-Alam/minebench

    Minecraft-style voxel benchmark for comparing AI models (Arena + Sandbox)

    GitHub repository with 245 stars and 17 forks.

    Trending score: 1.14; stars gained: +13; forks gained: +0.

    Language: TypeScript

    Topics: ai, benchmark, llm, nlp, voxel, comparison-benchmarks

  4. fancyboi999/ai-engineering-from-scratch-zh

    Master AI engineering from scratch · 20 stages, 468 lessons · Full Chinese translation + companion site · A guide to becoming an Agent engineer

    GitHub repository with 39 stars and 12 forks.

    Trending score: 0.95; stars gained: +8; forks gained: +1.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, chinese, chinese-translation

  5. huggingface/datasets

    🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

    GitHub repository with 21,583 stars and 3,237 forks.

    Trending score: 0.88; stars gained: +7; forks gained: +5.

    Language: Python

    Topics: ai, artificial-intelligence, computer-vision, dataset-hub, datasets, deep-learning

  6. antoniolupetti/algebrica

    Algebrica is a free and open mathematical knowledge base dedicated to clarity, structure, and conceptual coherence.

    GitHub repository with 714 stars and 37 forks.

    Trending score: 0.88; stars gained: +7; forks gained: +0.

    Topics: ai, mathematics, ml, nlp