UKPLab/PeerQA

Code and Data for PeerQA: A Scientific Question Answering Dataset from Peer Reviews, NAACL 2025 https://aclanthology.org/2025.naacl-long.22/

GitHub repository with 14 stars and 4 forks.

Language: Python

Topics: dataset, information-retrieval, llm, nlp, peer-review, question-answering, rag, retrieval

Open provider repository

Latest metric snapshot

2026-06-05: 14 stars and 4 forks.

Similar repositories

  1. 1. emmanuelgjr/genai_incidents

    Single source of truth for GenAI and agentic AI security incidents, mapped to OWASP LLM Top 10, OWASP Agentic Top 10 (ASI), NIST AI RMF, and MITRE ATLAS.

    GitHub repository with 13 stars and 3 forks.

    Trending score: 0.87; stars gained: +6; forks gained: +1.

    Language: Python

    Topics: agentic-incidents, ai-incidents, ai-safety, cybersecurity, dataset, genai-incidents

  2. 2. ATOM00blue/machine-learning-library

    A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.

    GitHub repository with 120 stars and 13 forks.

    Trending score: 0.58; stars gained: +3; forks gained: +1.

    Language: Python

    Topics: arxiv, corpus, dataset, deep-learning, education, llm

  3. 3. juliensimon/space-datasets

    200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.

    GitHub repository with 7 stars and 1 forks.

    Trending score: 0.36; stars gained: +1; forks gained: +1.

    Language: Python

    Topics: asteroids, astronomy, dataset, esa, exoplanet, huggingface-datasets

  4. 4. COeXISTENCE-PROJECT/URB

    URB - Urban Routing Benchmark - Benchmarking MARL algorithms on the fleet routing tasks.

    GitHub repository with 16 stars and 9 forks.

    Trending score: 0.07; stars gained: +0; forks gained: +1.

    Language: Python

    Topics: benchmark, dataset, reinforcement-learning, reinforcement-learning-algorithms, sumo, routerl

  5. 5. JafarAkhondali/Morefixes

    MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery

    GitHub repository with 72 stars and 17 forks.

    Trending score: 0.07; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: hacktoberfest, commit, cve, dataset, fix, patch

  6. 6. Project-AgML/AgML

    AgML is a centralized framework for agricultural machine learning. AgML provides access to public agricultural datasets for common agricultural deep learning tasks, with standard benchmarks and pretrained models, as well the ability to generate synthetic data and annotations.

    GitHub repository with 281 stars and 45 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +1.

    Language: Python

    Topics: deep-learning, agriculture, pytorch, dataset, image-classification, object-detection

Trending in Python

  1. 1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 181,649 stars and 31,166 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. 2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 13,361 stars and 853 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. 3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,422 stars and 2,253 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. 4. anthropics/financial-services

    GitHub repository with 30,029 stars and 4,231 forks.

    Trending score: 4.88; stars gained: +688; forks gained: +114.

    Language: Python

  5. 5. virgiliojr94/book-to-skill

    Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

    GitHub repository with 4,250 stars and 534 forks.

    Trending score: 4.88; stars gained: +476; forks gained: +68.

    Language: Python

  6. 6. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,371 stars and 28,044 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools

Trending topic: dataset

  1. 1. emmanuelgjr/genai_incidents

    Single source of truth for GenAI and agentic AI security incidents, mapped to OWASP LLM Top 10, OWASP Agentic Top 10 (ASI), NIST AI RMF, and MITRE ATLAS.

    GitHub repository with 13 stars and 3 forks.

    Trending score: 0.87; stars gained: +6; forks gained: +1.

    Language: Python

    Topics: agentic-incidents, ai-incidents, ai-safety, cybersecurity, dataset, genai-incidents

  2. 2. mdn/browser-compat-data

    Browser compatibility data for Web technologies as displayed on MDN

    GitHub repository with 5,681 stars and 2,565 forks.

    Trending score: 0.63; stars gained: +2; forks gained: +2.

    Language: JSON

    Topics: compat, compatibility, data, dataset, json

  3. 3. ATOM00blue/machine-learning-library

    A hand-curated, topic-organized library of the best ML education — 923 docs (391 arXiv papers, 474 Stanford/MIT/Karpathy/fast.ai lectures, 58 explainer articles), normalized to Markdown with full provenance. Open it in Obsidian or point your agent at it. A clean ML corpus for learning, RAG & fine-tuning.

    GitHub repository with 120 stars and 13 forks.

    Trending score: 0.58; stars gained: +3; forks gained: +1.

    Language: Python

    Topics: arxiv, corpus, dataset, deep-learning, education, llm

  4. 4. commoncrawl/web-languages

    Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code

    GitHub repository with 69 stars and 93 forks.

    Trending score: 0.42; stars gained: +1; forks gained: +2.

    Topics: crawling, dataset, language-detection

  5. 5. juliensimon/space-datasets

    200+ auto-updated space, astronomy & physics datasets on Hugging Face (NASA, NOAA, ESA, JPL, SpaceX, Wikidata). Satellites, asteroids, space probes (Voyager, Cassini, Mars), space weather, exoplanets, pulsars, radio/X-ray surveys, cosmic rays, particle physics, and more. Parquet format, no API keys.

    GitHub repository with 7 stars and 1 forks.

    Trending score: 0.36; stars gained: +1; forks gained: +1.

    Language: Python

    Topics: asteroids, astronomy, dataset, esa, exoplanet, huggingface-datasets

  6. 6. samapriya/Earth-Engine-Datasets-List

    List of all datasets included in Google Earth Engine (generated from https://developers.google.com/earth-engine/datasets/catalog/)

    GitHub repository with 92 stars and 27 forks.

    Trending score: 0.32; stars gained: +1; forks gained: +0.

    Topics: earth-engine, dataset