bsc-mem/Mess-2.0
The Mess benchmark, redesigned. A C++ framework for generating bandwidth–latency curves to characterize memory systems.
GitHub repository with 8 stars and 1 forks.
Language: C++
Topics: benchmark, memory-system, performance
The Mess benchmark, redesigned. A C++ framework for generating bandwidth–latency curves to characterize memory systems.
GitHub repository with 8 stars and 1 forks.
Language: C++
Topics: benchmark, memory-system, performance
2026-06-05: 8 stars and 1 forks.
A microbenchmark support library
GitHub repository with 10,221 stars and 1,772 forks.
Trending score: 0.84; stars gained: +5; forks gained: -1.
Language: C++
Topics: benchmark
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
GitHub repository with 459 stars and 73 forks.
Trending score: 0.17; stars gained: +0; forks gained: +0.
Language: C++
Topics: benchmark, cuda, gpu, hip, opencl, openmp
Cache-aware frequency sort: a header-only C++20 sort for low-cardinality integer arrays on x86-64.
GitHub repository with 21 stars and 0 forks.
Trending score: 0.01; stars gained: +0; forks gained: +0.
Language: C++
Topics: avx2, benchmark, branchless, cache-aware, cpp, cpp20
🚀 Fast prime number generator
GitHub repository with 1,093 stars and 135 forks.
Trending score: 0.01; stars gained: +0; forks gained: +0.
Language: C++
Topics: prime-numbers, sieve-of-eratosthenes, math, eratosthenes, primes, sieve
LLM inference in C/C++
GitHub repository with 114,737 stars and 19,197 forks.
Trending score: 4.40; stars gained: +304; forks gained: +99.
Language: C++
Topics: ggml
DuckDB is an analytical in-process SQL database management system
GitHub repository with 38,621 stars and 3,299 forks.
Trending score: 3.50; stars gained: +40; forks gained: +6.
Language: C++
Topics: analytics, database, embedded-database, olap, sql
Community maintained hardware plugin for vLLM on Ascend
GitHub repository with 2,201 stars and 1,350 forks.
Trending score: 3.25; stars gained: +16; forks gained: +22.
Language: C++
Topics: ascend, inference, llm, llm-serving, llmops, mlops
:electron: Build cross-platform desktop apps with JavaScript, HTML, and CSS
GitHub repository with 121,543 stars and 17,235 forks.
Trending score: 3.02; stars gained: +16; forks gained: +2.
Language: C++
Topics: electron, javascript, c-plus-plus, html, css, chrome
ClickHouse® is a real-time analytics database management system
GitHub repository with 47,833 stars and 8,469 forks.
Trending score: 2.96; stars gained: +53; forks gained: +10.
Language: C++
Topics: ai, analytics, big-data, clickhouse, cloud-native, cpp
Truly independent web browser
GitHub repository with 63,763 stars and 3,075 forks.
Trending score: 2.89; stars gained: +52; forks gained: +5.
Language: C++
Topics: browser, browser-engine
MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research · 浏览器里运行的安卓模拟器 · Browser-hosted Android Simulator · Verifiable Evaluation · Scalable Online RL Training
GitHub repository with 519 stars and 82 forks.
Trending score: 3.00; stars gained: +33; forks gained: +4.
Language: TypeScript
Topics: agent, agents, ai, android, automation, benchmark
🔍 The hardest search benchmark in the wild — vague, multi-turn, proactive. 200 long-horizon tasks with persona-driven progressive disclosure, scored by verifiable schema-free knowledge-graph evaluation. No vibes, just triplet F1.
GitHub repository with 780 stars and 9 forks.
Trending score: 1.88; stars gained: +102; forks gained: +0.
Language: Python
Topics: agentic-ai, benchmark, llm, proactive-agent, search, search-agent
Minecraft-style voxel benchmark for comparing AI models (Arena + Sandbox)
GitHub repository with 244 stars and 17 forks.
Trending score: 1.14; stars gained: +13; forks gained: +0.
Language: TypeScript
Topics: ai, benchmark, llm, nlp, voxel, comparison-benchmarks
AMD Strix Halo local LLM guide: direct 100.0 t/s 30B Qwen MoE on Ryzen AI MAX+ 395 / Radeon 8060S. Setup, benchmarks, raw evidence.
GitHub repository with 91 stars and 4 forks.
Trending score: 0.98; stars gained: +7; forks gained: +0.
Language: Python
Topics: amd, benchmark, gfx1151, inference, llama-cpp, llm
τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
GitHub repository with 1,273 stars and 328 forks.
Trending score: 0.92; stars gained: +7; forks gained: +1.
Language: Python
Topics: benchmark, llm, ai, language-model-agent, conversational-agents
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
GitHub repository with 7,061 stars and 784 forks.
Trending score: 0.91; stars gained: +4; forks gained: +1.
Language: Python
Topics: benchmark, chatgpt, evaluation, large-language-model, llama2, llama3