open-circle/schema-benchmarks
Transparent comparisons between schema validation libraries
GitHub repository with 44 stars and 6 forks.
Language: TypeScript
Topics: benchmark, javascript, library, schema, typescript
2026-06-05: 44 stars and 6 forks.
MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research · Android emulator that runs in the browser · Browser-hosted Android Simulator · Verifiable Evaluation · Scalable Online RL Training
GitHub repository with 519 stars and 82 forks.
Trending score: 3.00; stars gained: +33; forks gained: +4.
Language: TypeScript
Topics: agent, agents, ai, android, automation, benchmark
Minecraft-style voxel benchmark for comparing AI models (Arena + Sandbox)
GitHub repository with 244 stars and 17 forks.
Trending score: 1.14; stars gained: +13; forks gained: +0.
Language: TypeScript
Topics: ai, benchmark, llm, nlp, voxel, comparison-benchmarks
🏆 #1 on LLM routing benchmark · Cheapest LLM router with memory · Open-source parallel multi-LLM execution across 47+ providers
GitHub repository with 10 stars and 0 forks.
Trending score: 0.33; stars gained: +1; forks gained: +0.
Language: TypeScript
Topics: ai-gateway, api-gateway, artificial-intelligence, benchmark, circuit-breaker, cli-tool
Rank your GitHub R&D org against @steipete in the absurd unit of one Peter. Next.js 15 + React 19.
GitHub repository with 8 stars and 0 forks.
Trending score: 0.30; stars gained: +1; forks gained: +0.
Language: TypeScript
Topics: benchmark, developer-tools, dx, github-api, nextjs, react
📊 Benchmark Comparison of Packages with Runtime Validation and TypeScript Support
GitHub repository with 824 stars and 88 forks.
Trending score: 0.18; stars gained: +0; forks gained: +0.
Language: TypeScript
Topics: typescript, types, benchmarks, validation, benchmark, json
Benchmark-driven model routing for OpenClaw with data-driven policy tuning. Inspired by karpathy/autoresearch.
GitHub repository with 5 stars and 3 forks.
Trending score: 0.05.
Language: TypeScript
Topics: ai-agent, llm, model-routing, multi-model, openclaw, autoresearch
🎨 Local-first, open-source Claude Design alternative. 🖥️ Native desktop app. ⚡ 259+ Skills · ✨ 142+ Design Systems 🖼️ Web · desktop · mobile prototypes · slides · images · videos · HyperFrames 📦 Sandboxed preview · HTML/PDF/PPTX/MP4 export 🤖 Claude Code / OpenClaw / Codex / Cursor / OpenCode / Qwen / Copilot / Hermes / Kimi & 17+ CLIs.
GitHub repository with 59,176 stars and 6,657 forks.
Trending score: 5.98; stars gained: +1,178; forks gained: +117.
Language: TypeScript
Topics: agent-skills, ai-agents, ai-design, byok, claude-code-for-design, claude-design
Pre-indexed code knowledge graph for Claude Code, Codex, Gemini, Cursor, OpenCode, AntiGravity, Kiro, and Hermes Agent — fewer tokens, fewer tool calls, 100% local
GitHub repository with 41,600 stars and 2,573 forks.
Trending score: 5.83; stars gained: +2,953; forks gained: +188.
Language: TypeScript
⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
GitHub repository with 10,639 stars and 890 forks.
Trending score: 4.82; stars gained: +560; forks gained: +62.
Language: TypeScript
Topics: ai-agent, ai-coding-agent, anthropic, bun, claude, cli
The API to search, scrape, and interact with the web at scale. 🔥
GitHub repository with 128,853 stars and 7,670 forks.
Trending score: 4.80; stars gained: +954; forks gained: +49.
Language: TypeScript
Topics: ai, ai-agents, ai-crawler, ai-scraping, ai-search, crawler
🌊 The leading agent meta-harness for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features adaptive memory, self-learning swarm intelligence, RAG integration, and native Claude Code / Codex Integration
GitHub repository with 57,954 stars and 6,626 forks.
Trending score: 4.76; stars gained: +401; forks gained: +52.
Language: TypeScript
Topics: claude-code, swarm, agentic-ai, agentic-framework, agentic-rag, agentic-workflow
Write HTML. Render video. Built for agents.
GitHub repository with 24,519 stars and 2,280 forks.
Trending score: 4.72; stars gained: +732; forks gained: +60.
Language: TypeScript
Topics: ai, animation, ffmpeg, framework, gsap, html
🔍 The hardest search benchmark in the wild — vague, multi-turn, proactive. 200 long-horizon tasks with persona-driven progressive disclosure, scored by verifiable schema-free knowledge-graph evaluation. No vibes, just triplet F1.
GitHub repository with 780 stars and 9 forks.
Trending score: 1.88; stars gained: +102; forks gained: +0.
Language: Python
Topics: agentic-ai, benchmark, llm, proactive-agent, search, search-agent
AMD Strix Halo local LLM guide: direct 100.0 t/s 30B Qwen MoE on Ryzen AI MAX+ 395 / Radeon 8060S. Setup, benchmarks, raw evidence.
GitHub repository with 91 stars and 4 forks.
Trending score: 0.98; stars gained: +7; forks gained: +0.
Language: Python
Topics: amd, benchmark, gfx1151, inference, llama-cpp, llm
τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
GitHub repository with 1,273 stars and 328 forks.
Trending score: 0.92; stars gained: +7; forks gained: +1.
Language: Python
Topics: benchmark, llm, ai, language-model-agent, conversational-agents
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
GitHub repository with 7,061 stars and 784 forks.
Trending score: 0.91; stars gained: +4; forks gained: +1.
Language: Python
Topics: benchmark, chatgpt, evaluation, large-language-model, llama2, llama3