Furyton/awesome-language-model-analysis

This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.

GitHub repository with 100 stars and 2 forks.

Language: Python

Topics: analytics, awesome, llm, transformers, ai, analysis, chatgpt, deep-learning, generative-ai, large-language-models

Open provider repository

Latest metric snapshot

2026-06-05: 100 stars and 2 forks.

Similar repositories

  1. 1. PostHog/posthog

    🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.

    GitHub repository with 34,872 stars and 2,826 forks.

    Trending score: 2.85; stars gained: +44; forks gained: +8.

    Language: Python

    Topics: ab-testing, ai-analytics, analytics, cdp, data-warehouse, experiments

  2. 2. jupyter-naas/abi

    AI Operating System - Build your own AI using ontologies as the unifying field connecting data, models, workflows, and systems.

    GitHub repository with 126 stars and 38 forks.

    Trending score: 0.98; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: abi, ai, aiagentic, aiagentsframework, analytics, artificial-intelligence

  3. 3. getredash/redash

    Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

    GitHub repository with 28,616 stars and 4,601 forks.

    Trending score: 0.88; stars gained: +3; forks gained: +1.

    Language: Python

    Topics: redash, python, visualization, analytics, bi, redshift

  4. 4. savvina-ai/savvina

    Self-hosted NL-to-SQL analytics — query your database with plain English

    GitHub repository with 9 stars and 1 forks.

    Trending score: 0.71; stars gained: +4; forks gained: +0.

    Language: Python

    Topics: analytics, docker, fastapi, llm, mysql, natural-language-sql

  5. 5. zhongyu09/openchatbi

    OpenChatBI is an intelligent chat-based BI tool powered by large language models, designed to help users query, analyze, and visualize data through natural language conversations. It uses LangGraph and LangChain to build chat agent and workflows that support natural language to SQL conversion and data analysis.

    GitHub repository with 571 stars and 74 forks.

    Trending score: 0.60; stars gained: +3; forks gained: +1.

    Language: Python

    Topics: agent, ai, analytics, bi, database, datawarehouse

  6. 6. fedbiomed/fedbiomed

    A collaborative learning framework for empowering biomedical research

    GitHub repository with 86 stars and 16 forks.

    Trending score: 0.33; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: ai, analytics, biomedical, clinical, collaborative, data-science

Trending in Python

  1. 1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 182,353 stars and 31,271 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. 2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 14,053 stars and 885 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. 3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,613 stars and 2,272 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. 4. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 28,711 stars and 4,695 forks.

    Trending score: 5.32; stars gained: +1,261; forks gained: +238.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  5. 5. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,427 stars and 28,046 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools

  6. 6. Alishahryar1/free-claude-code

    Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

    GitHub repository with 32,539 stars and 4,943 forks.

    Trending score: 4.56; stars gained: +467; forks gained: +82.

    Language: Python

Trending topic: analytics

  1. 1. duckdb/duckdb

    DuckDB is an analytical in-process SQL database management system

    GitHub repository with 38,625 stars and 3,301 forks.

    Trending score: 3.50; stars gained: +40; forks gained: +6.

    Language: C++

    Topics: analytics, database, embedded-database, olap, sql

  2. 2. ClickHouse/ClickHouse

    ClickHouse® is a real-time analytics database management system

    GitHub repository with 47,840 stars and 8,471 forks.

    Trending score: 2.96; stars gained: +53; forks gained: +10.

    Language: C++

    Topics: ai, analytics, big-data, clickhouse, cloud-native, cpp

  3. 3. PostHog/posthog

    🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, data warehouse, a CDP, and an AI product assistant to help debug your code, ship features faster, and keep all your usage and customer data in one stack.

    GitHub repository with 34,872 stars and 2,826 forks.

    Trending score: 2.85; stars gained: +44; forks gained: +8.

    Language: Python

    Topics: ab-testing, ai-analytics, analytics, cdp, data-warehouse, experiments

  4. 4. Kaelio/ktx

    ktx is an executable context layer for data and analytics agents 🐙 Allow Claude Code, Codex, and any AI agent to query data accurately through MCP with skills, memory and a semantic layer

    GitHub repository with 895 stars and 46 forks.

    Trending score: 2.76; stars gained: +24; forks gained: +1.

    Language: TypeScript

    Topics: agent, agent-skills, agents, ai-agent, ai-agents, analytics

  5. 5. metabase/metabase

    The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data :bar_chart:

    GitHub repository with 47,577 stars and 6,518 forks.

    Trending score: 2.57; stars gained: +20; forks gained: +7.

    Language: Clojure

    Topics: analytics, bi, business-intelligence, businessintelligence, clojure, dashboard

  6. 6. langwatch/langwatch

    The platform for LLM evaluations and AI agent testing

    GitHub repository with 3,288 stars and 320 forks.

    Trending score: 2.05; stars gained: +6; forks gained: +0.

    Language: TypeScript

    Topics: ai, analytics, datasets, dspy, evaluation, gpt