qualcomm/aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

GitHub repository with 2,633 stars and 451 forks.

Language: Python

Topics: quantization, deep-learning, compression, open-source, machine-learning, pruning, auto-ml, network-compression, deep-neural-networks, network-quantization


24h trend summary

Trending score 0.88, activity score 1.88, stars gained +3, forks gained +0.

Latest metric snapshot

2026-06-05: 2,633 stars and 451 forks.

Similar repositories

  1. RyanCodrai/turbovec

    A vector index built on TurboQuant, written in Rust with Python bindings

    GitHub repository with 4,086 stars and 389 forks.

    Trending score: 1.70; stars gained: +63; forks gained: +9.

    Language: Python

    Topics: ann, avx512, embeddings, faiss, nearest-neighbor, neon

  2. hiyouga/LlamaFactory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    GitHub repository with 71,901 stars and 8,787 forks.

    Trending score: 1.30; stars gained: +22; forks gained: +2.

    Language: Python

    Topics: fine-tuning, llama, llm, peft, transformers, rlhf

  3. huawei-csl/KVarN

    KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

    GitHub repository with 239 stars and 9 forks.

    Trending score: 1.14; stars gained: +14; forks gained: +1.

    Language: Python

    Topics: agentic-ai, kv-cache, llm, llm-inference, long-context, quantization

  4. intel/auto-round

    A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

    GitHub repository with 1,436 stars and 135 forks.

    Trending score: 1.06; stars gained: +6; forks gained: +0.

    Language: Python

    Topics: gguf, int4, llms, mxfp4, nvfp4, quantization

  5. qualcomm/aimet

    AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

    GitHub repository with 2,633 stars and 451 forks.

    Trending score: 0.88; stars gained: +3; forks gained: +0.

    Language: Python

    Topics: quantization, deep-learning, compression, open-source, machine-learning, pruning

  6. wanshuiyin/ARIS-in-AI-Offer

    Bilingual (中文+EN) ML / LLM / diffusion / agent interview cheat sheets for AI 秋招 — auto-generated by the ARIS /render-html workflow into single-file HTML, reads anywhere — plus a CV→DBLP-fact-checked academic homepage generator and long-form blogs/surveys 🌱

    GitHub repository with 166 stars and 6 forks.

    Trending score: 0.59; stars gained: +3; forks gained: +0.

    Language: Python

    Topics: ai-interview, aris, autumn-recruiting, cheatsheet, chinese, claude-code

Trending in Python

  1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 181,649 stars and 31,166 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 13,361 stars and 853 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,422 stars and 2,253 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. anthropics/financial-services

    GitHub repository with 30,029 stars and 4,231 forks.

    Trending score: 4.88; stars gained: +688; forks gained: +114.

    Language: Python

  5. virgiliojr94/book-to-skill

    Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

    GitHub repository with 4,250 stars and 534 forks.

    Trending score: 4.88; stars gained: +476; forks gained: +68.

    Language: Python

  6. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,371 stars and 28,044 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools

Trending topic: quantization

  1. RyanCodrai/turbovec

    A vector index built on TurboQuant, written in Rust with Python bindings

    GitHub repository with 4,086 stars and 389 forks.

    Trending score: 1.70; stars gained: +63; forks gained: +9.

    Language: Python

    Topics: ann, avx512, embeddings, faiss, nearest-neighbor, neon

  2. hiyouga/LlamaFactory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    GitHub repository with 71,901 stars and 8,787 forks.

    Trending score: 1.30; stars gained: +22; forks gained: +2.

    Language: Python

    Topics: fine-tuning, llama, llm, peft, transformers, rlhf

  3. timtoole02/Camelid

    Camelid: a Rust-native local inference backend with evidence-gated model compatibility.

    GitHub repository with 49 stars and 10 forks.

    Trending score: 1.25; stars gained: +17; forks gained: +2.

    Language: Rust

    Topics: apple-silicon, gguf, inference, llama, llm, local-first

  4. huawei-csl/KVarN

    KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

    GitHub repository with 239 stars and 9 forks.

    Trending score: 1.14; stars gained: +14; forks gained: +1.

    Language: Python

    Topics: agentic-ai, kv-cache, llm, llm-inference, long-context, quantization

  5. intel/auto-round

    A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

    GitHub repository with 1,436 stars and 135 forks.

    Trending score: 1.06; stars gained: +6; forks gained: +0.

    Language: Python

    Topics: gguf, int4, llms, mxfp4, nvfp4, quantization

  6. qualcomm/aimet

    AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

    GitHub repository with 2,633 stars and 451 forks.

    Trending score: 0.88; stars gained: +3; forks gained: +0.

    Language: Python

    Topics: quantization, deep-learning, compression, open-source, machine-learning, pruning