winstonsmith1897/DantinoX

DantinoX: A modular, memory-efficient Transformer implementation in JAX/Flax NNX. Includes Sparse MoE, GQA, Sliding Window Attention, Gradient Accumulation and Checkpointing

GitHub repository with 5 stars and 1 fork.

Language: Python

Topics: attention-mechanism, mixture-of-experts, transformer-architecture, flax, jax, fine, llm, pre-training


24h trend summary

Trending score 0.04, activity score 0.05, stars gained +0, forks gained +0.

Latest metric snapshot

2026-06-13: 5 stars and 1 fork.

Similar repositories

  1. hasibul0912/ChunkWise

    📝 Streamline text processing in Arabic and English with ChunkWise, a library offering 31 chunking strategies for NLP and RAG systems.

    GitHub repository with 5 stars and 0 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: aggregation, attention-mechanism, chunk, chunkwise-processing, csv, data-analysis

  2. winstonsmith1897/DantinoX

    DantinoX: A modular, memory-efficient Transformer implementation in JAX/Flax NNX. Includes Sparse MoE, GQA, Sliding Window Attention, Gradient Accumulation and Checkpointing

    GitHub repository with 5 stars and 1 fork.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: attention-mechanism, mixture-of-experts, transformer-architecture, flax, jax, fine

  3. lucidrains/isab-pytorch

    An implementation of (Induced) Set Attention Block, from the Set Transformers paper

    GitHub repository with 70 stars and 4 forks.

    Trending score: 0.03; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: artificial-intelligence, deep-learning, attention-mechanism, attention

  4. qflen/nsa-from-scratch

    From-scratch reimplementation of DeepSeek's Native Sparse Attention (arXiv:2502.11089) in Triton + CUDA Hopper WGMMA. 7.4x faster than FlashAttention-3 at 64k context. Five-model training fleet, perplexity sweep, LongBench v2, MoBA comparison.

    GitHub repository with 6 stars and 0 forks.

    Trending score: 0.00; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: attention-mechanism, cuda, deepseek, flash-attention, gpu-kernels, hopper

Trending in Python

  1. mvanhorn/last30days-skill

    AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

    GitHub repository with 40,614 stars and 3,271 forks.

    Trending score: 5.82; stars gained: +1,312; forks gained: +87.

    Language: Python

  2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 25,425 stars and 1,676 forks.

    Trending score: 5.73; stars gained: +2,844; forks gained: +202.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. pewdiepie-archdaemon/odysseus

    Self-hosted AI workspace.

    GitHub repository with 69,697 stars and 8,821 forks.

    Trending score: 5.70; stars gained: +951; forks gained: +165.

    Language: Python

  4. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 192,327 stars and 33,531 forks.

    Trending score: 5.48; stars gained: +990; forks gained: +282.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  5. safishamsi/graphify

    AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

    GitHub repository with 66,467 stars and 6,719 forks.

    Trending score: 5.25; stars gained: +1,314; forks gained: +109.

    Language: Python

    Topics: antigravity, claude-code, codex, gemini, graphrag, knowledge-graph

  6. hugohe3/ppt-master

    AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images · by Hugo He

    GitHub repository with 27,112 stars and 2,418 forks.

    Trending score: 5.10; stars gained: +903; forks gained: +61.

    Language: Python

    Topics: ai-agent, aippt, office, powerpoint, powerpoint-generation, ppt

Trending topic: attention-mechanism

  1. ruvnet/RuVector

    RuVector is a high-performance, real-time, self-learning AI, vector GNN, and memory DB built in Rust.

    GitHub repository with 4,235 stars and 564 forks.

    Trending score: 2.16; stars gained: +13; forks gained: +6.

    Language: Rust

    Topics: ai, ai-ocr, gnn, graph, llm-inference, low-latency

  2. nndl/nndl

    Qiu Xipeng's "Neural Networks and Deep Learning" (the "Dandelion Book"): theory book v2 and general-audience edition

    GitHub repository with 18,809 stars and 3,667 forks.

    Trending score: 0.29; stars gained: +0; forks gained: +1.

    Topics: attention-mechanism, chinese, deep-learning, machine-learning, neural-networks, textbook

  3. hasibul0912/ChunkWise

    📝 Streamline text processing in Arabic and English with ChunkWise, a library offering 31 chunking strategies for NLP and RAG systems.

    GitHub repository with 5 stars and 0 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: aggregation, attention-mechanism, chunk, chunkwise-processing, csv, data-analysis

  4. winstonsmith1897/DantinoX

    DantinoX: A modular, memory-efficient Transformer implementation in JAX/Flax NNX. Includes Sparse MoE, GQA, Sliding Window Attention, Gradient Accumulation and Checkpointing

    GitHub repository with 5 stars and 1 fork.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: attention-mechanism, mixture-of-experts, transformer-architecture, flax, jax, fine

  5. Scottcjn/pse-vcipher-collapse

    Non-bijunctive attention collapse for LLM inference — POWER8 hardware AES (vcipher) + AltiVec vec_perm. Hebbian path selection, cross-head diffusion, O(1) KV prefiltering.

    GitHub repository with 34 stars and 3 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Language: C

    Topics: aes, altivec, attention-mechanism, cpu-inference, deep-learning, hardware-acceleration

  6. lucidrains/isab-pytorch

    An implementation of (Induced) Set Attention Block, from the Set Transformers paper

    GitHub repository with 70 stars and 4 forks.

    Trending score: 0.03; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: artificial-intelligence, deep-learning, attention-mechanism, attention