fla-org/flash-linear-attention
🚀 Efficient implementations for emerging model architectures
GitHub repository with 5,188 stars and 549 forks.
Language: Python
Topics: large-language-models, machine-learning-systems, natural-language-processing, sequence-modeling