quic/efficient-transformers
This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transformers library) into inference-ready formats that run efficiently on Qualcomm Cloud AI 100 accelerators.
GitHub repository with 89 stars and 89 forks.
Language: Python
Topics: accelerator, ai, cloud, llm, qualcomm