OpenMOSS/MOSS-TTS
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.
GitHub repository with 3,071 stars and 273 forks.
Language: Python
Topics: audio, audio-tokenizer, llm, multimodal, text-to-speech, voice-cloning