
👉🏻 CosyVoice 👈🏻
Fun-CosyVoice 3.0: Demos; Paper; Modelscope; HuggingFace; CV3-Eval
CosyVoice 2.0: Demos; Paper; Modelscope; HuggingFace
CosyVoice 1.0: Demos; Paper; Modelscope; HuggingFace
Highlight🔥
Fun-CosyVoice 3.0 is an advanced text-to-speech (TTS) system based on large language models (LLMs). It surpasses its predecessor, CosyVoice 2.0, in content consistency, speaker similarity, and prosody naturalness, and is designed for zero-shot multilingual speech synthesis in the wild.
Key Features
- Language Coverage: Covers 9 common languages (Chinese, English, Japanese, Korean, German, Spanish, French, Italian, Russian) and 18+ Chinese dialects/accents (Guangdong, Minnan, Sichuan, Dongbei, Shan3xi, Shan1xi, Shanghai, Tianjin, Shandong, Ningxia, Gansu, etc.), and supports both multilingual and cross-lingual zero-shot voice cloning.
- Content Consistency & Naturalness: Achieves state-of-the-art performance in content consistency, speaker similarity, and prosody naturalness.
- Pronunciation Inpainting: Supports pronunciation inpainting with Chinese Pinyin and English CMU phonemes, offering finer control over pronunciation and making the system suitable for production use.
- Text Normalization: Supports reading numbers, special symbols, and various text formats without a traditional frontend module.
- Bi-Streaming: Supports both streaming text input and streaming audio output, achieving latency as low as 150 ms while maintaining high-quality audio output.
- Instruct Support: Supports various instructions controlling language, dialect, emotion, speed, volume, etc.
Roadmap
- 2025/12
  - release Fun-CosyVoice3-0.5B-2512 base model, RL model, and their training/inference scripts
  - release Fun-CosyVoice3-0.5B ModelScope Gradio space
- 2025/08