E2/F5 TTS

This is an online demo for F5-TTS with advanced batch processing support. This app supports the following TTS models:

F5-TTS (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching)
E2 TTS (Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS)

The checkpoints currently support English and Chinese.

If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 15s with ✂ in the bottom right corner (otherwise might have non-optimal auto-trimmed result).

NOTE: Reference text will be automatically transcribed with Whisper if not provided. For best results, keep your reference clips short (<15s). Ensure the audio is fully uploaded before generating.

E2/F5 TTS

Batched TTS