Edge-TTS
Generate speech audio using Microsoft Azure text-to-speech voices
Edge-TTS
Edge-TTS uses Microsoft Azure's text-to-speech service to convert your narration scripts into natural-sounding audio. It's available on all subscription plans and supports 50+ languages.
Features
- Multi-language support — English, Spanish, French, German, Japanese, Korean, Chinese, and many more
- Multiple voices per language — male and female voice options
- Speech rate control — adjust speed from 0.5x to 2.0x
- Real-time preview — listen to generated audio before saving
- Single and batch generation — process one line or an entire script
- Free with all plans — no additional API costs
Single Generation
Generate audio for a single piece of text:
- Open the Tool Suite panel → select Edge-TTS
- Type or paste your text in the input field
- Select a Language from the dropdown
- Choose a Voice (options change based on selected language)
- Adjust Speech Rate if needed (default: 1.0x)
- Click Generate to create the audio
- Use the Play button to preview, then Save to export
Batch Generation
Process an entire narration script at once:
- Prepare a text file with one line per audio segment
- In Edge-TTS, click Load Text File
- Configure language, voice, and speech rate
- Select an Output Folder
- Click Batch Generate — each line becomes a separate audio file
- Monitor progress in the progress bar
Output Formats
- MP3 — smaller file size, good for web and general use
- WAV — uncompressed, best quality for video production
Edge-TTS vs. Kokoro-TTS
| Feature | Edge-TTS | Kokoro-TTS | |---------|----------|------------| | Availability | All plans | Standard & Pro | | Processing | Cloud (requires internet) | Local (runs on your machine) | | Languages | 50+ languages | English (primary) | | Voice quality | Good (Azure neural voices) | Excellent (neural TTS) | | Speed | Fast (cloud processing) | Depends on hardware (GPU recommended) | | Cost | Free (included) | Free (included) |
Tips
- For the best results with video narration, use a consistent voice throughout a chapter
- Adjust speech rate to match the pacing of your content — action scenes may benefit from slightly faster narration
- Preview audio before batch processing to ensure the voice and rate settings are right
- Edge-TTS requires an internet connection — for offline processing, use Kokoro-TTS instead