Runtime Text To Speech - offline, cross-platform TTS, over 35 languages and 900 voices

:speaking_head: Transform your game with real-time, offline, cross-platform text-to-speech synthesis!

Add powerful offline text-to-speech capabilities to your project with over 35 languages and 900 voices featuring more than 120 voice qualities. Synthesize speech in real-time without internet connectivity, powered by Piper and ONNX Runtime.

Quick links:

Key features:

:dart: Core Capabilities:

  • Complete offline text-to-speech synthesis
  • 35+ languages supported
  • 900+ unique voices available
  • 120+ voice qualities
  • Cross-platform support: Windows, Linux, Mac, Android (including Oculus/Meta Quest), iOS
  • Experimental support for Meta Quest and Apple Vision Pro

:zap: Voice System:

  • One-click voice model downloads through editor interface
  • In-editor voice preview and testing
  • Runtime voice model selection
  • Raw PCM float audio output
  • Flexible integration with any audio playback solution
  • Built-in compatibility with Runtime Audio Importer

:hammer_and_wrench: Development Features:

  • Full Blueprint and C++ API support
  • Easy voice model management and packaging
  • Comprehensive voice metadata access
  • Simple voice model selection via dropdown
  • Automated voice model packaging with projects

:earth_africa: Supported Languages:

  • :us: English (United States) – 18 voice models, 23 qualities
  • :uk: English (British) – 9 voice models, 11 qualities
  • :cn: Simplified Chinese (简体中文) – 1 voice model, 2 qualities
  • :mexico: Spanish (Mexican / Español Mexicano) – 2 voice models, 2 qualities
  • :es: Spanish (European / Español Europeo) – 5 voice models, 5 qualities
  • :ru: Russian (Русский) – 4 voice models, 4 qualities
  • :brazil: Portuguese (Brazil / Português do Brasil) – 2 voice models, 2 qualities
  • :portugal: Portuguese (Portugal / Português de Portugal) – 1 voice model, 1 quality
  • :de: German (Deutsch) – 8 voice models, 10 qualities
  • :fr: French (Français) – 6 voice models, 7 qualities
  • :tr: Turkish (Türkçe) – 3 voice models, 3 qualities
  • :poland: Polish (Polski) – 4 voice models, 4 qualities
  • :it: Italian (Italiano) – 2 voice models, 2 qualities
  • :ukraine: Ukrainian (Украї́нська мо́ва) – 2 voice models, 2 qualities
  • :andorra: Catalan (Català) – 2 voice models, 3 qualities
  • :czech_republic: Czech (Čeština) – 1 voice model, 2 qualities
  • :wales: Welsh (Cymraeg) – 1 voice model, 1 quality
  • :denmark: Danish (Dansk) – 1 voice model, 1 quality
  • :greece: Greek (Ελληνικά) – 1 voice model, 1 quality
  • :iran: Farsi (فارسی) – 2 voice models, 2 qualities
  • :finland: Finnish (Suomi) – 1 voice model, 1 quality
  • :hungary: Hungarian (Magyar) – 3 voice models, 3 qualities
  • :iceland: Icelandic (Íslenska) – 4 voice models, 4 qualities
  • :georgia: Georgian (ქართული ენა) – 1 voice model, 1 quality
  • :kazakhstan: Kazakh (Қазақша) – 3 voice models, 3 qualities
  • :luxembourg: Luxembourgish (Lëtzebuergesch) – 1 voice model, 1 quality
  • :latvia: Latvian (Latviešu) – 1 voice model, 1 quality
  • :nepal: Nepali (नेपाली) – 1 voice model, 2 qualities
  • :belgium: Dutch (Belgium / Vlaams) – 2 voice models, 4 qualities
  • :netherlands: Dutch (Netherlands / Nederlands) – 3 voice models, 3 qualities
  • :norway: Norwegian (Bokmål / Nynorsk) – 1 voice model, 1 quality
  • :romania: Romanian (Română) – 1 voice model, 1 quality
  • :slovakia: Slovak (Slovenčina) – 1 voice model, 1 quality
  • :slovenia: Slovenian (Slovenščina) – 1 voice model, 1 quality
  • :serbia: Serbian (Srpski) – 1 voice model, 1 quality
  • :sweden: Swedish (Svenska) – 1 voice model, 1 quality
  • :kenya: Swahili (Kiswahili) – 1 voice model, 1 quality
  • :vietnam: Vietnamese (Tiếng Việt) – 3 voice models, 3 qualities

:video_game: Perfect for:

  • Accessible game interfaces
  • Dynamic NPC conversations
  • Voice-driven tutorials and hints
  • Procedurally generated content
  • Localization solutions
  • Assistive technologies
  • Interactive storytelling
  • Educational applications