Maya1 TTS
What is Maya1 TTS?
The First Fully Open-Source 3B Parameter TTS Model
Maya1 TTS is a revolutionary open-source text-to-speech model developed by Maya Research. Built on a Llama-style decoder-only Transformer architecture with 3B parameters, it combines natural language voice control, emotional expression capabilities, and real-time streaming generation. Our approach delivers unprecedented voice customization and naturalness, supporting detailed voice descriptions, 20+ emotional tags, and multiple English accents.
- Natural Language Voice Control: Use XML-style descriptions to define voice characteristics like age, accent, pitch, and personality
- Emotional Expression: 20+ emotional tags like <laugh>, <sigh>, <whisper>, <angry> for realistic human emotions
- Real-Time Streaming: Sub-100ms latency with low buffering for interactive applications
- Open Source: Apache 2.0 licensed, commercial-friendly, and no per-second fees
Getting Started with Maya1 TTS
Quick Guide to Using Maya1 TTS
- Visit the Hugging Face repository to access Maya1 TTS models and resources
- Install dependencies: torch, transformers, snac, and soundfile
- Load the model and create prompts with voice descriptions and emotional tags
Maya1 TTS Key Features
Discover What Makes Maya1 TTS Revolutionary
Natural Language Voice Control
Use intuitive XML-style descriptions to define voice characteristics. Simply describe age, accent, pitch, tone, and personality in natural language.
Inline Emotional Tags
Insert 20+ emotional tags directly into text to control local expression. Tags include <laugh>, <sigh>, <whisper>, <angry>, <giggle>, and more based on real human emotions.
Real-Time Streaming Generation
Achieve sub-100ms latency with low buffering for interactive applications. Perfect for AI assistants, gaming, and live content creation.
Multi-Accent English Support
Supports various English accents and character variations, pre-trained on internet-scale English speech corpora for diverse voice options.