Cartesia
Real-time voice AI with ultra-low latency text-to-speech and voice cloning in 40+ languages
Free Trial
Cartesia provides real-time voice AI APIs built on state space models. Its Sonic-3 TTS engine delivers 90ms time-to-first-audio with natural, expressive voices including laughter and emotion in 40+ languages. Voice cloning requires just 15 seconds of audio. Also offers Ink-Whisper streaming speech-to-text and on-device models for edge deployment. Common use cases include voice agents, customer support, and interactive applications. Free tier includes 20,000 credits per month.
Pricing: Free / monthly subscriptions
Is your product missing? 👀 Add it here →