Hume unveils Octave, a groundbreaking AI model that goes beyond reading text: it grasps its meaning, producing natural, expressive voices that capture emotions and contexts like never before.
Hume has introduced Octave, a text-to-speech system bringing a fresh approach to artificial intelligence. Unlike conventional methods that simply pronounce words, this model—described by its creators as the first large language model for text-to-speech—interprets a text’s context and emotions. It adjusts tone, rhythm, and timbre, delivering whispers for intimate scenes or calm explanations, much like an actor reading a script.
In a test with 180 evaluators, Octave outperformed ElevenLabs, a notable competitor. It earned 71.6% preference in audio quality, 51.7% in naturalness, and 57.7% in matching voice descriptions, based on 120 varied examples, from movie narrators to medieval characters. These results highlight its ability to adapt to diverse styles and needs.
The system features tools like Voice Design, which crafts unique voices from detailed descriptions, such as an empathetic counselor or a medieval knight. It also offers Acting Instructions, enabling real-time tweaks to emotions and styles. Soon, it will add voice cloning, requiring just five seconds of audio to replicate a voice.
Octave is now accessible on platform.hume.ai and via API, making it suitable for audiobooks, podcasts, or interactive apps. Alongside this, Hume has launched Expressive TTS Arena, a public platform where anyone can compare advanced voice systems and test their skills with complex, expressive texts.
Developed initially for English and Spanish, Octave is still evolving. Beyond synthesizing speech, it explores how people express themselves, paving the way for future AI applications.
Research laboratory and technology company specialized in AI models with emotional intelligence. Its main model integrates voice and language processing, with adjustable voice synthesis in timbre, ...
15/01/2026
Replit has launched Mobile Apps on Replit, a feature that allows users to describe an idea, create the application, and publish it completely on the ...
14/01/2026
Google has introduced Personal Intelligence, a feature that allows Gemini to access information from applications like Gmail, Google Photos, and ...
07/01/2026
OpenAI has introduced ChatGPT Health, a dedicated experience that allows users to connect their medical records and wellness apps to obtain ...
05/01/2026
Amazon introduces Alexa.com, a new platform that brings its Alexa+ artificial intelligence assistant to web browsers and completes its multi-platform ...