A new feature in ElevenCreative lets you turn text into a finished audiobook without setting foot in a recording studio or hiring professional narrators.
Indian startup Sarvam AI has unveiled Bulbul V3 – a speech synthesis model supporting 15 languages and capable of voice cloning from a short audio sample.
Indian developers have unveiled an audio model that doesn't just transcribe speech – it understands the context of the conversation and adapts its output format accordingly.
Indian company Sarvam AI has unveiled a system that automatically dubs videos into regional languages while preserving the original intonation and synchronizing lip movements.
Version 1.2 expands editing and audio capabilities in the Suno Studio generative workstation, providing users with more control over the final mix.
A Brazilian engineer explains how the new DARC model lets you control drum rhythms by beatboxing without losing musical harmony – much like conducting a samba with hand gestures.