Indian developers have unveiled an audio model that doesn't just transcribe speech – it understands the context of the conversation and adapts its output format accordingly.
Version 1.2 expands editing and audio capabilities in the Suno Studio generative workstation, providing users with more control over the final mix.
Anthropic has proposed a way to standardize the integration of language models with external sources – from databases to work tools. We explore how the MCP protocol solves the problem of fragmented integrations.
Robots are learning to coordinate their actions with one another. We take a closer look at how group interaction works, why it's trickier than it looks, and the role modern neural networks play in the process.
AI: Events
Barcelona Supercomputing Center and ACAPPS develop AI tools for people with hearing impairments
Society
BSC and ACAPPS are developing AI-driven technologies designed to help deaf and hard-of-hearing individuals interact more effectively with digital services.
Roblox has presented Cube, its proprietary model for generating three-dimensional scenes. The tool is designed to simplify spatial design and help users create content within the platform faster.
Mistral AI has unveiled Voxtral – a real-time speech transcription model featuring precise speaker separation and a new interactive «sandbox» for audio workflows.
The company has released an improved version of its model, designed to adapt user interfaces for various languages and cultures.
The Elastic platform has acquired a built-in automation system, the ability to ask data questions in plain language, and a tool for quickly assembling AI agents.