Google has updated Vids: the editor now offers free video and music generation based on the Veo 3.1 and Lyria 3 AI models.
AI: Events
When One GPU Isn't Enough, and a Second Is Too Costly: A New Approach to Running AI in Production
Infrastructure
Two new open-source projects offer a way to run multiple AI models on a single GPU with dynamic memory management, without sacrificing performance.
Alibaba introduces Qwen3.6-Plus, an enterprise AI model capable of independently developing code and analyzing visual content in real-world scenarios.
How a small research team turns the theoretical potential of GPUs into real-world performance for AI systems – the story of the Together AI team.
Hcompany has introduced Holo3, an agent model that set a record on a key computer operation benchmark and is designed for autonomous work in corporate environments.
AI: Events
One GPU Failure Shouldn't Bring Down the Entire System
Technical context • Infrastructure
The Mooncake and Volcano Engine teams have integrated an elastic expert parallelism mechanism into the SGLang framework, allowing it to withstand partial failures without requiring a restart.
AI: Events
When Banks Stop Making You Wait: How AI Agents Are Transforming Customer Support
Products
Startup Gradient Labs has developed AI agents for banking support, powered by OpenAI models. These agents operate quickly, reliably, and without the need for human intervention.
AI: Events
Alibaba Unveils Wan2.7-Image: Precise Color, Lifelike Characters, and Error-Free Text
Products
Alibaba has unveiled Wan2.7-Image, a unified image generation model featuring precise color control, personalized characters, and support for 12 languages.
ASUS has unveiled the UGen300 – a compact USB accelerator designed for running AI directly on-device, без cloud или подписок, and with support for language models.