Alibaba has released a preview version of its next flagship language model, Qwen3.5-Max-Preview, which has already appeared on a public platform.
AI: Events
How to Adapt a Large AI Model for Dozens of Languages and Cultures: The Sakana AI Approach
Research
Japanese lab Sakana AI has developed a technology to adapt large, general-purpose language models for specific languages and cultures.
AI: Events
AMD Opens Access to Powerful RL Training on Its GPUs: What This Means for Developers
Technical context • Infrastructure
AMD has adapted the Miles framework for large-scale reinforcement learning on Instinct GPUs – now it works without NVIDIA hardware.
Mistral has released Small 4, a new compact model that's faster, more accurate, and boasts improved performance across multiple languages, including Russian.
Mistral AI has joined the NVIDIA Nemotron coalition, a partnership aimed at advancing open language models and multimodal AI capabilities.
KDDI and ELYZA have been selected as language model providers for the Japanese Digital Agency's government AI program.
AI: Events
Mamba-3: Faster Than Transformers in Practice, Not Just on Paper
Technical context • Research
Mamba-3 has been released – an open-source language model that outpaces transformers in text generation speed and surpasses previous versions in quality.
Hcompany has introduced Holotron-12B, a language model capable of independently controlling a computer and performing tasks within the interfaces of real applications.
AMD explains how to easily deploy the Qwen3-5 language model on its Developer Cloud service using the SGLang framework.