Intellectual hub of the topic

open language models

AI: Events

Alibaba Unveils Qwen3.5-Max-Preview: What We Know About the New Flagship

Products

Alibaba has released a preview version of its next flagship language model, Qwen3.5-Max-Preview, which has already appeared on a public platform.

Alibaba Cloudwww.alibabacloud.com Mar 25, 2026

AI: Events

How to Adapt a Large AI Model for Dozens of Languages and Cultures: The Sakana AI Approach

Research

Japanese lab Sakana AI has developed a technology to adapt large, general-purpose language models for specific languages and cultures.

Sakana AIsakana.ai Mar 24, 2026

AI: Events

AMD Opens Access to Powerful RL Training on Its GPUs: What This Means for Developers

Technical context • Infrastructure

AMD has adapted the Miles framework for large-scale reinforcement learning on Instinct GPUs – now it works without NVIDIA hardware.

LMSYS ORGlmsys.org Mar 24, 2026

AI: Events

Mistral Small 3.1 Makes Way for Mistral Small 4

Products

Mistral has released Small 4, a new compact model that's faster, more accurate, and boasts improved performance across multiple languages, including Russian.

Mistral AImistral.ai Mar 20, 2026

AI: Events

Mistral AI and NVIDIA Team Up for Open Models

Business

Mistral AI has joined the NVIDIA Nemotron coalition, a partnership aimed at advancing open language models and multimodal AI capabilities.

Mistral AImistral.ai Mar 20, 2026

AI: Events

Japanese Government Selects Domestic Language Model for State AI Initiative

Regulation

KDDI and ELYZA have been selected as language model providers for the Japanese Digital Agency's government AI program.

ELYZA.incelyza.ai Mar 20, 2026

AI: Events

Mamba-3: Faster Than Transformers in Practice, Not Just on Paper

Technical context • Research

Mamba-3 has been released – an open-source language model that outpaces transformers in text generation speed and surpasses previous versions in quality.

Together.aiwww.together.ai Mar 19, 2026

AI: Events

Holotron-12B: The Agent That Controls Your Computer for You

Products

Hcompany has introduced Holotron-12B, a language model capable of independently controlling a computer and performing tasks within the interfaces of real applications.

Hugging Facehuggingface.co Mar 18, 2026

AI: Events

Qwen3-5 and AMD: How to Run a Powerful Language Model on Cloud Hardware

Infrastructure

AMD explains how to easily deploy the Qwen3-5 language model on its Developer Cloud service using the SGLang framework.

AMDwww.amd.com Mar 17, 2026

Want to dive deeper into the world
of neuro-creativity?

Be the first to learn about new books, articles, and AI experiments
on our Telegram channel!