Topic hub: multimodal models

AI: Events

Qwen3.5: The First Natively Multimodal Model

Products

Alibaba has introduced Qwen3.5, the first model in the Qwen3 family that processes text, images, and audio natively, without additional adapters.

Alibaba Cloud, www.alibabacloud.com, Feb 17, 2026

ByteDance Releases Dola-Seed-2.0-Preview: A Long-Context Model with Advanced Reasoning

Products

ByteDance has introduced Dola-Seed-2.0-Preview, a new language model that combines long-context capabilities, advanced analytical features, and multimodality.

ByteDance, seed.bytedance.com, Feb 16, 2026

How AMD and Qwen Optimized MI300X GPUs for Peak Performance

Technical context • Infrastructure

The Qwen team optimized its models to run efficiently on AMD MI300X GPUs, achieving per-token response latency as low as 15 ms and full image generation in 0.4 seconds.

LMSYS ORG, lmsys.org, Feb 13, 2026

Qwen-Image 2.0: When a Neural Network Can Both Draw and Edit

Products

Alibaba has released Qwen-Image 2.0, a model that generates 2K images, handles text, and supports graphics editing within a single tool.

Alibaba Cloud, www.alibabacloud.com, Feb 12, 2026

Indian Company Sarvam Unveils Arya Voice Assistant with 10-Language Support

Products

The Bangalore-based developer has released a multimodal model that understands speech, text, and images, supports India's major languages, and is capable of operating offline.

Sarvam, www.sarvam.ai, Feb 11, 2026

Copy.ai Explains How Multimodality is Transforming Sales and Marketing

Business

Copy.ai described how combining text, data, and images lets teams consolidate fragmented workflows into a single, efficient ecosystem.

Copy AI, www.copy.ai, Feb 10, 2026

Sarvam Vision: A Document-Processing Model with Indic Language Expertise

Products

An Indian startup has released a compact multimodal model capable of recognizing text in 22 of the country's languages – often more accurately than global counterparts.

Sarvam, www.sarvam.ai, Feb 9, 2026

Tencent Open-Sources 80B-Parameter Hunyuan Model: What It Means

Products

Chinese company Tencent is open-sourcing a large multimodal model. It already ranks in the top 7 globally for image editing on LMArena and is the best among open models.

Tencent, hunyuan.tencent.com, Feb 4, 2026

K-EXAONE: How South Korea's LG is Building Its Own Large Language Model

Products

LG AI Research shared details about K-EXAONE – a multimodal model developed using proprietary technology, specifically tailored for the Korean language and cultural context.

LG AI Research, www.lgresearch.ai, Feb 4, 2026
