Alibaba has introduced Qwen3.5, the first model in the Qwen3 family, adept at processing text, images, and audio natively, without needing additional adapters.
AI: Events
ByteDance Releases Dola-Seed-2.0-Preview: A Long-Context Model with Advanced Reasoning
Products
ByteDance has introduced Dola-Seed-2.0-Preview, a new language model that combines long-context capabilities, advanced analytical features, and multimodality.
AI: Events
How AMD and Qwen Optimized MI300X GPUs for Peak Performance
Technical context • Infrastructure
The Qwen team optimized their models to effectively run on AMD MI300X GPUs, achieving a response latency as low as 15 ms per token and full image generation in just 0.4 seconds.
Alibaba has released Qwen-Image 2.0 – a model that generates 2K images, handles text, and allows graphics editing within a single tool.
The Bangalore-based developer has released a multimodal model that understands speech, text, and images, supports India's major languages, and is capable of operating offline.
Copy.ai shared how the combined use of text, data, and images allows for the integration of fragmented workflows into a single, efficient ecosystem.
An Indian startup has released a compact multimodal model capable of recognizing text in 22 of the country's languages – often more accurately than global counterparts.
Chinese company Tencent is releasing a large multimodal model to the public. It is already ranked in the top 7 globally for image editing on LMArena, and it's the best among open models.
LG AI Research shared details about K-EXAONE – a multimodal model developed using proprietary technology, specifically tailored for the Korean language and cultural context.