Indian company Sarvam AI has open-sourced two large language models – 30B and 105B – with a focus on supporting the languages of India.
Why the ability to run AI models on any hardware is becoming a strategically important task and how the open-source community is solving it.
Alibaba Cloud has open-sourced SysOM MCP – a tool that allows AI agents to independently diagnose problems in server and system operations.
Alibaba DAMO Academy has unveiled RynnBrain, an open-source model for robot control capable of interpreting its environment and making real-world decisions.
Two key libraries for running AI models on everyday devices have joined forces with Hugging Face – and it could change the future of local AI.
AI: Events
Tencent Releases the Most Compact Language Model: 0.3 Billion Parameters in 600 MB
Development
The Chinese company has open-sourced the HY-1.8B-2Bit model with 2-bit quantization – it weighs less than many mobile apps.
AI: Events
Olmix: Allen AI's Approach to Data Mixing Across All Stages of Language Model Training
Development
Allen AI has introduced Olmix, an open-source framework for data mixing in the language model training process, including pre-training, instruction tuning, and alignment.
AI: Events
MiniMax Introduces Forge: A Platform for Training AI Agents on Powerful Computing Clusters
Infrastructure
Chinese company MiniMax has released Forge, an open platform designed for training agents using reinforcement learning on large-scale GPU clusters.
Chinese company MiniMax has released M2.5, a family of open-weight models whose performance is approaching that of Claude 3.5 Sonnet.