A year has passed since DeepSeek demonstrated that powerful models can be created without billion-dollar budgets – and the industry hasn't been the same since.
AI: Events
SenseTime Open-Sources SenseNova-MARS – A Model for Searching and Analyzing Diverse Data Types
Products
The Chinese company has released an open-source model that works simultaneously with text, images, video, and audio, and is also capable of searching and analyzing information.
Hugging Face has introduced Daggr – an open-source tool that helps assemble chains of AI models and visually track their internal processes.
The Chinese text recognition model has been adapted for AMD GPUs – we break down what this means for those working with documents.
OpenHands has launched a benchmark demonstrating how models handle real-world GitHub tasks – from bug fixes to implementing new features in open-source projects.
AI: Events
Claude Taught to Write CUDA Kernels and Train Open Models
Technical context • Development
Anthropic has enhanced Claude's capabilities in handling low-level code and transferring knowledge to other models through its new «Extended Thinking» feature.
We explore the architectural solutions developers of Chinese open-source models are choosing and why decoder-based approaches continue to dominate the ecosystem.
AI: Events
Open Coding Agents: AI Code Assistants That Work With Any Repository
Technical context • Development
The Allen Institute for AI has unveiled Open Coding Agents – open-source models for autonomous coding that adapt to a project's structure.
AI: Events
How LinkedIn Trained Its Code-Generating GPT-OSS Using Agentic Reinforcement Learning
Technical context • Development
The LinkedIn team shared their experience applying reinforcement learning to an open-source model and discussed the challenges they faced in the process.