Cursor has introduced Bugbot Autofix, a tool that automatically corrects errors found in code by launching separate cloud agents to handle the task.
We explain how the Mixture of Experts architecture works – an approach that makes models smarter without making them 'think' harder.
AI: Events
Offline Tuning in PyTorch: Accelerating Neural Networks Before Their First Run
Technical context • Infrastructure
An exploration of how TunableOp technology enables the pre-selection of optimal parameters for neural networks, and why this is a valuable practice.
We explore why the future of AI agents lies not in a single powerful model, but in the coordinated work of specialized systems, each responsible for its own domain.
AI: Events
How to Make Small Language Models Think Better: AMD's Experience with Synthetic Data
Development
AMD has introduced LuminaSFT, an approach that uses synthetic data to fine-tune small language models and achieve surprisingly high performance.
AI: Events
Cache as a Resource: How Alibaba Cloud Teaches AI Not to Calculate the Same Thing Twice
Technical context • Infrastructure
Alibaba Cloud has introduced a precise request routing mechanism for language models that significantly boosts caching efficiency in distributed inference.
AMD has released JAX-AITER, a library of pre-built, optimized computational blocks for developing large AI models on AMD GPUs using the JAX framework.
AI: Events
How to Safely Update AI Services: Canary Releases Across Multiple Clusters
Infrastructure
We explore how companies update AI services without the risk of widespread outages, and why the canary release approach is becoming an industry standard.
AI: Events
Liquid AI Releases LFM2-24B, Its Largest Language Model – And It Runs on a Regular Laptop
Products
Liquid AI has introduced LFM2-24B – its largest language model, featuring an unconventional architecture and the ability to run both in the cloud and on local devices.