Researchers at the Allen Institute for AI have created the Theorizer system, which analyzes arrays of scientific publications and attempts to formulate general patterns from them.
AI: Events
Claude Taught to Write CUDA Kernels and Train Open Models
Technical context • Development
Anthropic has enhanced Claude's capabilities in handling low-level code and transferring knowledge to other models through its new «Extended Thinking» feature.
AI: Events
How LinkedIn Trained Its Code-Generating GPT-OSS Using Agentic Reinforcement Learning
Technical context • Development
The LinkedIn team shared their experience applying reinforcement learning to an open-source model and discussed the challenges they faced in the process.
Researchers at AI21 Labs have devised a method to reduce online reinforcement learning costs by sending some data to «take a nap» until better times.
MiniMax has discussed its approach to fine-tuning language models that do more than just answer questions – they execute complex tasks by interacting with tools.
AMD has unveiled ReasonLite-0.6B, a compact language model focusing on logical reasoning, trained using a majority voting strategy and a staged approach.
Lab
Generalizing Generalization: When Neural Networks Learn to Predict – But Not What We Expected
Computer Science
Let's figure out why a language model's success on one test outside of training doesn't guarantee a win on another – and what this means for real-world AI applications.
Lab
Why Artificial Intelligence Learns From Our Mistakes: The Paradox of Inverse Reinforcement Learning
Finance & Economics
How machines decipher our hidden motives by observing behavior – from robotics to economics, where algorithms become archaeologists of human desire.
Lab
Artificial Intelligence Learns to Think Like Samba: Finding the Perfect Rhythm Between Too Simple and Too Complex
Computer Science
Brazilian researchers have developed the SEELE method, which teaches AI to tackle problems like a samba dancer finding the perfect rhythm – not too slow, not too frantic. It's about discovering that sweet spot where the algorithm truly comes to life