How the exaCB continuous benchmarking system helps monitor the performance of dozens of scientific applications on the exascale supercomputer JUPITER.
AI: Events
900 MW for AI: Crusoe Is Building a Giant Data Center in Texas for Microsoft
Infrastructure
Crusoe has announced the construction of a massive 900 MW AI campus in Abilene, Texas, to support Microsoft's infrastructure needs.
Researchers have demonstrated that small language models can outperform GPT-4o when processing long texts by breaking down tasks and distributing the work among multiple agents.
AI: Events
Smart Selectivity: How a Hybrid Neural Network Remembers Only What's Important
Technical context • Research
A new approach to neural network architecture dramatically reduces memory consumption for text processing without sacrificing comprehension quality.
EgoVerse is an open-source system for training robots using human first-person video, developed by a consortium of leading research teams.
Inception Labs has introduced Mercury 2 – a diffusion language model that operates quickly and affordably, paving the way for a new approach to creating AI agents.
AI: Events
How to Adapt a Large AI Model for Dozens of Languages and Cultures: The Sakana AI Approach
Research
Japanese lab Sakana AI has developed a technology to adapt large, general-purpose language models for specific languages and cultures.
The Korean company Upstage has released the Solar Pro 3 language model, which handles multi-step agentic tasks twice as effectively as its predecessor.
How SmartSearch learned to pull signal from the noise of messy dialogues without complex algorithms – and why effective ranking is more valuable than a perfectly organized data archive.