Hcompany has introduced Holo3, an agent model that set a record on a key computer operation benchmark and is designed for autonomous work in corporate environments.
Company H has announced the release of Holo3, a model that has set a new record on the leading benchmark for AI agents that operate computers.
AI: Events
When an Agent Doesn't Know the Answer: How Retrieval Models Are Learning to Find the Unreachable
Products
Mixedbread has released Search v3 – a retrieval model that significantly narrows the gap between what an agent actually finds and what is theoretically discoverable within the data.
The Allen Institute has introduced MolmoWeb, an open-source web agent. It navigates browsers visually, much like a human, and outperforms many proprietary competitors.
Databricks has developed its own approach to creating AI agents – the coSTAR system, which allows the team to work quickly without losing control over quality.
We explore why assessing AI agents' skills isn't just a formality, but a crucial step toward building systems you can trust with real-world tasks.
LightOn has introduced the NOVA evaluation system. We explore how it works and why a «gut feeling» isn't enough to verify AI agents.
AI: Events
OpenAI and Federal Permits: How AI Is Accelerating One of the Slowest U.S. Bureaucratic Systems
Regulation
In partnership with a national laboratory, OpenAI has developed a tool to evaluate AI agents for speeding up federal approvals and is already seeing the first measurable results.
AI: Events
A Powerful AI Agent Without the Cloud: How LFM2-24B-A2B Runs Directly on Your Computer
Products
Liquid AI has introduced the LFM2-24B-A2B model, capable of running AI agents with tool-calling capabilities directly on consumer hardware – without the cloud or latency.