Google has unveiled TurboQuant, an algorithm that compresses AI's working memory sixfold, which could fundamentally change the approach to neural network infrastructure.
Salesforce AI Research explains how it is restructuring language model training for the agentic era – and why old approaches no longer work.
Hcompany has introduced Holo3, an agent model that set a record on a key computer-use benchmark and is designed for autonomous work in corporate environments.
Arcee AI has released Trinity-Large-Thinking, an open model with a 'thinking' function for complex agentic tasks, available under the Apache 2.0 license.
AI: Events
AMD at MLPerf Inference 6.0: A Million Tokens Per Second and a Debut in Video Generation
Technical context • Infrastructure
AMD has presented its MLPerf Inference 6.0 results, showcasing new performance records, its first video generation tests, and cluster-level scaling on Instinct MI355X GPUs.
AI: Events
Red Hat AI Achieves Top Results in MLPerf Inference v6.0 – Here's What's Behind It
Infrastructure
Red Hat AI has secured top spots in the latest round of the MLPerf Inference v6.0 benchmark, testing three models on both NVIDIA and AMD GPUs.
Strict prohibitions don't make AI reliable; governance that evolves with the technology does.
OpenAI has closed the largest funding round in the history of the tech industry, securing $122 billion for AI infrastructure development and global expansion.
Google has launched Veo 3.1 Lite – a lightweight version of its video generation model that is significantly more affordable to use.