Intellectual hub of the topic

ai reliability

AI: Events

How to Tell if Your AI Agent is Actually Working or Just Looking Convincing

Development

LightOn has introduced the NOVA evaluation system. We explore how it works and why a «gut feeling» isn't enough to verify AI agents.

LightOn AIwww.lighton.ai Mar 12, 2026

AI: Events

AI's Chains of Thought Have a Mind of Their Own – and That's Surprisingly a Good Thing

Security

OpenAI has discovered that modern AI models struggle to control their own thought processes – and this could be a crucial defense against manipulation.

OpenAIopenai.com Mar 8, 2026

AI: Events

GPT-5.4 in Microsoft Foundry: A Model for Those Who Want to Act, Not Just Plan

Products

Microsoft has announced the upcoming release of GPT-5.4 in Foundry – a new OpenAI model focused on reliably executing tasks in real-world workflows.

Microsoftwww.microsoft.com Mar 6, 2026

AI: Events

How to Safely Update AI Services: Canary Releases Across Multiple Clusters

Infrastructure

We explore how companies update AI services without the risk of widespread outages, and why the canary release approach is becoming an industry standard.

Alibaba Cloudwww.alibabacloud.com Feb 25, 2026

AI: Events

How to Protect AI from Knowledge Theft: Anthropic Is Tackling the Problem

Security

Anthropic sheds light on distillation attacks – a method to copy an AI model's behavior without accessing its code – and discusses strategies for defending against such attacks.

Anthropicwww.anthropic.com Feb 24, 2026

AI: Events

AMD Demonstrates Non-Stop Large Model Training on Its GPUs Despite Crashes

Infrastructure

AMD has integrated TorchFT with TorchTitan to ensure resilient GPU training: the system can now autonomously recover from errors and keep running.

AMDwww.amd.com Feb 12, 2026

AI: Events

How2Everything: When Chatbot Instructions Actually Need to Work

Development

Allen AI has released a framework to evaluate and improve language models' ability to generate step-by-step instructions that actually help get the job done.

Ai2allenai.org Feb 12, 2026

AI: Events

AMD Shows How to Train Large Models Without the Fear of Losing Progress to a Single Crash

Infrastructure

The new pairing of TorchFT and TorchTitan allows model training on AMD GPUs to continue even after cluster node failures – without a full process restart.

AMDwww.amd.com Feb 10, 2026

AI: Events

BrowseSafe: How to Protect Browser AI Agents from Hidden Attacks

Security

A new defense system helps browser AI agents recognize malicious instructions hidden on web pages, preventing them from bypassing user tasks.

Perplexity AIresearch.perplexity.ai Feb 6, 2026

Want to know about new
experiments first?

Subscribe to our Telegram channel — we share all the latest
and exciting updates from NeuraBooks.