Published on March 14, 2026

Оценка неопределенности в ИИ: как модель сигнализирует о незнании

Model Uncertainty as a Signal: What Happens When AI Encounters the Unknown

Researchers have proposed a new way to measure AI model uncertainty – through 'feature gaps,' which help to more accurately determine when a model shouldn't be trusted.

Research 4 – 5 minutes min read
Event Source: Capital One 4 – 5 minutes min read

When we use any AI-based system – be it for credit scoring, medical diagnostics, or recommendations – there's always a question behind the scenes: how confident is the model in its answer? This is not just idle curiosity. If a model makes a mistake while 'thinking' it's correct, the consequences can be quite significant.

This is precisely the problem addressed by an entire field of research: uncertainty estimation. Recently, a paper was presented at the ICLR conference that offers a fresh perspective on how to measure this uncertainty.

Что значит неуверенность ИИ-модели

What Does It Mean for a Model to Be 'Uncertain'?

Simply put, any trained model knows the world only within the scope of the data it was trained on. When it encounters something unlike anything it has seen before, the model finds itself in a blind spot. Sometimes, it still provides a confident answer – simply because it doesn't know how to recognize its own ignorance.

The task for researchers is to teach the model to honestly signal, «I don't really understand what's in front of me.» This requires some kind of metric that can reflect the degree of this uncertainty.

Что такое пробелы в признаках модели

Feature Gaps – What Are They?

In the new paper, uncertainty is measured through so-called feature gaps. The intuition here is quite straightforward.

When a model processes input data, it internally forms a 'representation' of the object – a set of characteristics it uses to describe it. If this object is similar to what the model has seen before, its internal representation will be dense and rich. If the object is unfamiliar, 'holes,' or gaps, will appear in this representation.

It is precisely these gaps that the authors propose using as a measure of uncertainty. Technically, this is formalized through cross-entropy between the probability distribution output by the model and the 'true' distribution, which is unknown in reality. It sounds complex, but the essence is simple: the greater the discrepancy between what the model 'thinks' and what is actually the case, the higher the uncertainty.

Зачем важна оценка неопределенности на практике

Why This Matters in Practice

A good uncertainty metric is not just of academic interest. It directly impacts how AI systems are used in sensitive domains.

Imagine a bank that uses a model to assess creditworthiness. If an application comes from a person with an unconventional financial history – one the model has never encountered – the model should say, «I'm not sure.» The decision can then be passed on to a human specialist. Without a reliable uncertainty estimate, this doesn't happen: the model simply gives an answer, and no one knows how much it can be trusted.

The same applies to medicine, insurance, and security systems – anywhere the cost of an error is high.

Новый подход к оценке неопределенности в ИИ

What's New About This Approach?

There are many methods for uncertainty estimation. Popular ones include Bayesian approaches, ensemble methods where several models 'vote' on the answer, and other techniques. Each has its pros and cons.

The feature gap approach is interesting because it looks not at the model's external behavior, but at its internal state. It's like the difference between asking someone, «Are you sure?» and observing how they think. The second method can provide a more honest picture.

Furthermore, the authors provide a rigorous mathematical definition for this uncertainty – through the cross-entropy between the predictive and true distributions. This allows us not just to intuitively feel that 'something is wrong,' but to measure it quantitatively.

Открытые вопросы в оценке неопределенности ИИ

Open Questions

Like any research, this work doesn't close the topic completely. One of the key challenges is that the 'true distribution' of data in real-world tasks is unknown. This isn't just a technical detail: the quality of the entire metric depends on how accurately it can be approximated.

The question of how well the approach performs on different types of tasks – where data is highly heterogeneous or where the boundary between 'familiar' and 'unfamiliar' is blurred – also remains open.

Nevertheless, the core idea of looking at uncertainty through the model's internal gaps seems promising. If the approach proves its reliability in broader experiments, it could become a useful tool for anyone building systems that need not only to provide answers but also to understand the limits of their knowledge.

Original Title: Uncertainty as feature gaps
Publication Date: Apr 23, 2026
Capital One www.capitalone.com A U.S.-based financial technology corporation applying artificial intelligence and machine learning to banking services, data analytics, and financial process automation.
Previous Article Nacos 3.2 and Defending Against Malicious AI Skills: Why Enterprises Need a Private Registry Next Article The AI Agent Within: A New Face for the Familiar PC

Related Publications

You May Also Like

Explore Other Events

Events are only part of the bigger picture. These materials help you see more broadly: the context, the consequences, and the ideas behind the news.

From Source to Analysis

How This Text Was Created

This material is not a direct retelling of the original publication. First, the news item itself was selected as an event important for understanding AI development. Then a processing framework was set: what needs clarification, what context to add, and where to place emphasis. This allowed us to turn a single announcement or update into a coherent and meaningful analysis.

Neural Networks Involved in the Process

We openly show which models were used at different stages of processing. Each performed its own role — analyzing the source, rewriting, fact-checking, and visual interpretation. This approach maintains transparency and clearly demonstrates how technologies participated in creating the material.

1.
Claude Sonnet 4.6 Anthropic Analyzing the Original Publication and Writing the Text The neural network studies the original material and generates a coherent text

1. Analyzing the Original Publication and Writing the Text

The neural network studies the original material and generates a coherent text

Claude Sonnet 4.6 Anthropic
2.
Gemini 2.5 Pro Google DeepMind step.translate-en.title

2. step.translate-en.title

Gemini 2.5 Pro Google DeepMind
3.
Gemini 2.5 Flash Google DeepMind Text Review and Editing Correction of errors, inaccuracies, and ambiguous phrasing

3. Text Review and Editing

Correction of errors, inaccuracies, and ambiguous phrasing

Gemini 2.5 Flash Google DeepMind
4.
DeepSeek-V3.2 DeepSeek Preparing the Illustration Description Generating a textual prompt for the visual model

4. Preparing the Illustration Description

Generating a textual prompt for the visual model

DeepSeek-V3.2 DeepSeek
5.
FLUX.2 Pro Black Forest Labs Creating the Illustration Generating an image based on the prepared prompt

5. Creating the Illustration

Generating an image based on the prepared prompt

FLUX.2 Pro Black Forest Labs

Want to know about new
experiments first?

Subscribe to our Telegram channel — we share all the latest
and exciting updates from NeuraBooks.

Subscribe