Published on March 17, 2026

Red Hat and NVIDIA: Nemotron Models in AI Factory from Day One

Red Hat and NVIDIA: Nemotron Models Available in AI Factory from Day One

Red Hat and NVIDIA are expanding their collaboration: open models from the Nemotron family now have day-one support on the AI Factory platform.

Infrastructure 4 – 5 minutes min read
Event Source: Red Hat 4 – 5 minutes min read

Red Hat and NVIDIA have a long-standing collaboration to enable enterprise customers to use generative AI without having to build everything from scratch. This partnership recently saw a significant development: the Red Hat AI Factory platform now supports the open NVIDIA Nemotron family of models – with what's known as “day-zero” support, meaning from the very moment the models are released.

What Is Red Hat AI Factory and Why Is It Needed?

Simply put, Red Hat AI Factory is a suite of tools and services that helps companies run and use AI models within their own infrastructure. Not in a third-party provider's cloud, but on-premise – with full control over data, security settings, and the ability to adapt to specific business needs.

This is crucial for those working in regulated industries like finance, healthcare, and the public sector. In these fields, you can't just send corporate data to a public cloud service and get a response from a model. It's essential for everything to operate within a controlled perimeter.

Nemotron: What Are These Models?

Nemotron – What Are These Models?

The Nemotron family consists of open language models from NVIDIA. “Open” here means that their weights are available for download and use, not just through an API. This allows users to not only use the model but also to fine-tune it on their own data, optimize it for specific hardware, or embed it into their own products.

NVIDIA is developing Nemotron with a focus on enterprise applications: the models are designed for efficient performance specifically on NVIDIA hardware and are validated against business-critical requirements such as accuracy, controllability, and output safety.

Day-One Support: Why It Matters?

Day-One Support – Why Does It Matter?

“Day-zero” support means that new models from the Nemotron family appear in Red Hat AI Factory simultaneously with their official release – without any delay for integration, compatibility testing, or manual configuration. Simply put, companies don't have to wait for someone to “fix” the compatibility between the platform and the new model.

For enterprise users, this reduces friction: a new model can be tested immediately in a production environment without worrying that something might break at the infrastructure level.

Open Source as a Core Principle

Open Source as a Principle, Not a Marketing Tactic

It's also worth noting that this entire stack is built on open-source software. Red Hat has historically adhered to open source as a core principle, and AI Factory is no exception. This means companies don't get a closed “black box” solution, but rather an assemblable and reproducible chain, where every link can be verified, replaced, or adapted.

For enterprises that value transparency and independence from a single vendor, this is fundamental. Here, openness is not just a slogan but a practical way to avoid “vendor lock-in” on a specific platform or provider.

What Do These Changes Mean in Practice?

What Does This Change in Practice?

Looking at the big picture, Red Hat and NVIDIA are jointly creating a kind of “enterprise pathway” for generative AI. On one hand, you have NVIDIA's models, optimized for their hardware and vetted for business use. On the other, you have Red Hat's infrastructure, which allows all of this to be deployed within an enterprise, complete with management, security, and support.

Previously, companies wanting to work with powerful open models in their own infrastructure faced a significant barrier to entry: they had to independently figure out compatibility, optimization, and environment setup. Now, this path is becoming more direct and predictable.

This doesn't mean all challenges have disappeared. Enterprise AI still requires serious data work, customization for specific tasks, and validation against internal requirements. But the infrastructure layer – how models are delivered and run – is becoming less of a pain point.

Why Are Open Models Gaining Traction?

Context: Why Are Open Models Gaining Traction?

Furthermore, it's worth noting a broader trend. More and more large companies are looking toward open models not because they are “free,” but because they offer control. When a model can be deployed locally, fine-tuned, and verified, it provides a different level of confidence in the system compared to working with a closed external service.

NVIDIA with Nemotron and Red Hat with AI Factory are betting on this exact demand. And judging by how the enterprise AI market is developing, this demand is only set to grow.

Original Title: Bringing Nemotron models to the Red Hat AI Factory with NVIDIA
Publication Date: Mar 16, 2026
Red Hat www.redhat.com Global company developing open software platforms and infrastructure solutions with AI support.
Previous Article TabPFN v2 in Driverless AI: What's New for Working with Tabular Data Next Article DynaGuard: Flexible AI Protection That Adapts to Your Rules

Related Publications

You May Also Like

Explore Other Events

Events are only part of the bigger picture. These materials help you see more broadly: the context, the consequences, and the ideas behind the news.

A year has passed since DeepSeek demonstrated that powerful models can be created without billion-dollar budgets – and the industry hasn't been the same since.

Hugging Facehuggingface.co Feb 3, 2026

From Source to Analysis

How This Text Was Created

This material is not a direct retelling of the original publication. First, the news item itself was selected as an event important for understanding AI development. Then a processing framework was set: what needs clarification, what context to add, and where to place emphasis. This allowed us to turn a single announcement or update into a coherent and meaningful analysis.

Neural Networks Involved in the Process

We openly show which models were used at different stages of processing. Each performed its own role — analyzing the source, rewriting, fact-checking, and visual interpretation. This approach maintains transparency and clearly demonstrates how technologies participated in creating the material.

1.
Claude Sonnet 4.6 Anthropic Analyzing the Original Publication and Writing the Text The neural network studies the original material and generates a coherent text

1. Analyzing the Original Publication and Writing the Text

The neural network studies the original material and generates a coherent text

Claude Sonnet 4.6 Anthropic
2.
Gemini 2.5 Pro Google DeepMind step.translate-en.title

2. step.translate-en.title

Gemini 2.5 Pro Google DeepMind
3.
Gemini 2.5 Flash Google DeepMind Text Review and Editing Correction of errors, inaccuracies, and ambiguous phrasing

3. Text Review and Editing

Correction of errors, inaccuracies, and ambiguous phrasing

Gemini 2.5 Flash Google DeepMind
4.
DeepSeek-V3.2 DeepSeek Preparing the Illustration Description Generating a textual prompt for the visual model

4. Preparing the Illustration Description

Generating a textual prompt for the visual model

DeepSeek-V3.2 DeepSeek
5.
FLUX.2 Pro Black Forest Labs Creating the Illustration Generating an image based on the prepared prompt

5. Creating the Illustration

Generating an image based on the prepared prompt

FLUX.2 Pro Black Forest Labs

Don’t miss a single experiment!

Subscribe to our Telegram channel —
we regularly post announcements of new books, articles, and interviews.

Subscribe