Published on March 21, 2026

Databricks Launches Serverless Cloud Access to NVIDIA GPUs for AI Training – No Server Setup or Infrastructure Management Required

Databricks has introduced AI Runtime – an environment for training and fine-tuning models on NVIDIA GPUs without the need to deploy your own infrastructure.

Infrastructure | Event Source: Databricks | 4–5 min read

Training neural networks, even relatively small ones, requires significant computational resources. Most often, this means GPUs: specialized processors that can handle vast amounts of data in parallel. Without them, modern AI would be impractical, whether for forecasting, recommendation systems, or, above all, training large language and multimodal models.

The problem is that access to these resources has traditionally been complex. You either have to buy your own hardware or rent cloud clusters. In both cases, a significant amount of effort goes not into working with the model itself, but into configuring, scaling, and managing the infrastructure. Databricks decided to simplify this process by introducing AI Runtime – an environment where NVIDIA GPUs are available in a serverless mode, meaning there's no need to deploy and maintain your own servers.

What Serverless Is – And Why It Matters for GPU Computing

In short, serverless means that the user works directly with computational power without thinking about how it's all set up “under the hood.” There's no need to rent a cluster in advance, configure it, monitor its load, or pay for idle time. Resources are allocated on demand and released once the task is complete.
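To make the "no paying for idle time" point concrete, here is a minimal sketch comparing a cluster that is kept running with on-demand billing. The hourly rate, the reserved period, and the job durations are hypothetical numbers chosen only for illustration; they are not Databricks prices, and the two functions are not part of any real API.

```python
def always_on_cost(hourly_rate: float, hours_reserved: float) -> float:
    """Cost of a reserved cluster: every reserved hour is billed, busy or idle."""
    return hourly_rate * hours_reserved


def on_demand_cost(hourly_rate: float, job_hours: list[float]) -> float:
    """Cost under on-demand billing: only hours spent running jobs are billed."""
    return hourly_rate * sum(job_hours)


# Hypothetical numbers, for illustration only.
RATE = 4.0                    # assumed price per GPU-hour
RESERVED_HOURS = 24 * 7       # a cluster kept up for one week
JOB_HOURS = [3.0, 5.5, 2.0]   # three training runs during that week

reserved = always_on_cost(RATE, RESERVED_HOURS)
serverless = on_demand_cost(RATE, JOB_HOURS)
idle_share = 1 - sum(JOB_HOURS) / RESERVED_HOURS

print(f"reserved cluster: {reserved:.2f}")
print(f"on-demand:        {serverless:.2f}")
print(f"idle share of reserved time: {idle_share:.1%}")
```

The sketch uses the same hourly rate in both cases to isolate the effect of idle time. In practice on-demand capacity is often priced higher per hour, so the real gap depends on how heavily the hardware is used.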

This isn't a new concept for general computing, but when it comes to GPUs for AI training, it's relatively rare. GPU resources have historically been “heavy”: expensive, difficult to manage, and not easily scalable on the fly. AI Runtime aims to change exactly that.

What AI Runtime Can Do

The environment is designed for two main scenarios: training models from scratch and fine-tuning existing ones – that is, adapting a pre-trained model for a specific task or dataset. Both processes require GPUs, and both are now available within the Databricks platform without needing to go elsewhere.
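The difference between the two scenarios can be shown with a toy example. The sketch below fits a one-variable linear model with plain gradient descent twice: once from zero-initialized parameters ("from scratch") and once from parameters that are already close to the target ("pre-trained"). The data, the starting values, and the step count are all made up for illustration; a real fine-tuning job would use a deep learning framework and a far larger model.

```python
def mse(w, b, data):
    """Mean squared error of the model y = w * x + b on the data."""
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)


def train(w, b, data, steps, lr):
    """Run a fixed number of full-batch gradient descent steps."""
    n = len(data)
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Hypothetical task-specific dataset following y = 2.1 * x + 0.9.
task_data = [(x, 2.1 * x + 0.9) for x in range(5)]

PRETRAINED = (2.0, 1.0)  # assumed weights learned earlier on a similar task
SCRATCH = (0.0, 0.0)     # untrained starting point

finetuned = train(*PRETRAINED, task_data, steps=20, lr=0.01)
from_scratch = train(*SCRATCH, task_data, steps=20, lr=0.01)

loss_finetuned = mse(*finetuned, task_data)
loss_scratch = mse(*from_scratch, task_data)
print(f"fine-tuned loss:   {loss_finetuned:.4f}")
print(f"from-scratch loss: {loss_scratch:.4f}")
```

With the same small training budget, the run that starts from the pre-trained parameters ends at a lower loss, which is the practical appeal of fine-tuning over training from scratch.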

A key feature is scalability. If a task is small, minimal resources are allocated. If more data needs to be processed or a larger model needs to be trained, the system scales automatically. The user doesn't have to handle this manually.
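The announcement does not describe how the scaling decision is made, so the following is only a sketch of the general idea under simple assumptions: throughput grows linearly with the number of GPUs, and the platform picks the smallest GPU count that meets a target run time, up to a cap. The function `gpus_needed` and all of its parameters are hypothetical.

```python
import math


def gpus_needed(total_samples: int, samples_per_gpu_hour: int,
                target_hours: float, max_gpus: int) -> int:
    """Smallest GPU count that finishes within target_hours, capped at max_gpus.

    Assumes perfect linear scaling, which real training jobs rarely achieve.
    """
    if total_samples <= 0:
        return 0
    gpu_hours = total_samples / samples_per_gpu_hour
    wanted = math.ceil(gpu_hours / target_hours)
    return max(1, min(wanted, max_gpus))


# A small job fits on one GPU; larger ones scale out until the cap is hit.
small = gpus_needed(50_000, 100_000, target_hours=1.0, max_gpus=8)
medium = gpus_needed(1_000_000, 100_000, target_hours=2.0, max_gpus=8)
large = gpus_needed(10_000_000, 100_000, target_hours=2.0, max_gpus=8)
print(small, medium, large)
```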

Simply put, this is an attempt to do for GPU computing what cloud platforms have long done for regular servers: remove the operational complexity and leave just the tool itself.

Why This Matters for Data Teams

Databricks is, first and foremost, a platform for data and analytics. A significant portion of its users are data engineers, analysts, and ML specialists who already store and process data within the ecosystem. Previously, to move from data to model training, you had to either build a separate pipeline with a GPU cluster or transfer the data to an external environment. Now, that step is eliminated – everything happens in one place.

This is especially relevant for companies that want to fine-tune models on their own corporate data – for example, adapting a language model to internal documentation or training a forecasting model on their transaction history. Previously, this required separate infrastructure. Now, it doesn't.

NVIDIA Inside – It's Not Just Marketing

The choice of NVIDIA GPUs as the foundation is no accident. These processors have become the de facto standard for training AI models, with most popular frameworks and libraries optimized specifically for them. Using NVIDIA hardware in a serverless environment means users aren't just getting “some GPUs”, but the exact architecture the modern AI stack is built for.

This reduces the risk of incompatibility and simplifies migrating existing workflows to the new environment.

What's the Catch?

The serverless approach is convenient, but it has a downside. When the infrastructure is hidden, the user loses some control over it. For tasks that require precise environment configuration, fixed hardware specifications, or special data security requirements, serverless may not be the best choice.

Furthermore, it's not yet entirely clear how AI Runtime handles truly massive tasks – for example, training large models with hundreds of billions of parameters. Serverless works well at a medium scale, but the upper limit of its capabilities remains an open question.
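A rough calculation shows why this scale is a different class of problem. The sketch below assumes mixed-precision training with the Adam optimizer, where model states take about 16 bytes per parameter (2 for half-precision weights, 2 for gradients, and 12 for the full-precision weight copy plus two optimizer moments), and an accelerator with 80 GB of memory. Both figures are assumptions for illustration and say nothing about the hardware behind AI Runtime. Activations and other buffers are ignored, so real requirements are higher.

```python
import math

# Assumed bytes per parameter: fp16 weights + fp16 gradients
# + fp32 master weights + two fp32 Adam moments.
BYTES_PER_PARAM = 2 + 2 + 4 + 4 + 4  # 16 bytes


def model_state_gb(num_params: int) -> float:
    """Memory for weights, gradients, and optimizer states, in GB (10^9 bytes)."""
    return num_params * BYTES_PER_PARAM / 1e9


def min_gpus(num_params: int, gpu_memory_gb: float = 80.0) -> int:
    """Lower bound on GPUs needed just to hold the model states."""
    return math.ceil(model_state_gb(num_params) / gpu_memory_gb)


for name, params in [("7B", 7_000_000_000), ("100B", 100_000_000_000)]:
    print(f"{name}: {model_state_gb(params):.0f} GB of model state, "
          f"at least {min_gpus(params)} GPUs")
```

Under these assumptions a 7-billion-parameter model needs on the order of a hundred gigabytes of model state, while a 100-billion-parameter model needs well over a terabyte spread across dozens of devices, which is where automatic scaling has the most to prove.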

Nevertheless, for most practical tasks – like fine-tuning medium-sized models, running experiments, forecasting, and building recommendation systems – this looks like a genuine simplification of the workflow. Less infrastructure work, more time for what really matters.

Original Title: Introducing AI Runtime: Scalable, Serverless NVIDIA GPUs on Databricks for Training and Finetuning
Publication Date: Mar 19, 2026
Source: Databricks (www.databricks.com), a U.S.-based platform for data analytics and machine learning built on a Lakehouse architecture.

From Source to Analysis

How This Text Was Created

This material is not a direct retelling of the original publication. First, the news item itself was selected as an event important for understanding AI development. Then a processing framework was set: what needs clarification, what context to add, and where to place emphasis. This allowed us to turn a single announcement or update into a coherent and meaningful analysis.

Neural Networks Involved in the Process

We openly show which models were used at different stages of processing. Each performed its own role — analyzing the source, rewriting, fact-checking, and visual interpretation. This approach maintains transparency and clearly demonstrates how technologies participated in creating the material.

1. Analyzing the Original Publication and Writing the Text (Claude Sonnet 4.6, Anthropic): the neural network studies the original material and generates a coherent text.
2. Translation into English (Gemini 2.5 Pro, Google DeepMind).
3. Text Review and Editing (Gemini 2.5 Flash, Google DeepMind): correction of errors, inaccuracies, and ambiguous phrasing.
4. Preparing the Illustration Description (DeepSeek-V3.2, DeepSeek): generating a textual prompt for the visual model.
5. Creating the Illustration (FLUX.2 Pro, Black Forest Labs): generating an image based on the prepared prompt.
