Published on March 21, 2026

Databricks запустила serverless-доступ к GPU NVIDIA для обучения ИИ

Databricks Launches Cloud Access to NVIDIA GPUs – No Server Setup or Infrastructure Management Required

Databricks has introduced AI Runtime – an environment for training and fine-tuning models on NVIDIA GPUs without the need to deploy your own infrastructure.

Infrastructure 4 – 5 minutes min read

Event Source: Databricks 4 – 5 minutes min read

Training neural networks, even relatively small ones, requires significant computational resources. Most often, this means GPUs – specialized processors capable of processing vast amounts of data in parallel. Without them, modern AI simply wouldn't work, whether for forecasting tasks, recommendation systems, or especially for training large language or multimodal models.

The problem is that access to these resources has traditionally been complex. You either have to buy your own hardware or rent cloud clusters. In both cases, a significant amount of effort goes not into working with the model itself, but into configuring, scaling, and managing the infrastructure. Databricks decided to simplify this process by introducing AI Runtime – an environment where NVIDIA GPUs are available in a serverless mode, meaning there's no need to deploy and maintain your own servers.

Что такое serverless и его значение для GPU-вычислений

What Serverless Is – And Why It Matters

In short, serverless means that the user works directly with computational power without thinking about how it's all set up “under the hood.” There's no need to rent a cluster in advance, configure it, monitor its load, or pay for idle time. Resources are allocated on demand and released once the task is complete.

This isn't a new concept for general computing, but when it comes to GPUs for AI training, it's relatively rare. GPU resources have historically been “heavy”: expensive, difficult to manage, and not easily scalable on the fly. AI Runtime aims to change exactly that.

Возможности AI Runtime Databricks для обучения моделей

What AI Runtime Can Do

The environment is designed for two main scenarios: training models from scratch and fine-tuning existing ones – that is, adapting a pre-trained model for a specific task or dataset. Both processes require GPUs, and both are now available within the Databricks platform without needing to go elsewhere.

A key feature is scalability. If a task is small, minimal resources are allocated. If more data needs to be processed or a larger model needs to be trained, the system scales automatically. The user doesn't have to handle this manually.

Simply put, this is an attempt to do for GPU computing what cloud platforms have long done for regular servers: remove the operational complexity and leave just the tool itself.

Применение AI Runtime для команд, работающих с данными

Why This Matters for Data Teams

Databricks is, first and foremost, a platform for data and analytics. A significant portion of its users are data engineers, analysts, and ML specialists who already store and process data within the ecosystem. Previously, to move from data to model training, you had to either build a separate pipeline with a GPU cluster or transfer the data to an external environment. Now, that step is eliminated – everything happens in one place.

This is especially relevant for companies that want to fine-tune models on their own corporate data – for example, adapting a language model to internal documentation or training a forecasting model on their transaction history. Previously, this required separate infrastructure. Now, it doesn't.

Почему выбор GPU NVIDIA в основе AI Runtime важен

NVIDIA Inside – It's Not Just Marketing

The choice of NVIDIA GPUs as the foundation is no accident. These processors have become the de facto standard for training AI models, with most popular frameworks and libraries optimized specifically for them. Using NVIDIA hardware in a serverless environment means users aren't just getting “some GPUs”, but the exact architecture the modern AI stack is built for.

This reduces the risk of incompatibility and simplifies migrating existing workflows to the new environment.

Ограничения serverless-подхода Databricks AI Runtime

What's the Catch?

The serverless approach is convenient, but it has a downside. When the infrastructure is hidden, the user loses some control over it. For tasks that require precise environment configuration, fixed hardware specifications, or special data security requirements, serverless may not be the best choice.

Furthermore, it's not yet entirely clear how AI Runtime handles truly massive tasks – for example, training large models with hundreds of billions of parameters. Serverless works well at a medium scale, but the upper limit of its capabilities remains an open question.

Nevertheless, for most practical tasks – like fine-tuning medium-sized models, running experiments, forecasting, and building recommendation systems – this looks like a genuine simplification of the workflow. Less infrastructure work, more time for what really matters.

#event #applied analysis #neural networks #ai training #infrastructure #scaling #gpu optimization #model training optimization

Link to Original: https://www.databricks.com/blog/introducing-ai-runtime-scalable-serverless-nvidia-gpus-databricks-training-and-finetuning

Original Title: Introducing AI Runtime: Scalable, Serverless NVIDIA GPUs on Databricks for Training and Finetuning

Publication Date: Mar 19, 2026

Databricks www.databricks.com A U.S.-based platform for data analytics and machine learning built on a Lakehouse architecture.

Previous Article How OpenAI Keeps Its AI Agents from Going 'Off Course' Next Article LG AI Research Publishes 2025 AI Ethics Accountability Report

Databricks запустила serverless-доступ к GPU NVIDIA для обучения ИИ

Что такое serverless и его значение для GPU-вычислений

Возможности AI Runtime Databricks для обучения моделей

Применение AI Runtime для команд, работающих с данными

Почему выбор GPU NVIDIA в основе AI Runtime важен

Ограничения serverless-подхода Databricks AI Runtime

Related Publications

MiniMax Introduces Forge: A Platform for Training AI Agents on Powerful Computing Clusters

OpenShift 4.21: Simplifying AI Workloads on the Red Hat Platform

Tencent Open-Sources HPC-Ops Library: How to Accelerate Large Model Inference by 30%

From Source to Analysis

Neural Networks Involved in the Process

1. Analyzing the Original Publication and Writing the Text

2. step.translate-en.title

3. Text Review and Editing

4. Preparing the Illustration Description

5. Creating the Illustration