Published on April 4, 2026

AiChemy: Multi-Agent AI in Drug Discovery

AiChemy: How Multi-Agent AI is Changing Drug Discovery

Databricks has introduced AiChemy – an AI system designed to accelerate drug development by combining multiple agents, scientific databases, and analysis tools.

Medicine 4 – 6 minutes min read
Event Source: Databricks 4 – 6 minutes min read

Developing new drugs is one of the most complex and costly processes in science. The journey from an idea to a finished product can take years, sometimes even decades. At every stage, researchers must work with vast amounts of data: molecular structures, clinical trials, patent databases, and laboratory experiment results. All of this occurs in an environment where crucial knowledge is scattered across hundreds of sources, written in the different “languages” of various disciplines.

To address this very challenge, Databricks has introduced AiChemy – a multi-agent AI tool designed to help scientists navigate this ocean of information more quickly and make more informed decisions during the early stages of research.

Multi-Agent AI Architecture

Not One Agent, But a Team

Simply put, AiChemy is not a single AI assistant, but several specialized agents working together. Each is responsible for its own domain: one understands chemical structures, another analyzes biological data, and a third can work with medical literature and patents.

This architecture is reminiscent of a research group where each member has their own specialization, but they all coordinate their efforts to achieve a common goal. When a scientist asks a question, such as, “What molecules can interact with a specific protein associated with Alzheimer's disease?” the system distributes the task among the agents. Each agent contributes its part of the answer, and the results are then combined into a cohesive output.

This is especially important in pharmaceutical research, where chemistry, biology, medicine, and toxicology are so tightly intertwined that no single specialist can physically hold all the relevant knowledge in their mind at once.

Integrating Proprietary and Public Data

Proprietary Data + Global Science

One of AiChemy's key features is its ability to work not only with public scientific databases but also with an organization's internal data. This is a crucial point for pharmaceutical companies, as they have years' worth of their own experiments, unpublished findings, and corporate knowledge bases that can't simply be “Googled.”

AiChemy allows this data to be integrated into the system and used on par with publicly available sources. In other words, the AI agents can simultaneously consider information from scientific journals and the company's own accumulated research, providing recommendations that take both datasets into account.

Modular Skills for AI Agents

Skills as Building Blocks

Another important element of the system is so-called skills. These are pre-prepared modules that agents can use to perform specific tasks: predicting a compound's toxicity, evaluating its bioavailability, searching for similar molecules in a database, and so on.

This approach avoids “reinventing the wheel” each time. If a task is well-described and solvable by a known method, the agent simply calls the necessary tool. This accelerates the work and reduces the risk of errors that might arise if the AI attempts to reason where a reliable algorithm already exists.

MCP Protocol for Agent Coordination

The Protocol That Ties It All Together

To coordinate between agents and external tools, AiChemy uses MCP – a protocol that allows the system's different components to “talk” to each other in a common language. Without getting too technical, it's akin to a standardized interface: agents know how to query tools and databases without needing a separate configuration for each source.

In practice, this means the system can be expanded – by connecting new databases, analysis tools, and other sources – without having to rewrite everything from scratch. For research organizations whose needs evolve as a project advances, this is a major advantage.

The Urgent Need for AI in Drug Development

Why This Is Needed Right Now

Drug development is facing a unique productivity crisis: despite rising investment, the number of approved drugs relative to the funds invested has been declining for decades. One reason is the colossal amount of information that needs to be processed in the early stages, when unpromising candidates are weeded out.

AI systems like AiChemy do not replace scientists but rather assume the role of an “intelligent analyst” – one capable of quickly processing thousands of publications, correlating data from various sources, and proposing avenues worth exploring. This is especially valuable during the so-called hit-to-lead phase, when the most promising molecules must be selected from a vast pool of candidates for further in-depth study.

Challenges and Future of Multi-Agent Systems

Open Questions

Despite the appeal of this approach, it is important to recognize that multi-agent systems in scientific research are still relatively uncharted territory. Questions regarding the reproducibility of results, how agents justify their conclusions, and the extent to which their recommendations align with real-world experimental practices remain open.

Furthermore, the integration of corporate data with public databases raises concerns about confidentiality and security – especially in the pharmaceutical industry, where internal developments hold significant commercial value.

AiChemy looks like a step in the right direction: an attempt to bring disparate tools and sources together into a single, coordinated system that speaks the language of scientists, not programmers. How well it works in practice, only time – and, most importantly, real laboratory results – will tell.

Original Title: AiChemy: Next-Generation Agent with MCP, Skills and Custom Data for Drug Discovery
Publication Date: Apr 3, 2026
Databricks www.databricks.com A U.S.-based platform for data analytics and machine learning built on a Lakehouse architecture.
Previous Article Agent Mesh vs. Legacy Code: How Red Hat Is Using AI to Modernize Old Systems Next Article How Large Language Models Are Trained: Unveiling the Engineering Beyond Scaling

Related Publications

You May Also Like

Explore Other Events

Events are only part of the bigger picture. These materials help you see more broadly: the context, the consequences, and the ideas behind the news.

From Source to Analysis

How This Text Was Created

This material is not a direct retelling of the original publication. First, the news item itself was selected as an event important for understanding AI development. Then a processing framework was set: what needs clarification, what context to add, and where to place emphasis. This allowed us to turn a single announcement or update into a coherent and meaningful analysis.

Neural Networks Involved in the Process

We openly show which models were used at different stages of processing. Each performed its own role — analyzing the source, rewriting, fact-checking, and visual interpretation. This approach maintains transparency and clearly demonstrates how technologies participated in creating the material.

1.
Claude Sonnet 4.6 Anthropic Analyzing the Original Publication and Writing the Text The neural network studies the original material and generates a coherent text

1. Analyzing the Original Publication and Writing the Text

The neural network studies the original material and generates a coherent text

Claude Sonnet 4.6 Anthropic
2.
Gemini 2.5 Pro Google DeepMind step.translate-en.title

2. step.translate-en.title

Gemini 2.5 Pro Google DeepMind
3.
Gemini 2.5 Flash Google DeepMind Text Review and Editing Correction of errors, inaccuracies, and ambiguous phrasing

3. Text Review and Editing

Correction of errors, inaccuracies, and ambiguous phrasing

Gemini 2.5 Flash Google DeepMind
4.
DeepSeek-V3.2 DeepSeek Preparing the Illustration Description Generating a textual prompt for the visual model

4. Preparing the Illustration Description

Generating a textual prompt for the visual model

DeepSeek-V3.2 DeepSeek
5.
FLUX.2 Pro Black Forest Labs Creating the Illustration Generating an image based on the prepared prompt

5. Creating the Illustration

Generating an image based on the prepared prompt

FLUX.2 Pro Black Forest Labs

Don’t miss a single experiment!

Subscribe to our Telegram channel —
we regularly post announcements of new books, articles, and interviews.

Subscribe