Published on March 24, 2026

Adapting Large AI Models for Multiple Languages and Cultures The Sakana AI Approach

How to Adapt a Large AI Model for Dozens of Languages and Cultures: The Sakana AI Approach

Japanese lab Sakana AI has developed a technology to adapt large, general-purpose language models for specific languages and cultures.

Research 4 – 5 minutes min read
Event Source: Sakana AI 4 – 5 minutes min read

Today, large language models are trained on vast amounts of text, the majority of which is, by default, in English. This means the model “thinks” well in English and treats everything else as secondary. While languages like Japanese, Arabic, Swahili, and Hindi are formally supported, in practice, the model often struggles with nuances, misses cultural context, or simply provides worse responses than in English.

The Japanese research lab Sakana AI has decided to tackle this very problem. Its team developed an approach called Namazu–a post-training technology that allows taking an already-built large model and “fine-tuning” it for a specific language and culture, without retraining it from scratch.

What Does Model Adaptation Mean

What Does It Mean to “Adapt” a Model?

Imagine you have a highly educated person who has spent their entire life reading only American books, watching American movies, and working for American companies. They might speak Japanese, but their understanding of how life works, how people should interact, and what is considered polite versus rude would be shaped by a completely different culture. And you'd be able to tell.

Something similar happens with language models. Even if a model technically supports Japanese, it might respond in a way that feels like a “translation from English” rather than natural Japanese speech. It may not know local realities, understand subtext, or take into account the communication norms specific to a country.

Adaptation is precisely about working with this layer: not with the model's foundational knowledge, but with how it communicates, what it deems important, the examples it gives, and how it constructs an answer.

Namazu Fine Tune Dont Retrain

Namazu: Fine-Tune, Don't Retrain

The key idea behind Namazu is not to create a separate model for each language from scratch but to work with existing large, open-source models. To put it simply, you take a ready-made foundation and “tune” it to the required context through post-training.

This is important for several reasons. First, training a large model from scratch is expensive and time-consuming. Second, large models already have a vast knowledge base that you don't want to lose. The goal of post-training is to add desirable qualities without breaking what already works.

Namazu is specifically focused on the largest open-source models–that is, those available to a wide range of developers and researchers, not just major tech companies. This is a crucial point: developers from Japan, India, African nations, or Latin America can use this approach without needing access to closed, proprietary systems.

Why Adapting AI Models Is Important Now

Why This Matters Right Now

For the past few years, the AI industry has been in a race for scale: who can build a bigger, smarter, more powerful model. This has yielded results–models have indeed become significantly better. But simultaneously, a problem has emerged: the larger the model, the more difficult and expensive it becomes to adapt it for specific needs.

Meanwhile, the need for localization hasn't disappeared. In many countries, AI tools either perform poorly in the local language or aren't available in it at all. This creates an obvious gap: the technology exists, but not everyone can take full advantage of it.

Sakana AI's approach offers a way to bridge this gap. If post-training can indeed effectively adapt large models for different languages and cultures, it paves the way for creating quality local AI tools without having to build everything from scratch every time.

Unanswered Questions About AI Model Adaptation

What Questions Remain

For now, Namazu is in its alpha version–an honest acknowledgment that the technology is still developing. How well the approach scales to genuinely diverse languages, especially those that differ significantly from English in structure and cultural context, remains to be tested in practice.

Another open question is how deep this adaptation can be. Post-training is a powerful tool, but it has its limits: if the base model has seen almost no text in a particular language, no amount of fine-tuning will make it truly strong in it.

Nevertheless, the framing of the problem itself seems correct. General-purpose models are great, but the world speaks hundreds of languages, and each has its own logic, its own context, and its own expectations for interacting with a machine. A technology that helps take this into account is not just a technical improvement, but a step toward making AI truly useful to a greater number of people.

Original Title: 最大規模のオープン基盤モデルを各国仕様へ適応させる事後学習技術を開発
Publication Date: Mar 23, 2026
Sakana AI sakana.ai A Japanese research company exploring evolutionary approaches and self-learning AI systems.
Previous Article Oracle Launches Fusion Agentic Applications: When AI Agents Take Charge Next Article Sber Updates GigaChat: What's Changed and Why It Matters

Related Publications

You May Also Like

Explore Other Events

Events are only part of the bigger picture. These materials help you see more broadly: the context, the consequences, and the ideas behind the news.

From Source to Analysis

How This Text Was Created

This material is not a direct retelling of the original publication. First, the news item itself was selected as an event important for understanding AI development. Then a processing framework was set: what needs clarification, what context to add, and where to place emphasis. This allowed us to turn a single announcement or update into a coherent and meaningful analysis.

Neural Networks Involved in the Process

We openly show which models were used at different stages of processing. Each performed its own role — analyzing the source, rewriting, fact-checking, and visual interpretation. This approach maintains transparency and clearly demonstrates how technologies participated in creating the material.

1.
Claude Sonnet 4.6 Anthropic Analyzing the Original Publication and Writing the Text The neural network studies the original material and generates a coherent text

1. Analyzing the Original Publication and Writing the Text

The neural network studies the original material and generates a coherent text

Claude Sonnet 4.6 Anthropic
2.
Gemini 2.5 Pro Google DeepMind step.translate-en.title

2. step.translate-en.title

Gemini 2.5 Pro Google DeepMind
3.
Gemini 2.5 Flash Google DeepMind Text Review and Editing Correction of errors, inaccuracies, and ambiguous phrasing

3. Text Review and Editing

Correction of errors, inaccuracies, and ambiguous phrasing

Gemini 2.5 Flash Google DeepMind
4.
DeepSeek-V3.2 DeepSeek Preparing the Illustration Description Generating a textual prompt for the visual model

4. Preparing the Illustration Description

Generating a textual prompt for the visual model

DeepSeek-V3.2 DeepSeek
5.
FLUX.2 Pro Black Forest Labs Creating the Illustration Generating an image based on the prepared prompt

5. Creating the Illustration

Generating an image based on the prepared prompt

FLUX.2 Pro Black Forest Labs

Want to know about new
experiments first?

Subscribe to our Telegram channel — we share all the latest
and exciting updates from NeuraBooks.

Subscribe