Published on March 31, 2026

How a Japanese Railway Halved Its Inquiry Processing Time With AI

How a Japanese Railway Halved Its Inquiry Processing Time, and Why It Didn't Happen by Itself

Japanese company JR West and AI developer ELYZA explain how generative AI helped a hundred employees work faster, but only after careful fine-tuning.

Business 4 – 5 minutes min read
Event Source: ELYZA.inc 4 – 5 minutes min read

Implementing AI into a company's real-world operations is almost never a story of «just install the model, and everything works.» It's far more often a journey of months of fine-tuning, hundreds of revisions, and people gradually getting used to a new tool. This is precisely what the case of the Japanese railway company JR West and the language model developer ELYZA is about.

What Actually Happened

JR West is one of Japan's major railway operators. Its customer service department handles a large volume of inquiries: complaints, questions, and requests. Each inquiry needs not only to be addressed but also documented – by creating a brief summary, recording the gist of the conversation, and passing the information on.

They focused on this very stage – post-processing inquiries. The task sounds simple: let the AI automatically generate a summary of each customer contact. But between «sounds simple» and «works seamlessly for a hundred employees» lies a gap that most companies underestimate.

Why AI Implementation Required Lengthy Fine-Tuning

Why It Required Lengthy Fine-Tuning, Not Just «Plug and Play»

Generative models can summarize texts – that's a fact. But a summary that satisfies a model developer during a demo and a summary that satisfies a live call center operator in their eighth hour of a shift are two different things.

Customer service has its own specifics: industry-specific jargon, internal record formats, and nuances in phrasing that are crucial in this particular context. A model trained on general data initially produces results that are technically correct but practically inconvenient – they still have to be reworked manually.

This is why ELYZA and the JR West team didn't just «launch» the system but engaged in what is known as operational fine-tuning: iteratively adjusting the model for specific tasks and specific users. Simply put – they trained the model on real examples, collected feedback from employees, made revisions, and tested it again.

Operational Challenges of Implementing AI for a Hundred Employees

A Hundred People Is More Than Just «Scale»

A separate point worth noting is that the tool was implemented for about a hundred employees. This is important because it's at this scale that problems unnoticed during pilot testing with five people begin to surface.

Different people work differently. Some adapt quickly to the new tool, some resist it, and others use it in unintended ways. For the system to truly take root – not just be formally listed as «implemented» – attention had to be paid not only to the model itself but also to the process: how employees interact with the system, where friction arises, and what hinders its regular use.

This, by the way, is one of the most underrated aspects of implementing AI in a corporate environment. The technical part is only half the job. The other half is organizational: training, support, and process adaptation.

The Result A 50% Reduction in Post-Processing Time

The Result: A 50% Reduction in Post-Processing Time

After all this work – fine-tuning the model, tailoring it to specific tasks, and structuring the usage process – the time employees spent on post-processing inquiries was cut roughly in half.

This is a tangible result. Especially considering that it's not about replacing people but about easing their routine workload: instead of manually drafting a summary for each conversation, an operator receives a ready-made draft and just needs to review it. The work still falls to a human – but it becomes significantly less labor-intensive.

What We Can Learn From This Experience

What We Can Learn From This

The story of JR West and ELYZA isn't about breakthrough technology. Generative models for text summarization have been around for several years. It's a story about how technology is integrated into real-world work.

Here are a few observations that can be read between the lines:

  • A model isn't a product; it's a starting point. Without adaptation for a specific task, even a good model yields mediocre results in real-world conditions.
  • Scaling requires special attention. What works for ten people doesn't always work for a hundred – and vice versa.
  • Operational support is as important as technical support. Implementing AI is about changing a workflow, not just installing a program.
  • Results are measured in real metrics. Not in «model accuracy on a benchmark», but in how much time people spend on a specific task.

In a broader context, it's cases like these – without loud claims, but with concrete numbers and an honest description of the process – that are gradually shaping our understanding of how AI actually works in a corporate environment. Not as a magic button, but as a tool that requires investment and, with the right approach, pays it off.

Original Title: JR西日本カスタマーリレーションズとELYZA、運用整備とモデル改善で生成AI要約を100名規模の業務で定着させ後処理時間を50%削減
Publication Date: Mar 31, 2026
ELYZA.inc elyza.ai A Japanese company developing large language models and AI solutions for text processing.
Previous Article How Mobile Networks Are Getting Smarter: SK Telecom and NTT DOCOMO Release Joint White Paper on the Future of Radio Networks Next Article Five Thousand Companies and Growing: What's Happening with AI Analytics in Yandex DataLens

Related Publications

You May Also Like

Explore Other Events

Events are only part of the bigger picture. These materials help you see more broadly: the context, the consequences, and the ideas behind the news.

From Source to Analysis

How This Text Was Created

This material is not a direct retelling of the original publication. First, the news item itself was selected as an event important for understanding AI development. Then a processing framework was set: what needs clarification, what context to add, and where to place emphasis. This allowed us to turn a single announcement or update into a coherent and meaningful analysis.

Neural Networks Involved in the Process

We openly show which models were used at different stages of processing. Each performed its own role — analyzing the source, rewriting, fact-checking, and visual interpretation. This approach maintains transparency and clearly demonstrates how technologies participated in creating the material.

1.
Claude Sonnet 4.6 Anthropic Analyzing the Original Publication and Writing the Text The neural network studies the original material and generates a coherent text

1. Analyzing the Original Publication and Writing the Text

The neural network studies the original material and generates a coherent text

Claude Sonnet 4.6 Anthropic
2.
Gemini 2.5 Pro Google DeepMind step.translate-en.title

2. step.translate-en.title

Gemini 2.5 Pro Google DeepMind
3.
Gemini 2.5 Flash Google DeepMind Text Review and Editing Correction of errors, inaccuracies, and ambiguous phrasing

3. Text Review and Editing

Correction of errors, inaccuracies, and ambiguous phrasing

Gemini 2.5 Flash Google DeepMind
4.
DeepSeek-V3.2 DeepSeek Preparing the Illustration Description Generating a textual prompt for the visual model

4. Preparing the Illustration Description

Generating a textual prompt for the visual model

DeepSeek-V3.2 DeepSeek
5.
FLUX.2 Pro Black Forest Labs Creating the Illustration Generating an image based on the prepared prompt

5. Creating the Illustration

Generating an image based on the prepared prompt

FLUX.2 Pro Black Forest Labs

Don’t miss a single experiment!

Subscribe to our Telegram channel —
we regularly post announcements of new books, articles, and interviews.

Subscribe