Some things are slow by definition. The federal permitting system in the U.S. is one of them. Before building a road, running a power line, or constructing a major infrastructure project, it is necessary to undergo an environmental assessment under the National Environmental Policy Act (NEPA). This is a mandatory analysis of a project's impact on the environment, and preparing it can take years. Documents often swell to hundreds of pages, and the process itself has long been a symbol of bureaucratic sluggishness.
This is precisely the problem that OpenAI and the Pacific Northwest National Laboratory (PNNL) – one of the largest federal science laboratories in the U.S., operated by the Department of Energy – have turned their attention to.
What's Happening with These Permits
The NEPA process requires organizations to prepare detailed reports: they must describe the project, assess its potential impact on nature, public health, and local communities, list alternatives, and justify their choice. This isn't just a formality – the law has been in place since 1970 and truly helps identify problems before construction begins. However, the volume of work is immense, and there are not enough specialists who can competently prepare such documents.
To put it simply: even if a project is technically ready, it can wait years for environmental approval. For infrastructure – energy, transportation, telecommunications – this is a significant bottleneck.
The Idea: Let an AI Agent Help Write Drafts
OpenAI and PNNL decided to test whether AI could take on part of this work – not to replace experts, but to help them move more quickly to substantive analysis by bypassing the routine part of document preparation.
To do this, they created a special evaluation tool: DraftNEPABench. It's a kind of test that measures how well AI agents can handle the task of preparing draft sections of NEPA documentation. The benchmark is not a product or a service; it's a testing methodology. It's needed to understand what AI is capable of in this specific area and to compare different approaches.
Why is this important? Without such an evaluation tool, it's impossible to answer the question: «Does this even work?» Before implementing AI in federal processes, it's necessary to be able to measure the results, and DraftNEPABench is designed for exactly that.
What the Initial Results Showed
According to preliminary data, using AI agents during the drafting stage can reduce the time to prepare NEPA documentation by about 15%. It might sound modest, but in the context of processes that drag on for months and years, this is quite significant.
It's important to understand that we are talking specifically about drafts. The AI doesn't make decisions, assess environmental risks on its own, or sign documents. It helps to quickly form the initial structure of the text, gather the necessary sections, and reduce the time spent on routine 'paperwork,' so that experts can spend their time on what requires real judgment.
Why a National Laboratory Is Working on This with OpenAI
PNNL is not a startup or a consulting firm. It is a research organization with many years of experience working in the national interest: energy, security, and environmental sciences. Its involvement in this partnership means that the project is focused not on commercial application, but on real-world use in government processes.
OpenAI, for its part, has recently been actively moving towards collaboration with federal agencies. This partnership is one example of how the company is trying to find applications for its technologies in specific institutional tasks, and not just in consumer products.
The joint development of the benchmark is also a signal: for AI to be genuinely used in public administration, we need not just tools, but also standards for evaluating them. DraftNEPABench is an attempt to establish such a standard for one specific task.
What Remains in Question
A 15% time saving is a result at the draft and preliminary testing level. Whether this figure will hold up in real-world application across different types of projects and various agencies remains to be seen.
The issue of trust also remains an open question. Government agencies are traditionally cautious about automating processes that carry legal and environmental liability. Even if AI helps write drafts, who will check their quality, and how? How can we ensure that an automatically generated text doesn't contain errors that will later turn into problems at the approval stage?
These questions don't render the project meaningless – on the contrary, it's precisely to solve them that a tool like DraftNEPABench is needed. But they serve as a reminder that there is still a considerable distance between «AI can write drafts» and «AI is integrated into the federal permitting process.»
Nevertheless, the very fact that such a partnership and such a benchmark have emerged indicates that the conversation about applying AI in public administration is gradually shifting from theoretical to practical. And that, perhaps, is the most interesting part of this story.