Published on April 1, 2026

ASUS UGen300: USB AI акселератор для периферийных вычислений

ASUS UGen300: The Flash Drive That Runs AI

ASUS has unveiled the UGen300 – a compact USB accelerator designed for running AI directly on-device, без cloud или подписок, and with support for language models.

Products 4 – 5 minutes min read
Event Source: ASUS 4 – 5 minutes min read

When you think of AI accelerators, you usually picture something large and expensive – a graphics card, a server module, or at least an expansion card. ASUS is offering a different format: a small USB dongle that connects to any device and handles artificial intelligence tasks.

On April 1, 2026, the company announced the UGen300 USB AI Accelerator – a device ASUS is calling the world's first USB accelerator for edge AI (that is, AI running directly on the device, not in the cloud). In addition to the USB version, an M.2 form factor is also available for those who prefer a built-in solution.

Что внутри ASUS UGen300

What's Inside the Little Box

The UGen300 is a device measuring 105 × 50 × 18 mm. In terms of size, it's roughly like a thick external SSD, only a bit smaller. Inside is the Hailo-10H processor, specifically designed for AI tasks. Its performance is rated at 40 AI TOPS. To put it simply, this is a unit that measures how many operations a chip can perform per second when working with neural networks. 40 TOPS is enough to run language models, models that work with images and text simultaneously, and other modern AI tools.

In addition to the processor, the device is equipped with 8 GB of its own LPDDR4 RAM, not borrowed from the host system. This is an important point: the UGen300 doesn't 'eat up' resources from the computer or smartphone it's connected to, but operates autonomously within the scope of its tasks.

It connects via USB-C. Power consumption under a typical load is just 2.5 watts. For comparison, a standard night light bulb uses more energy.

Для кого предназначен USB AI-акселератор

Who Is It For and Why

The main idea behind the device is to allow those without powerful hardware to run AI tasks. An old laptop, a single-board computer, an Android device – just plug in the UGen300, and the system gets hardware acceleration for neural networks.

Moreover, there's no need for manual setup: the device is plug-and-play, meaning you just plug it in and it works. It's compatible with Windows, Linux, and Android. It also supports common AI frameworks – the popular tools developers use to create and run neural networks.

The UGen300 can be used for a wide range of tasks: text generation, video summarization, speech recognition and its conversion into commands, and real-time visual information processing.

Преимущества локального ИИ-акселератора

No Cloud – It's Not Just Marketing

One of the key features of the UGen300 is its complete independence from cloud services. This means several things at once.

First, there are no subscriptions. Most cloud AI services charge for usage – either monthly or per request. The UGen300 operates locally, without any recurring payments.

Second, there's no latency. When a request travels to the cloud and back, it takes time. With local processing, the response is generated instantly on the device.

Third, privacy. Your data isn't sent anywhere – it's processed right on the device. For tasks where confidentiality is crucial, this is a significant advantage.

"By integrating the Hailo-10H into a ubiquitous USB device, ASUS is making the full power of AI and generative AI accessible to everyone. We can't wait to see how the developer community will use this plug-and-play accelerator to push the boundaries of AI on devices. This is how Hailo envisions the future of AI: accessible, affordable, and designed for everyone to work with," said Max Glover, Chief Revenue Officer at Hailo.

Возможности и применение ASUS UGen300

What This Changes – and for Whom

The UGen300 appeals to several groups of people.

For developers, it's a convenient tool for experimenting with local AI models without having to buy expensive hardware or pay for the cloud.

For enthusiasts and smart home hobbyists, it's a way to add AI capabilities to devices not originally designed for them.

For businesses that handle sensitive data, it's a solution for local AI without the risk of data leaks to external services.

However, a number of open questions remain. How smoothly heavy language models run with a limit of 8 GB of memory depends on the specific model and task. A performance of 40 TOPS is a good metric for a device of this class, but it's still no substitute for a full-fledged discrete graphics card. Instead, it's a tool for scenarios where portability, power efficiency, and autonomy are more important than maximum generation speed.

Overall, the UGen300 seems like an attempt to reframe our understanding of where and how AI can operate. Not in a data center, not on a powerful desktop – but in your pocket, at the end of a USB cable, next to any device you already have at hand.

Original Title: ASUS Announces UGen300 USB AI Accelerator
Publication Date: Apr 1, 2026
ASUS press.asus.com A Taiwanese technology company developing hardware and software AI solutions for consumer and professional devices.
Previous Article Aurora: How AI Learned to Predict Its Responses and Continuously Improve Next Article Falcon Perception: A Single Transformer for Vision, Text, and Document Understanding

Related Publications

You May Also Like

Explore Other Events

Events are only part of the bigger picture. These materials help you see more broadly: the context, the consequences, and the ideas behind the news.

From Source to Analysis

How This Text Was Created

This material is not a direct retelling of the original publication. First, the news item itself was selected as an event important for understanding AI development. Then a processing framework was set: what needs clarification, what context to add, and where to place emphasis. This allowed us to turn a single announcement or update into a coherent and meaningful analysis.

Neural Networks Involved in the Process

We openly show which models were used at different stages of processing. Each performed its own role — analyzing the source, rewriting, fact-checking, and visual interpretation. This approach maintains transparency and clearly demonstrates how technologies participated in creating the material.

1.
Claude Sonnet 4.6 Anthropic Analyzing the Original Publication and Writing the Text The neural network studies the original material and generates a coherent text

1. Analyzing the Original Publication and Writing the Text

The neural network studies the original material and generates a coherent text

Claude Sonnet 4.6 Anthropic
2.
Gemini 2.5 Pro Google DeepMind step.translate-en.title

2. step.translate-en.title

Gemini 2.5 Pro Google DeepMind
3.
Gemini 2.5 Flash Google DeepMind Text Review and Editing Correction of errors, inaccuracies, and ambiguous phrasing

3. Text Review and Editing

Correction of errors, inaccuracies, and ambiguous phrasing

Gemini 2.5 Flash Google DeepMind
4.
DeepSeek-V3.2 DeepSeek Preparing the Illustration Description Generating a textual prompt for the visual model

4. Preparing the Illustration Description

Generating a textual prompt for the visual model

DeepSeek-V3.2 DeepSeek
5.
FLUX.2 Pro Black Forest Labs Creating the Illustration Generating an image based on the prepared prompt

5. Creating the Illustration

Generating an image based on the prepared prompt

FLUX.2 Pro Black Forest Labs

Don’t miss a single experiment!

Subscribe to our Telegram channel —
we regularly post announcements of new books, articles, and interviews.

Subscribe