Intellectual hub of the topic

applied analysis

AI: Events

FLUX.2 [flex] Now Runs Three Times Faster

Infrastructure

The Pruna AI team has accelerated image generation in the FLUX.2 [flex] model threefold without compromising quality. We explain how this was achieved and what it means for users.

Pruna AIwww.pruna.ai Jan 29, 2026

AI: Events

OpenHands Index: A New Way to Compare AI Agents on Real-World Tasks

Development

OpenHands has launched a benchmark demonstrating how models handle real-world GitHub tasks – from bug fixes to implementing new features in open-source projects.

OpenHandsopenhands.dev Jan 29, 2026

AI: Events

How a Single Token Broke an Entire Model: The Story of a vLLM Bug

Technical context • Infrastructure

Engineers at AI21 Labs discovered a bizarre bug in vLLM that turned the Jamba model's normal responses into gibberish – and it was all down to a single incorrect token.

AI21 Labswww.ai21.com Jan 29, 2026

AI: Events

YouTube Now Allows Creators to Make Shorts Using AI Avatars

Products

YouTube creators can now leverage AI avatar technology to produce short videos, thanks to a new platform tool powered by Supertone.

Supertonewww.supertone.ai Jan 29, 2026

AI: Events

Chunk Size Depends on the Query: How AI21 Labs Proposes Solving a Major RAG System Challenge

Development

AI21 Labs demonstrated that a single «chunk» size in RAG systems is a compromise and proposed a simple way to adapt text segmentation to the user's query type.

AI21 Labswww.ai21.com Jan 29, 2026

AI: Events

Claude Taught to Write CUDA Kernels and Train Open Models

Technical context • Development

Anthropic has enhanced Claude's capabilities in handling low-level code and transferring knowledge to other models through its new «Extended Thinking» feature.

Hugging Facehuggingface.co Jan 28, 2026

AI: Events

AMD Quark ONNX: Automated Search for Optimal Quantization Strategies

Development

AMD has introduced a tool for automatically identifying the best quantization settings for ONNX models, eliminating the need for developers to manually sift through options.

AMDwww.amd.com Jan 28, 2026

AI: Events

How to Run an AI Coding Agent on AMD Instinct GPUs

Technical context • Infrastructure

AMD has demonstrated how to deploy OpenHands – an agent for automating code writing – on its server GPUs using the vLLM engine.

AMDwww.amd.com Jan 28, 2026

AI: Events

How to Index Huge Repositories in Seconds Instead of Hours

Development

Cursor found a way to speed up the indexing of large codebases by safely reusing indexes created by colleagues, reducing the time from hours to seconds.

Cursor AIcursor.com Jan 28, 2026

Want to know about new
experiments first?

Subscribe to our Telegram channel — we share all the latest
and exciting updates from NeuraBooks.