The Chinese text recognition model has been adapted for AMD GPUs – we break down what this means for those working with documents.
AI: Events
How a Single Token Broke an Entire Model: The Story of a vLLM Bug
Technical context • Infrastructure
Engineers at AI21 Labs discovered a bizarre bug in vLLM that turned the Jamba model's normal responses into gibberish – and it was all down to a single incorrect token.
AMD has introduced a tool for automatically identifying the best quantization settings for ONNX models, eliminating the need for developers to manually sift through options.
AMD has demonstrated how to deploy OpenHands – an agent for automating code writing – on its server GPUs using the vLLM engine.
Cursor found a way to speed up the indexing of large codebases by safely reusing indexes created by colleagues, reducing the time from hours to seconds.
The latest update to AMD's Ryzen AI Software includes support for new models, improved performance, and expanded tools for developers working with AI on Ryzen processors.
Microsoft has revealed its second-generation in-house AI chip, designed specifically for running trained models rather than training them.
At CES 2026, Qualcomm introduced a concept for intelligent devices featuring local artificial intelligence that adapts to every user and operates independently of the cloud.
Qualcomm shared insights into how the next generation of Wi-Fi and autonomous AI agents can transform wireless network architecture, making them faster and «smarter».