To ensure neural networks run quickly and reliably, they need special processors. This directly affects the kind of AI services we ultimately get.
The Pruna AI team has accelerated image generation in the FLUX.2 [flex] model threefold without compromising quality. We explain how this was achieved and what it means for users.
OpenHands has launched a benchmark demonstrating how models handle real-world GitHub tasks – from bug fixes to implementing new features in open-source projects.
YouTube creators can now leverage AI avatar technology to produce short videos, thanks to a new platform tool powered by Supertone.
Researchers at the Allen Institute for AI have created the Theorizer system, which analyzes arrays of scientific publications and attempts to formulate general patterns from them.
We explore the architectural solutions developers of Chinese open-source models are choosing and why decoder-based approaches continue to dominate the ecosystem.
AI: Events
Trinity Large: What's Inside and Why Arcee Released Three Versions of the Same Model
Technical context • Products
We dive into how Trinity Large from Arcee AI works as a new language model with a sparse architecture and three checkpoints to choose from.
AMD has demonstrated how to deploy OpenHands – an agent for automating code writing – on its server GPUs using the vLLM engine.
AI: Events
Open Coding Agents: AI Code Assistants That Work With Any Repository
Technical context • Development
The Allen Institute for AI has unveiled Open Coding Agents – open-source models for autonomous coding that adapt to a project's structure.