Intellectual hub of the topic

Engineering

AI: Events

Tencent Hunyuan Reveals How to Pinpoint Bottlenecks in Language Model Training

Development

Researchers from Tencent have developed a tool that pinpoints where failures occur when models are trained with reinforcement learning.

Tencent • hunyuan.tencent.com • Feb 14, 2026

AI: Events

Gang Scheduling: Balancing Rigidity and Flexibility in AI Compute Allocation

Technical context • Infrastructure

We explore how gang scheduling helps allocate resources efficiently for training AI models and why striking a balance between rigidity and flexibility is crucial.

Alibaba Cloud • www.alibabacloud.com • Feb 14, 2026

Lab

When Energy Draws Invisible Bridges: The Geography of Collapse in Mapping Spaces

Mathematics & Statistics

How mathematical maps lose energy when concentrating at a point and why this creates geometric necks connecting spaces via invisible threads.

Dr. Amalia Richter • Feb 14, 2026

AI: Events

Higress: Gateway API Support and Extensions for AI Inference

Infrastructure

The Higress cloud gateway has been updated to support the Gateway API standard and now includes specialized features for working with artificial intelligence models.

Alibaba Cloud • www.alibabacloud.com • Feb 14, 2026

AI: Events

AI Agents Write CUDA Kernels: GPT and Claude Learn to Generate GPU Code

Technical context • Development

Two AI agents can create optimized CUDA kernels to speed up operations straight from a task description. Let's dive into what this means for people working with models.

Hugging Face • huggingface.co • Feb 13, 2026

AI: Events

MiniMax Introduces Forge: A Platform for Training AI Agents on Powerful Computing Clusters

Infrastructure

Chinese company MiniMax has released Forge, an open platform designed for training agents using reinforcement learning on large-scale GPU clusters.

MiniMax • www.minimax.io • Feb 13, 2026

AI: Events

How AMD and Qwen Optimized MI300X GPUs for Peak Performance

Technical context • Infrastructure

The Qwen team optimized its models to run efficiently on AMD MI300X GPUs, achieving response latency as low as 15 ms per token and full image generation in just 0.4 seconds.

LMSYS ORG • lmsys.org • Feb 13, 2026

AI: Events

Training Language Models with Feedback: verl Now Runs on AMD GPUs

Technical context • Infrastructure

The verl framework for training large language models with reinforcement learning has received support for AMD ROCm 7.0.0 and expanded scaling capabilities.

AMD • www.amd.com • Feb 13, 2026

AI: Events

MiniMax M2.5: Open-Source Models Catch Up to Claude Sonnet

Products

Chinese company MiniMax has released M2.5, a family of open-weight models whose performance is approaching that of Claude 3.5 Sonnet.

OpenHands • openhands.dev • Feb 13, 2026
