We explain how the Mixture of Experts architecture works – an approach that makes models smarter without making them 'think' harder.
AI: Events
How Data Shapes AI Thinking: The Role of Metadata and Knowledge Graphs in Artificial Intelligence's 'Memory'
Infrastructure
Why modern AI can't be truly smart without structured data, and how metadata, reference data, and knowledge graphs shape its 'brain'.
NeuroBlog
When the Neural Network Forgets What You Were Talking About
Artificial intelligence • Technologies
The longer a conversation with an AI lasts, the more it loses the thread, like a conversational partner who grows too tired to hold everything said earlier in their head.
AI: Events
Olmix: Allen AI's Approach to Data Mixing Across All Stages of Language Model Training
Development
Allen AI has introduced Olmix, an open-source framework for data mixing across all stages of language model training: pre-training, instruction tuning, and alignment.
AI: Events
Unsloth Speeds Up MoE Model Training 12x and Extends the Context Window
Technical context • Development
Unsloth's new kernels and mathematical optimizations cut memory requirements by 35%, speed up training 12x, and support context windows six times longer than the original implementation.
AI: Events
RDMA for Language Models: When Servers Learn to Talk Directly to Each Other
Technical context • Infrastructure
The Perplexity AI team has demonstrated how direct server-to-server data transfer helps language models run faster and more efficiently by eliminating network-infrastructure bottlenecks.