We dive into how to make language models run faster and cheaper – from runtime optimization to distributed request processing.
AI: Events
Cognizant and Uniphore Team Up to Develop Industry-Tailored AI for Business Needs
Business
The companies have announced a strategic partnership aimed at creating AI solutions tailored to specific industries, rather than relying on one-size-fits-all general-purpose models.
A team of engineers has figured out how to convert neural networks into standard logic chains, boosting performance on weak processors by 15% without sacrificing accuracy.
Version 1.2 expands editing and audio capabilities in the Suno Studio generative workstation, providing users with more control over the final mix.
AI: Events
How AMD GPUs Accelerate Graph Visualization – And Where AI Fits In
Technical context • Development
AMD demonstrated how to port graph layout algorithms to GPUs using the ROCm platform, employing AI as an assistant for writing and adapting code.
The Perplexity team shared the story behind their search engine, which handles 200 million queries daily and operates in tandem with large language models.
Anthropic has proposed a way to standardize the integration of language models with external sources – from databases to work tools. We explore how the MCP protocol solves the problem of fragmented integrations.
AI: Events
Perplexity Shows How to Train Trillion-Parameter Models on AWS
Technical context • Infrastructure
The Perplexity team has adapted a framework for training ultra-large neural networks for Amazon's cloud infrastructure. This allowed them to eliminate the rigid dependency on proprietary NVIDIA hardware and utilize standard networking solutions.
AI: Events
SenseTime Unveils SenseNova-SI-1.3: A Model Featuring Advanced Spatial Intelligence
Products
The Chinese firm has open-sourced an AI model that topped the charts in eight spatial environment understanding benchmarks.