multi-GPU Communication

NVIDIA Dynamo Addresses Multi-Node LLM Inference Challenges

Serving Large Language Models (LLMs) at scale is complex. Modern LLMs now exceed the memory and compute capacity of a single GPU or even a single multi-GPU node. As a result, inference workloads for ...

SDxCentral

Google opens up AI with Cloud GPU communication and agentic interfacing

AI enthusiasts rejoice, for Google has released a new open source agent solution on top of updates to its supercomputing platform in Google Cloud. The Google AI Hypercomputer now includes support for ...

Network World

Nvidia turns to software to speed up its data center networking hardware for AI

New algorithms will fine-tune the performance of Nvidia Spectrum-X systems used to connect GPUs across multiple servers and even between data centers. Nvidia wants to make long-haul GPU-to-GPU ...

Network World

Perplexity’s open-source tool to run trillion-parameter models without costly upgrades

TransferEngine enables GPU-to-GPU communication across AWS and Nvidia hardware, allowing trillion-parameter models to run on older systems. Perplexity AI has released an open-source software tool that ...

TweakTown

AMD still rumored with high-end 'multi-chiplet' Radeon GPU, could debut with UDNA architecture

TL;DR: AMD's new multi-chiplet gaming GPU patent introduces a "smart switch" to reduce latency and optimize memory access, potentially overcoming monolithic design limits. This innovation, inspired by ...

Forbes

DigitalOcean And AMD Deliver Doubled Inference Performance For Character.ai

This voice experience is generated by AI. Learn more. This voice experience is generated by AI. Learn more. Cloud providers are increasingly competing based on inference results such as throughput, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results