AMD says its MI300X AI accelerator is quicker than Nvidia's H100

AMD says its MI300X AI accelerator is quicker than Nvidia's H100 - MI300X vs H100 - NVIDIA H100 - Intel Gaudi 2 vs Nvidia H1

Last updated 14 month ago

AI
Hardware
amd
nvidia

AMD says its MI300X AI accelerator is quicker than Nvidia's H100



A hot potato: AMD is combating back at Nvidia's claims about the H100 GPU accelerator, which in keeping with Team Green is quicker than the opposition. But Team Red said Nvidia didn't tell the complete story, and provided similarly benchmark outcomes with enterprise-widespread inferencing workloads.

AMD has eventually launched its Instinct MI300X accelerators, a new era of server GPUs designed to offer compelling overall performance stages for generative AI workloads and different excessive-performance computing (HPC) applications. MI300X is faster than H100, AMD stated in advance this month, however Nvidia attempted to refute the competitor's statements with new benchmarks launched multiple days in the past.

Nvidia examined its H100 accelerators with TensorRT-LLM, an open-source library and SDK designed to efficiently accelerate generative AI algorithms. According to the GPU business enterprise, TensorRT-LLM turned into able to run 2x faster on H100 than on AMD's MI300X with right optimizations.

AMD is now providing its own model of the tale, refuting Nvidia's statements about H100 superiority. Nvidia used TensorRT-LLM on H100, instead of vLLM utilized in AMD benchmarks, at the same time as evaluating overall performance of FP16 datatype on AMD Instinct MI300X to FP8 datatype on H100. Furthermore, Team Green inverted AMD's posted overall performance data from relative latency numbers to absolute throughput.

AMD shows that Nvidia tried to rig the game, at the same time as it is still busy figuring out new paths to unlock performance and uncooked electricity on Instinct MI300 accelerators. The company provided the cutting-edge overall performance tiers achieved through the Llama 70B chatbot version on MI300X, displaying a fair higher aspect over Nvidia's H100.

By the usage of the vLLM language model for both accelerators, MI300X became able to attain 2.1x the performance of H100 thanks to the modern day optimizations in AMD's software stack (ROCm). The agency highlighted a 1.4x performance advantage over H100 (with equivalent datatype and library setup) earlier in December. VLLM became selected because of its huge adoption within the network and the potential to run on each GPU architectures.

Even while the usage of TensorRT-LLM for H100, and vLLM for MI300X, AMD turned into nonetheless able to offer a 1.3x improvement in latency. When the usage of decrease-precision FP8 and TensorRT-LLM for H100, and better-precision FP16 with vLLM for MI300X, AMD's accelerator became apparently able to reveal a performance advantage in absolute latency.

vLLM does not assist FP8, AMD explained, and FP16 datatype become selected for its popularity. AMD said that its results display how MI300X the use of FP16 is similar to H100 even if the use of its excellent performance settings with FP8 datatype and TensorRT-LLM.

  • MI300X vs H100

  • NVIDIA H100

  • Intel Gaudi 2 vs Nvidia H100

  • NVIDIA H100 price

  • AMD MI300 performance

  • AMD MI300X release date

  • Intel Gaudi 3 vs Nvidia

  • MI300X vs A100

Apple unveils M3 chips powering new MacBook Pros and iMac at some point of Scary Fast Halloween event

Apple unveils M3 chips powering new MacBook Pros and iMac at some point of Scary Fast Halloween event

What just happened? Apple held its Scary Fast Halloween occasion on the uncommon time of eight pm ET / five pm PT the previous day, wherein it confirmed off the trendy M3, M3 Pro and M3 Max chips. The SoCs will seem wit...

Last updated 16 month ago

An upgraded Sony PS5 DualSense controller with 12-hour battery life may be in the works

An upgraded Sony PS5 DualSense controller with 12-hour battery life may be in the works

Rumor mill: The PlayStation 5 DualSense controller is pretty notable however it is now not with out flaws that Sony may want to enhance upon. Understandably, news of a revision speedy sparked exhilaration, in particular...

Last updated 13 month ago

Microsoft's AI found a new fabric to replace lithium in li-ion batteries

Microsoft's AI found a new fabric to replace lithium in li-ion batteries

Forward-looking: Modern lithium-ion rechargeable batteries depend on lithium and other rare earth metals. While they provide an efficient power source with an extended cycle existence, they also can pose environmental i...

Last updated 13 month ago

Swiss glaciers are melting at alarming rate, dropping 10% extent in just  years

Swiss glaciers are melting at alarming rate, dropping 10% extent in just years

Why it subjects: In what is greater regarding news about the surroundings, a new record has discovered that glaciers in Switzerland have lost 10% of their quantity in only two years, extra than they did inside the 3 a l...

Last updated 17 month ago

Sony came up with a better region to keep and charge its new earbuds; interior your controller

Sony came up with a better region to keep and charge its new earbuds; interior your controller

 It's not frequently that a patent has a lot of an impact on me – I can commonly take them or leave them. However, Sony's recent patent for a controller that doubles as a charger and storage for its upcoming wireless ea...

Last updated 16 month ago

18-12 months-vintage hacker behind GTA VI leak sentenced to lifestyles in stable health facility facility

18-12 months-vintage hacker behind GTA VI leak sentenced to lifestyles in stable health facility facility

What just occurred? The teenage mastermind behind the infamous GTA VI hack and leak that happened ultimate year has been sentenced to indefinite imprisonment interior a sanatorium jail. The member of hacking organizatio...

Last updated 14 month ago


safirsoft.com© 2023 All rights reserved

HOME | TERMS & CONDITIONS | PRIVACY POLICY | Contact