Skip Navigation

Very large amounts of gaming gpus vs AI gpus

GPUVRAMPrice (€)Bandwidth (TB/s)TFLOP16€/GB€/TB/s€/TFLOP16
NVIDIA H200 NVL141GB362844.891671257742321
NVIDIA RTX PRO 6000 Blackwell96GB84501.79126.088472067
NVIDIA RTX 509032GB22991.79104.871128422
AMD RADEON 9070XT16GB6650.644697.324110317
AMD RADEON 907016GB6190.644672.25389608.5
AMD RADEON 9060XT16GB3820.322351.282311867.45

This post is part "hear me out" and part asking for advice.

Looking at the table above AI gpus are a pure scam, and it would make much more sense to (atleast looking at this) to use gaming gpus instead, either trough a frankenstein of pcie switches or high bandwith network.

so my question is if somebody has build a similar setup and what their experience has been. And what the expected overhead performance hit is and if it can be made up for by having just way more raw peformance for the same price.

43 comments
  • The AI cards prioritize compute density instead of frame rate, etc so you can't directly compare price points between them like that without including that data. You could cluster gaming cards, though, using NVLink or the AMD Fabric thing. You aren't going to get any where near the same performance, and you are really going to rely on quantization to make it work, but depending on your use case in self-hosting you probably don't need a $30,000 card.

    Its not a scam, but its also something you probably don't need.

43 comments