Datacenter GPU service life can be surprisingly short – only 1-3 years

by nabla9on 6/6/25, 3:05 PMwith 3 comments
by rbanffyon 6/6/25, 3:16 PM

I can't wait to see the GPU equivalent of Backblaze's hard disk reliability reports. And, considering Intel is building a 1000W CPU with direct liquid cooling, I would love to see one for CPUs as well.

I wonder if multi-phase cooling, fridge-stye, would be an option - pushing sub-zero fluids to the heat exchangers on top of the chips to remove more heat than just water would.

As a fun side note, IBM mainframe CPUs run at 5.2 GHz continuously for years without significant expected failures. The latest one uses liquid cooling with glicol.