This report implies that GPUs have a higher failure rate than CPUs in "A.I." data centers.
https://www.datacenterdynamics.com/en/news/meta-report-details-hundreds-of-gpu-and-hbm3-related-interruptions-to-llama-3-training-run/
@dtabb73 Well, the majority of computations during training is run on the GPUs though...
QOTO: Question Others to Teach Ourselves An inclusive, Academic Freedom, instance All cultures welcome. Hate speech and harassment strictly forbidden.