I also had this problem, but mine is different. After continuously running for more than 24 hours it occurred.
It seems like we have the same error log. But my rig is running on a ssd, which is copied from a usb by following the official tutorial, but I’ve never checked whether this ssd had utilized its full size(120G), so I though maybe it was something wrong with the ssd.
After rebooting I couldn’t even connect the machine through my local network. Therefore, I removed all the graphic cards but one in the main PCIE-16 slot and connected it through a hdmi monitor, but it didn’t work either. I had tried to reset my bios setting, removed the ssd and it didn’t work either.
But the thing is that my rig is now runnning on a motherboard, a gpu, a memory memory and a power supply, the configuration is similar to the time when I first built them.
Anything is fine but the memory, very cheap and costs approximately 13.6$, maybe had some problems. I had to order a new memory made by Kingston and to wait to see what would happen when I get the new memory. It it doesn’t work, the last resort is to change the motherboard. There is nothing wrong with gpu, since everyone here has multiple gpus, if someting goes wrong with it, it can easily be spotted.
So the conclusion is that buying a reliable memory is sensible