I use two Supermicro servers in a homelab. I have had an issue with one of them where if you put it under heavy load it hard resets.
So far there has been nothing in the logs either on the host or within IPMI.
I have two identical hosts and I swapped memory and replaced both HDDs but the issue persisted with the bad host. So I have suspected motherboard, CPU or other hardware issue.
This morning I went to boot it up and I got the following PSOD.
It looks to be an issue with one of the SSDs which forms part of the vSAN disk group. Both SSDs in the host have been replaced and the issue was present on the old ones.
This leads me to believe it is a hardware issue on the box itself, such as the CPU, motherboard, hard drive controller etc.
Is there anything I can check to back up my thoughts?