I’ve gotten this alert a few times, starting yesterday, but when I check, everything looks fine.
The message log shows:
[2022-03-04 00:55:20] GPU 0: detected DEAD (0d:00.0), will execute restart script watchdog.sh
[2022-03-04 00:55:20] Watchdog script executor thread executing script ‘watchdog.sh’
The card seems fine when I check… It’s not running too hot or anything. Could it be a glitch, or could the card be failing?
amd-info:
=== GPU 0, 0d:00.0 Radeon RX 5500 XT 8176 MB ===
Bios: 113-D3322003-O05, UUID: T3MW210012110303
Core: 1232 MHz 843mV, Mem: 990 MHz
PerfCtrl: manual, Load: 99%, MemLoad: 90%, Power: 69.0 W, Cap: 135 W
Core: 47°C, HotSpot: 54°C, Mem: 76°C, Fan: 56%, RPM: 1881
Core state: 2, clocks: 300 775 1250*
Mem state: 3, clocks: 100 500 625 990*
SOC state: 2, clocks: 304 785 1266*
DCEF state: 0, clocks: 304* 785 1266
F state: 2, clocks: 304 785 1266*
PCIE Link speed: GEN2 (5.0GT/s), PCIE Link width: x1
Memory total: 8176.00 MB, used: 8011.14 MB, free: 164.86 MB, type: Micron GDDR6
VDDGfx: 856mV, VDDCI: 850mV, VDDCR_SOC: 893mV, MVDD: 1350mV, MVDDQ: 1350mV