I frequently get errors from two of the 12 cards in my rig: either GPU0, which is a 3060 Ti, or GPU11, which is a 3070. T-Rex Miner often prints lines like these:
WARN: NVML: can't get fan speed for GPU #0, error code 999
[FAIL] 39/43 - Low difficulty or invalid share, 34ms ... GPU #0
When I try to modify overclock settings in the web GUI I get this type of error:
=== GPU 11, 10:00.0 GeForce RTX 3070 7982 MB, PL: 100 W, 270 W, 300 W === 16:49:31
SET POWER LIMIT: 125.0 W [Unknown Error] (exicode=123)
Max Perf mode: 4 (auto)
ERROR: Error assigning value 95 to attribute 'GPUTargetFanSpeed' (NB1:0[fan:16]) as specified in assignment '[fan:16]/GPUTargetFanSpeed=95' (Unknown Error).
ERROR: Error assigning value 95 to attribute 'GPUTargetFanSpeed' (NB1:0[fan:17]) as specified in assignment '[fan:17]/GPUTargetFanSpeed=95' (Unknown Error).
ERROR: Error assigning value 0 to attribute 'GPUGraphicsClockOffset' (NB1:0[gpu:11]) as specified in assignment '[gpu:11]/GPUGraphicsClockOffset[4]=0' (Unknown Error).
ERROR: Error assigning value 0 to attribute 'GPUMemoryTransferRateOffset' (NB1:0[gpu:11]) as specified in assignment '[gpu:11]/GPUMemoryTransferRateOffset[4]=0' (Unknown Error).
Attribute 'GPUFanControlState' (NB1:0[gpu:11]) assigned value 1.
I run the latest stable release, but I have also tried the latest beta. A reboot seems to fix it, but the problem comes back quickly on either GPU #0 or GPU #11.
Any ideas why this is?
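
In case it helps narrow things down, here is a rough sketch of how the miner log could be tallied per GPU to confirm that only these two cards are affected. It assumes the log lines look exactly like the two T-Rex lines quoted above; the function name tally_gpu_errors and both regular expressions are my own guesses and are not part of T-Rex.

```python
import re
from collections import Counter

# Both patterns are assumptions derived only from the two log lines quoted
# in this post; other T-Rex versions may format these lines differently.
NVML_RE = re.compile(r"WARN: NVML: .* for GPU #(\d+), error code (\d+)")
FAIL_RE = re.compile(r"\[FAIL\].*GPU #(\d+)")


def tally_gpu_errors(lines):
    """Count NVML warnings and rejected shares per GPU index."""
    nvml, fails = Counter(), Counter()
    for line in lines:
        m = NVML_RE.search(line)
        if m:
            nvml[int(m.group(1))] += 1
            continue
        m = FAIL_RE.search(line)
        if m:
            fails[int(m.group(1))] += 1
    return nvml, fails


if __name__ == "__main__":
    sample = [
        "WARN: NVML: can't get fan speed for GPU #0, error code 999",
        "[FAIL] 39/43 - Low difficulty or invalid share, 34ms ... GPU #0",
    ]
    nvml, fails = tally_gpu_errors(sample)
    print("NVML warnings per GPU:", dict(nvml))
    print("Rejected shares per GPU:", dict(fails))
```

Feeding it the full log file (for example via open(path) as the lines argument) should show whether the NVML warnings and the rejected shares always land on the same GPU index.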