3080 – GPU Driver Error, no temps

Johnnmarnell · June 29, 2021, 7:22pm

Hey how did it go? I am following you. Mine was doing well the other day around 100mhs but maybe the temperature outside it got hot even though in basement.

jmw60546 · June 29, 2021, 10:06pm

I have had to just manage the overclock settings to keep it stable. Each card has its own limits and it takes time.

The hot weather makes it hard to keep cool and I get more failures than usual as a result.

Johnnmarnell · July 4, 2021, 6:48pm

I think just a HIVEOS issue if you are only getting the fan missing. If fan missing, just verify in machine fan is running and ignore it. If you are getting more GPU issues then pull back the overclock. Temp outside makes a big difference as well. I have my internal fans, then two extrenal fans blowing air in and out of my rig. on 4 of 6 I can get overclock mem up to 2550 and stable doing like 101 mhs. on one of them only 1700 and on another one only 1800. If I put the 1700 up to 1800 will eventually error out. IF I put 1800 up to 1900 same after awhile. You just have to find the stable speed. BUT if just fan mssing ignore it as long as you know it is running. I also found a cool way to make Hiveos reboot for me each time a GPU error does occur. If you need to know let me know. With 6 3080 I am at 598.5mhs. I really want to hit 600 lol.

CristhianJP · July 23, 2021, 8:30am

Hi bro, could you please tell me how did you do that?

Thanks in advance!

Johnnmarnell · July 23, 2021, 12:13pm

I had a few issues lately with it as IFTTT is having lots of problems. What I did though is set my hashrate in Hiveos to restart after it drops below 500. This just means one of my workers goes offfline, OR if my GPU goes down. I set the restart to 1 minute and the reboot to 1 minute. Then I have it set to trigger the notification to my telegram account. So, anytime it goes down and is supposed to reboot I get a telegram notification. Then I bought a KASA smart plug on Amazon for under $20 and created an IFTTT (free) so that anytime I get a notification from Hiveos saying offline or whatever keyword they send it shuts my KASA plug off. And then added another notification after that to turn it back on. But, lately IFTTT has not been triggering well and think the company might just stink, lol. One last thing you have to link your IFTTT and allow it to access your telegram by creating a group in telegram with your HIVEOS and IFTTT allowed in that group. This is how you create the rule. Are you using Linux? If so, I found a cool notification program you can install directly into Linux and playing around with that but so far cant quite get it to work yet. It can not be this hard and shocked someone does not have a simple script like I do for all my windows machnines. If you want to know for windows I use a very simple script that checkes and makes sure it is always running.

Johnnmarnell · July 23, 2021, 12:18pm

And then hate to say it but worst case, fine your overclock setting that causes an error. One of mine can hash 101 per sec, one only 95 per sec so each one you tweak on its own. Once you find that, lower the memory by like 3000 (so my max on the 101 /s is about 2700 overclock memory. I reduce that to 2400. You may lose a little, but then I am up for weeks and I restart it every week just for fun as in my head l think it is good for no reason. HA ha. But rather than keep it close to the max overclock and have it shutting down once in awhile. I can go from like 590/s with 6 gpu down to 582 with 6 GPU and when it is up all the time it ends up averaging out much better. So lose some but VERY stable. One of mine max out at 1700 I just set it to 1400 as that ends up being only a 1.5/s hash loss and still get 95 out of it. THEN DEF GET A SMART PLUG. Cause no matter where I am if I see hiveos down or get notificaiton and it does not work, I just can restart it on the app so never down more than maybe an hour.

CristhianJP · July 23, 2021, 4:17pm

Thanks for your response my bro, i have 6 rtx 3080 having oc issues with an evga ftw3 just reduced the mem clock to 1800 and testing right now (it was 2000 at 97.4 mh/s, now 96.2 mh/s), hope it helps.

What are your 6 gpus if you can share?

vinniel · July 29, 2021, 8:32pm

Has anyone attempted to flash RTX 3060 to ensure they all have same bios in a single rig?

I have cards that are either 94.06.14.40.45 or 94.06.25.00.63. It seems logical to have them all in same bios rev for consistency but afraid of bricking it by trying

edit here’s my settings running ERGO on Herominers:

kkxl1306 · August 29, 2021, 7:25am

Help me. I have five 3080 and overclocking at - 2002500. The normal computing power is about 100, but I have a graphics card that can only be placed at about 2100. If it is higher, an error will occur. After running for a period of time, you will encounter GPU driver error no temps. Please tell me your method. thank you

harish621 · September 4, 2021, 10:08pm

I’ve been stressing out about this for a couple weeks and finally found a fix for my situation. . I was getting this GPU driver error no temps on my 3080 Dell 10GB GPUs. I tried different risers, cables, USB drives, etc. This is what I did and have been able to runs my rig for over 24 hours without any errors or issues.

Fix:
• Delete current worker
• Create new worker
• Download the new rig.conf file and replace that with your current in your USB drive
• Put USB drive back in
• Turn rig back on
• Once everything reboots, go back to the overclock settings and change everything to a moderate OC setting. (mine are -200 core clock, 1800 memory, 220 PL, 75% fan)
That seamed to work for me. I have 6 3080 GPU on this rig and average over 90 mh per GPU. I’ll waiit for some time and slowly try to increase my OC to get this higher, but I’m happy that this is at least stable. Hope this helps others.

ayap · September 6, 2021, 8:49pm

you mean create a new config file and delete the old one?

harish621 · September 12, 2021, 11:47am

Yes that is correct, did it work for you?

Alfiethecat · September 13, 2021, 8:11pm

here i am looking for some explanation about why my rig keeps crashing odd hourly on hive os. And after i read here i will have a very bad news for you all. well i have 6 card rig with 3060ti founders edition and 2 beautiful rtx 3080 liquid cooled EVGA hybirds, they always run around 39 C to 40 C max, didn really look at junction but assuming they are around 85-90 C. After i read here i start to think its not the cards its something to do with Hiveos or some odd drivers crashing this. does anyone using windows having these issues ? by the way one of my 3080 makes 100.9mh/s 235W , the other one makes 97.9 mh/s ( what ever i do ) its crashes everytime if i overclock a little higher. didnt try the absulute clock. Now i am sitting here thinking may be i should try windows for a week which i hate ! its clearly not temps. as my cards are very very cool.

winross83 · September 16, 2021, 12:34pm

Exactly same problem on two gigabyte RTX 3080 Gaming OC, i have change all pads near gpu by gelid extrem 2mm and behind the backplate by a 3 mm anyway same result : after a run of 7 hours i have an GPU Driver Error, no temps ! I have test the cards on Windows : with a little OC (-200 Core clock - Mem up to +1000 Mhz and power limit at 240W) : GPU Temp is at 48-50C and memory junction is about 80-82°C so fine for this … but when i try to test a little more oc (CC -200, Mem +1200 MHz, Power L. 240W) i have a black screen instantanly . I Think the memory of Gigabyte is poor : GDR6x Micron … and i think we have to flash bios of theses cards … bad news …

highflyer · September 23, 2021, 5:38pm

try using absolute core clock of 1050 for 3080s and 1200 for 3090s it will change your life brotha! You can get a 3080 down to 200watts same speed using absolute core clocks, it’s a big deal. Look up more but I gave you the sweet spot for your 3080 and 3090, just put it in and it will change to absolute mode. Enjoy!

T_S · October 3, 2021, 6:27am

Could you solve your issue? Did you flash the bios?

I’ve been searching for answers since a month. None of the forums have a solution till date.
I suppose its something to do with bad memory as you said. Silicon lottery…

I am also not sure how some people reach 3000MHz on Memory with -200 on Core. My GPU crashes at values above 1800MHz

I am able to get 79 MH/s max at -200 Core and 1770 MHz. Values above this are unstable and causes black screen and reboot.

The two most annoying messages that crash the mining are: -
“GPU driver error, no temps” and
“LA > 36 rebooting”

still searching for fixes.

winross83 · October 3, 2021, 9:00am

Already test with absolut core clock valor same prob dude

winross83 · October 3, 2021, 9:03am

Hi i’m french and i have search on french forums. Some people have this probs too and that was no solution ! just a fu*king silicon joke lotery . Flashing bios is riskly and i haven’t test it for this time

Danilak · October 4, 2021, 11:05am

For me it was all about memory clock on my hynix 3060ti lhr. I lowered it from 2400 to 1800 and hadnt had a carsh since then. That card was crashing my rig after 1-60min intervals on t-rex, miniZ, nbminer. Said GPU error, no temp.

Try to lower your memory down to like 1500 and increase by 50 after running rig for 1hr and its stable.

GL.

Krazip · October 11, 2021, 9:44am

I got the same no temp error on 3070ti. I solved that by lowered the memory clock. I lowered bit by bit (50) until I found the stable clock. Good luck!

Edit: Each cards are different, don’t stick too much on what clock you found on internet. It’s just a reference point, actual stable clock might be higher or lower.