this post was submitted on 19 Oct 2023
66 points (97.1% liked)

Linux

48655 readers
413 users here now

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

founded 5 years ago
MODERATORS
 

Lately my PC has started crashing while it plays videos. It freezes completely, screen frozen and not responding to any input (keyboard, mouse), I mean I cannot change TTY (alt + ctrl + F(1-2-...)), and it cannot even respond to alt + PrntScr + REISUB. I have to force power off by holding down the power button.

After I reboot I have tried checking all logs available and I cannot find anything logged right before the incident. Last entries are always different and not indicating anything.

I suspect it has to do with the graphics card but I'm looking for ways that I can dig deeper on that and confirm it or not.

What else should I check? How can I find more info?

OS: Lubuntu 22.04.3 LTS (latest updates) I'm using the nvidia proprietary drivers (nvidia-driver-390)

UPDATE:

First of all thank you all for your input and fresh ideas. Now I've already tried some of them and I will continue with the other ones until I get some results.

till now I have tried

  • memtest and it didn't show any errors.
  • boot from a live distro and see if problem also occurs. Well it didn't occur but on the live distro you cannot change the graphics driver. So it was using the open source nouveau driver, also it didn't happen during the 1 hour I let it play. The thing is that it never was punctual even before. It could happen during the first hour or the third or sometime later.

Next steps are to

  • open the case and clean it up to remove the possibility of high temp because of that,
  • change my drivers to be the nouveau and try again,
  • try with only the onboard GPU on,
  • remove extra disks to reduce the load of the PSU

thank you all again.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 9 points 1 year ago (2 children)

I was in a similar situation not too long ago and couldn't find anything to fix it either at first. One thing that was high on my list was changing my PSU since a defect or weak one often seems to be a problem in such cases. Besides a general hardware failure of course. If it's the hardware that could be anything really. Motherboard, RAM, GPU, PSU. PSU is the easiest to switch tho, so if you go that route I would try that first.

Anyways, I never had to do this cause in my case, believe it or not, a BIO update fixed my problem. I am still not 100% sure what happened but I think the update fixed the GPU voltage distribution or something similar.

Hope that help at least a little bit.

[–] [email protected] 3 points 1 year ago (1 children)

good idea about the PSU. I hadn't thought of that. The PSU is not any high-performance/high-quality and is already 5 years old. Being unable to provide the required voltage may be a possibility if we accept that the performance degrades in time. (Was working without issues for 5 years in the same PC configuration).

I think I'll try by first removing the extra HDDs so reducing the load and check again. Thanks for your input

[–] [email protected] 1 points 1 year ago

If your processor/MB has onboard video, it would probably be easier to pull the gpu and test. If you still suspect power management, pulling other components like additional HDDs after adding the gpu back would confirm it.

[–] reddit_sux 3 points 1 year ago

PSU is the last thing we check but is usually the first to fail under load if it is old or cheap. Try reducing the load on it like not using ur GPU, HDDs or any other peripheral that is unnecessary.

Next check ur RAM, that too can give random errors under load.