this post was submitted on 29 Apr 2024
124 points (97.7% liked)

Technology

60011 readers
2165 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS
 

Earlier this month, we wrote that some of Intel's recent high-end Core i9 and Core i7 processors had been crashing and exhibiting other weird issues in some games and that Intel was investigating the cause.

An Intel statement obtained by Igor's Lab suggests that Intel's investigation is wrapping up, and the company is pointing squarely in the direction of enthusiast motherboard makers that are turning up power limits and disabling safeguards to try to wring a little more performance out of the processors.

"While the root cause has not yet been identified, Intel has observed the majority of reports of this issue are from users with unlocked/overclock capable motherboards," the statement reads. "Intel has observed 600/700 Series chipset boards often set BIOS defaults to disable thermal and power delivery safeguards designed to limit processor exposure to sustained periods of high voltage and frequency."

top 28 comments
sorted by: hot top controversial new old
[–] GuStJaR 49 points 7 months ago (1 children)

I was having issues with crashes in multiple games but rdr2 was the worst. I had a rig built with an i9 14900k and Asus hero z790.

I think I finally found the solution and it was to do with the default bios settings for my Asus MB and my i9 14900k.

In the document linked here...

https://www.intel.com/content/www/us/en/content-details/743844/13th-generation-intel-core-and-intel-core-14th-generation-processors-datasheet-volume-1-of-2.html

Page 98, Table 17, Row 3: Reveals the stock turbo power limits for the 13900K and 14900K CPUs are 253W, not the 4,000+ my MB's Bios settings default to. Page 184, Table 77, Row 6: Lists the maximum current limit at 307A, far below the MB's default of 500+A.

I found this information in a Reddit post (https://www.reddit.com/r/overclocking/comments/1axepvu/optimizing_stability_for_intel_13900k_and_14900k/) and followed the settings as follows:

ASUS Z790 Motherboards:

Save your current settings into a profile so you can return to them later if you want.

Reset your BIOS to default settings. Ai Tweaker tab:

Disable MultiCore Enhancement.

Enable XMP(if your RAM supports it).

Set SVID behavior to Typical Scenario.

Set short duration turbo power = 253

Set long duration turbo power = 253

Set max core/cache current = 307Amps

Doing this immediately stabilised the CPU temps as well as bring down the average temp by ~10 to 15c. It's been a few months now with zero crashes.

Hope this helps someone

[–] JackFrostNCola 0 points 7 months ago* (last edited 7 months ago) (3 children)

This is not a typo right, 307Amps?!
What creative maths have they done to get this number?

The PCB tracks on the motherboard are what, about 0.5mm thick and about 2mm wide (for the larger channels)? I can absolutely guarantee you arent getting 300+ Amps through those tracks.

Update: Thanks for the replies, it makes sense when dealing with these extremely low voltages and TIL a lot. Cheers!

[–] TechNerdWizard42 5 points 7 months ago (1 children)

Oh but you are. It's at 0.8v to 1.2v range so it's high current.

This is what all the VRM design is for. The motherboards are generally 20-30 layers nowadays with 2oz copper in the power layers. The traces are short and you do get hundreds of amps.

And yes, I've designed them on the silicon side.

[–] JackFrostNCola 1 points 7 months ago
[–] RedWeasel 3 points 7 months ago (1 children)

That is 253watts at 1.21ish volts. Multiply those together and you get around 307. Divide 307 by 253 to get the exact voltage based on those number.

[–] JackFrostNCola 1 points 7 months ago
[–] AnyOldName3 3 points 7 months ago (1 children)

It's a 250W+ part running at around 1V, so it's going to draw a lot of current. Power is supplied via many pins on the back of the CPU, and they're connected to many traces, so it's not putting all that current through just one. It still puts out a lot of heat anyway, which is why modern motherboards have large heat sinks, sometimes with fans, on their VRMs.

[–] JackFrostNCola 2 points 7 months ago
[–] [email protected] 25 points 7 months ago (1 children)

From what I understood from Hardware Unboxed, running without hard power limits is essentially "supported" by Intel and motherboard manufacturers weren't compelled to stick to the "recommended" power limits.

The fact that the new "Intel Baseline" profile that was pushed to motherboards via a BIOS update is vastly inconsistent between manufacturers leads be to believe that Intel doesn't clearly state "do this and this as default".

I find it a bit cheap to put the blame solely on motherboard manufacturers here.

There are also reports of instabilities with CPUs running at supposedly safe power limits. I can't confirm this but I also wouldn't be surprised if these power limits also caused silicon degradation at an unexpectedly fast pace.

[–] RedWeasel 3 points 7 months ago

What I found interesting is that the “Intel baseline” setting doesn’t seem to be the default. So if a builder sells a pc and manually sets it and the user needs to update/reset the settings to default, they will go back to unlimited.

[–] Cossty 17 points 7 months ago

Hardware Unboxed recently made a video and said that intel is mostly to blame, because they don't have clearly defined defaults, because they want motherboards to "overclock" CPUs because it looks good on benchmarks.

[–] [email protected] 14 points 7 months ago* (last edited 7 months ago) (3 children)

I've been reading news about this for a bit.

I believe that I may have damaged an i9-13900KF with stock Asus motherboard settings myself (though I can still make it work by disabling all but one core, sees constant problems now with multiple cores active).

If you're getting one of these yourself, no joke, give serious consideration to using more-conservative-then-stock-motherboard settings.

[–] paraphrand 15 points 7 months ago (2 children)

I never choose to mess with overclocking. This situation would have burned someone like me who assumes defaults are safer. What a mess.

[–] [email protected] 7 points 7 months ago* (last edited 7 months ago)

Yeah, I could believe that there would be overclocking settings in a BIOS that would let you damage a CPU. I just was also thinking that whatever motherboard vendors chose as defaults wouldn't. But, well, I suppose that their own qualification process might not be as rigorous as Intel's.

[–] [email protected] 1 points 7 months ago

In the past it has been considered pretty safe to play with a moderate OC because the CPUs have decent thermal protection built in. Seems like that era might be over.

[–] Audalin 2 points 7 months ago (1 children)

Any guidance on choosing appropriate conservative settings for i7-13700K? I may be hit with the same as you in the future (sometimes I have to do some heavy multithreaded combinatorial computations which run several days with 100°C temperature, using all cores). The motherboard has options for customising pretty much everything there is, but I didn't touch anything overclocking-related, so I have Asus defaults.

[–] [email protected] 5 points 7 months ago* (last edited 7 months ago) (1 children)

The article has a bunch of settings that they say that Intel's flagged as "don't use". Intel will be a better source than me.

[–] Audalin 1 points 7 months ago

I see, thanks. Will check. I just thought perhaps you figured out something other than those from your experience.

[–] [email protected] 1 points 7 months ago (1 children)

Did thoes defaults include XMP though? XMP is also overclocking.

[–] [email protected] 2 points 7 months ago (1 children)

On my own motherboard, it is a default, but the article doesn't list it as being a setting believed to be problematic from a CPU damage standpoint.

[–] [email protected] 1 points 7 months ago

I guess not for your specific cpu, but Asus fried some ryzen 7000 cpus with XMP last year

[–] monkeyman512 14 points 7 months ago

My guess is the motherboard manufacturers could get away with this in the past without any issues. But Intel is pushing chips so close to redline out of the box that now it causes problems.

[–] [email protected] 1 points 7 months ago

This is the best summary I could come up with:


An Intel statement obtained by Igor's Lab suggests that Intel's investigation is wrapping up, and the company is pointing squarely in the direction of enthusiast motherboard makers that are turning up power limits and disabling safeguards to try to wring a little more performance out of the processors.

"While the root cause has not yet been identified, Intel has observed the majority of reports of this issue are from users with unlocked/overclock capable motherboards," the statement reads.

"Intel has observed 600/700 Series chipset boards often set BIOS defaults to disable thermal and power delivery safeguards designed to limit processor exposure to sustained periods of high voltage and frequency."

As we reported previously, the problems primarily affect high-end unlocked Core i9 CPUs like the i9-13900K and i9-14900K, as well as KF and KS variants of the same processors.

Intel releases these K-series chips to satisfy overclockers and tinkerers, but it's clear that there just isn't a lot of performance headroom left in these chips since Intel is already pushing their clock speeds and voltages to squeeze out generational performance improvements.

If it is, we should know more about the company's recommendations for safe power settings sometime next month, and we'll hopefully see new BIOS releases from the motherboard manufacturers that reinstate some of Intel's safeguards for these high-end chips.


The original article contains 355 words, the summary contains 218 words. Saved 39%. I'm a bot and I'm open source!

[–] [email protected] 0 points 7 months ago

Nah, just Intel shifting the blame.