Jump to content


 


Register a free account to unlock additional features at BleepingComputer.com
Welcome to BleepingComputer, a free community where people like yourself come together to discuss and learn how to use their computers. Using the site is easy and fun. As a guest, you can browse and view the various discussions in the forums, but can not create a new topic or reply to an existing one unless you are logged in. Other benefits of registering an account are subscribing to topics and forums, creating a blog, and having no ads shown anywhere on the site.


Click here to Register a free account now! or read our Welcome Guide to learn how to use this site.

Photo

Persistent reboots, BSODs, and hangs


  • Please log in to reply
8 replies to this topic

#1 Carp20

Carp20

  • Members
  • 4 posts
  • OFFLINE
  •  
  • Local time:10:52 PM

Posted 30 November 2016 - 05:48 AM

I've been getting irregular errors on a desktop that take one of the following forms, listed from most to least frequent:
 
- Unexpected restart
- A hard freeze/hang in which the machine becomes completely unresponsive, requiring a hard reboot
- A BSOD displaying 0x124 WHEA_UNCORRECTABLE_ERROR
 
These errors do not appear to be linked to any particular type or level of activity; they happen just as often word processing or web surfing as watching HD video or gaming.  Sometimes I can use the desktop for several consecutive days with no problems, and sometimes I witness 3 or 4 restarts/BSODs/freezes in a single day.
 
The desktop in question is a Lenovo ThinkStation S30 with a new SSD and GPU.
 
The OS is Windows 10 x64
The computer originally had Windows 7 on it, but was upgraded to 10 before I acquired it.  I do not know the exact age of the system or exactly when Windows 10 was installed.
 
System: Lenovo Thinkstation S30
CPU: Intel Xeon E5-1620 v2 3.70 GHz
GPU: ASUS GeForce GTX 950 MINI-GTX950-2G 2GB 128-Bit GDDR5
Mobo: Lenovo S30
PSU: ACBel FSA034 610W max
 
I have attached the two requested .zip files.  I also have the most recent BSOD crash dump, which is too big to upload but which I can pull some details from if that's helpful.
 
Thanks for any help you can provide me with!

Attached Files



BC AdBot (Login to Remove)

 


#2 usasma

usasma

    Still visually handicapped (avatar is memory developed by my Dad


  • BSOD Kernel Dump Expert
  • 25,091 posts
  • OFFLINE
  •  
  • Gender:Male
  • Location:Southeastern CT, USA
  • Local time:02:52 AM

Posted 30 November 2016 - 07:41 AM

Your UEFI/BIOS (version A2KT57AUS) dates from September of 2016.  Please check at the manufacturer's website to see if there are any UEFI/BIOS updates available for your system.  This is just in case there has been a more recent update.
FYI - W8 and W10 communicate more with the UEFI/BIOS than previous versions of Windows, so it's important to ensure that the UEFI/BIOS is kept up to date (and that outdated UEFI/BIOS' may be the cause of some compatibility issues).

Although you appear to have a reasonable number of Windows Update hotfixes for this version of your OS, please double check for any new Windows Updates.  It only takes one update to cause a problem, so it's essential that you have all of them.  The actual number is not important.  Rather it's important that you checked manually, installed any available updates, and didn't experience any errors when checking or updating.

I see that you have Seatools and Furmark installed.  Have you run those diagnostics?  If so, please let us know which tests you ran and how they came out.

I strongly suggest that you complete ALL of the free diagnostics on this page:  http://www.carrona.org/hwdiag.html

When you finish them all, please let us know the results.

 

All of the memory dumps are STOP 0x124's  More info on them here:  http://www.carrona.org/bsodindx.html#0x00000124

In general, these errors are hardware related - but they may also be due to compatibility issues or problems with low-level drivers.

 

Your Open Hardware Monitor driver dates from 2008.  I'd have to seriously wonder if this program is compatible w/W10

I suggest uninstalling it to see if that stops/helps the BSOD's

 

The driver for your Secunia software dates from 2013, I'd suggest checking it for compatibility with W10 also.

I also suggest checking the Lenovo website to update ALL of your software/drivers from there.

 

Analysis:
The following is for information purposes only. The following information contains the relevant information from the blue screen analysis:
**************************Wed Nov 23 00:36:35.401 2016 (UTC - 5:00)**************************
Loading Dump File [C:\Users\john\SysnativeBSODApps\112216-8859-01.dmp]
Windows 10 Kernel Version 14393 MP (8 procs) Free x64
Built by: 14393.447.amd64fre.rs1_release_inmarket.161102-0100
System Uptime:7 days 20:30:25.082
Probably caused by :GenuineIntel
BugCheck 124, {0, ffffc089b0f26028, fe000104, 1000c1}
BugCheck Info: WHEA_UNCORRECTABLE_ERROR (124)
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: ffffc089b0f26028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000fe000104, High order 32-bits of the MCi_STATUS value.
Arg4: 00000000001000c1, Low order 32-bits of the MCi_STATUS value.
BUGCHECK_STR:  0x124_GenuineIntel
PROCESS_NAME:  System
FAILURE_BUCKET_ID: 0x124_GenuineIntel_MEMORY__UNKNOWN
CPUID:        "Intel® Xeon® CPU E5-1620 v2 @ 3.70GHz"
MaxSpeed:     3700
CurrentSpeed: 3700
  BIOS Version                  A2KT57AUS
  BIOS Release Date             09/29/2016
  Manufacturer                  LENOVO
  Product Name                  43516F9
  Baseboard Product             LENOVO
¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨``
**************************Tue Nov 15 04:05:15.106 2016 (UTC - 5:00)**************************
Loading Dump File [C:\Users\john\SysnativeBSODApps\111516-6703-01.dmp]
Windows 10 Kernel Version 14393 MP (8 procs) Free x64
Built by: 14393.447.amd64fre.rs1_release_inmarket.161102-0100
System Uptime:0 days 3:10:06.786
Probably caused by :GenuineIntel
BugCheck 124, {0, ffffaf08c9504028, fe00007f, 1000c1}
BugCheck Info: WHEA_UNCORRECTABLE_ERROR (124)
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: ffffaf08c9504028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000fe00007f, High order 32-bits of the MCi_STATUS value.
Arg4: 00000000001000c1, Low order 32-bits of the MCi_STATUS value.
BUGCHECK_STR:  0x124_GenuineIntel
PROCESS_NAME:  System
FAILURE_BUCKET_ID: 0x124_GenuineIntel_MEMORY__UNKNOWN
CPUID:        "Intel® Xeon® CPU E5-1620 v2 @ 3.70GHz"
MaxSpeed:     3700
CurrentSpeed: 3700
  BIOS Version                  A2KT57AUS
  BIOS Release Date             09/29/2016
  Manufacturer                  LENOVO
  Product Name                  43516F9
  Baseboard Product             LENOVO
¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨``
**************************Mon Nov 14 04:49:18.360 2016 (UTC - 5:00)**************************
Loading Dump File [C:\Users\john\SysnativeBSODApps\111416-6890-01.dmp]
Windows 10 Kernel Version 14393 MP (8 procs) Free x64
Built by: 14393.447.amd64fre.rs1_release_inmarket.161102-0100
System Uptime:0 days 1:24:35.041
Probably caused by :GenuineIntel
BugCheck 124, {0, ffffcf86e7f04028, fe000045, 1000c1}
BugCheck Info: WHEA_UNCORRECTABLE_ERROR (124)
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: ffffcf86e7f04028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000fe000045, High order 32-bits of the MCi_STATUS value.
Arg4: 00000000001000c1, Low order 32-bits of the MCi_STATUS value.
BUGCHECK_STR:  0x124_GenuineIntel
PROCESS_NAME:  System
FAILURE_BUCKET_ID: 0x124_GenuineIntel_MEMORY__UNKNOWN
CPUID:        "Intel® Xeon® CPU E5-1620 v2 @ 3.70GHz"
MaxSpeed:     3700
CurrentSpeed: 3700
  BIOS Version                  A2KT57AUS
  BIOS Release Date             09/29/2016
  Manufacturer                  LENOVO
  Product Name                  43516F9
  Baseboard Product             LENOVO
¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨``
The rest of the memory dump summaries are hidden in the Spoiler tag below.  Click on "Show" to reveal them.

Spoiler




3rd Party Drivers:
The following is for information purposes only. My recommendations were given above. The drivers that follow belong to software or devices that were not developed by Microsoft.  You can find links to the driver information and where to update the drivers in the section after the code box:

**************************Wed Nov 23 00:36:35.401 2016 (UTC - 5:00)**************************
psi_mf_amd64.sys            Thu Feb  7 04:28:21 2013 (511373B5)
IOMap64.sys                 Wed Oct 22 20:52:12 2014 (5448513C)
iaStorS.sys                 Fri Nov 20 08:03:33 2015 (564F1A25)
eudskacs.sys                Wed Dec  9 16:51:42 2015 (5668A26E)
eubakup.sys                 Wed Dec  9 16:51:48 2015 (5668A274)
EUBKMON.sys                 Wed Dec  9 16:52:08 2015 (5668A288)
EuFdDisk.sys                Wed Dec  9 16:52:13 2015 (5668A28D)
e1i63x64.sys                Fri Mar  4 16:46:29 2016 (56DA0235)
TeeDriverW8x64.sys          Mon Mar 28 14:59:41 2016 (56F97F1D)
nvvad64v.sys                Tue Apr 12 04:46:52 2016 (570CB5FC)
nvhda64v.sys                Tue May 10 04:59:01 2016 (5731A2D5)
NvStreamKms.sys             Fri Jun  3 07:28:21 2016 (575169D5)
intelppm.sys                Fri Jul 15 22:10:43 2016 (578997A3)
nvlddmkm.sys                Fri Sep 16 18:21:27 2016 (57DC7067)
¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨¨``
**************************Tue Nov 15 04:05:15.106 2016 (UTC - 5:00)**************************
OpenHardwareMonitor.sys     Sat Jul 26 09:29:37 2008 (488B26C1)

http://www.carrona.org/drivers/driver.php?id=psi_mf_amd64.sys
http://www.carrona.org/drivers/driver.php?id=IOMap64.sys
http://www.carrona.org/drivers/driver.php?id=iaStorS.sys
http://www.carrona.org/drivers/driver.php?id=eudskacs.sys
http://www.carrona.org/drivers/driver.php?id=eubakup.sys
http://www.carrona.org/drivers/driver.php?id=EUBKMON.sys
http://www.carrona.org/drivers/driver.php?id=EuFdDisk.sys
http://www.carrona.org/drivers/driver.php?id=e1i63x64.sys
http://www.carrona.org/drivers/driver.php?id=TeeDriverW8x64.sys
http://www.carrona.org/drivers/driver.php?id=nvvad64v.sys
http://www.carrona.org/drivers/driver.php?id=nvhda64v.sys
http://www.carrona.org/drivers/driver.php?id=NvStreamKms.sys
http://www.carrona.org/drivers/driver.php?id=intelppm.sys
http://www.carrona.org/drivers/driver.php?id=nvlddmkm.sys
http://www.carrona.org/drivers/driver.php?id=OpenHardwareMonitor.sys
 


Edited by usasma, 30 November 2016 - 09:50 AM.

My browser caused a flood of traffic, sio my IP address was banned. Hope to fix it soon. Will get back to posting as soon as Im able.

- John  (my website: http://www.carrona.org/ )**If you need a more detailed explanation, please ask for it. I have the Knack. **  If I haven't replied in 48 hours, please send me a message. My eye problems have recently increased and I'm having difficult reading posts. (23 Nov 2017)FYI - I am completely blind in the right eye and ~30% blind in the left eye.<p>If the eye problems get worse suddenly, I may not be able to respond.If that's the case and help is needed, please PM a staff member for assistance.

#3 JinXiang91

JinXiang91

  • Members
  • 97 posts
  • OFFLINE
  •  

Posted 30 November 2016 - 09:40 AM

 

I've been getting irregular errors on a desktop that take one of the following forms, listed from most to least frequent:
 
- Unexpected restart
- A hard freeze/hang in which the machine becomes completely unresponsive, requiring a hard reboot
- A BSOD displaying 0x124 WHEA_UNCORRECTABLE_ERROR
 
These errors do not appear to be linked to any particular type or level of activity; they happen just as often word processing or web surfing as watching HD video or gaming.  Sometimes I can use the desktop for several consecutive days with no problems, and sometimes I witness 3 or 4 restarts/BSODs/freezes in a single day.
 
The desktop in question is a Lenovo ThinkStation S30 with a new SSD and GPU.
 
The OS is Windows 10 x64
The computer originally had Windows 7 on it, but was upgraded to 10 before I acquired it.  I do not know the exact age of the system or exactly when Windows 10 was installed.
 
System: Lenovo Thinkstation S30
CPU: Intel Xeon E5-1620 v2 3.70 GHz
GPU: ASUS GeForce GTX 950 MINI-GTX950-2G 2GB 128-Bit GDDR5
Mobo: Lenovo S30
PSU: ACBel FSA034 610W max
 
I have attached the two requested .zip files.  I also have the most recent BSOD crash dump, which is too big to upload but which I can pull some details from if that's helpful.
 
Thanks for any help you can provide me with!

 

Hi,

Please kindly run a memtest86 : http://www.memtest86.com/ and revert back the result.

Thanks.



#4 Carp20

Carp20
  • Topic Starter

  • Members
  • 4 posts
  • OFFLINE
  •  
  • Local time:10:52 PM

Posted 01 December 2016 - 03:37 AM

Thank you both for your prompt replies.
 
Today I proceeded through the following steps:
 
1. Checked the Lenovo support site for updates and attempted to install updates to the Intel Lan Driver, RSTe SCU/SATA Management Utility, Chipset Driver, AMT Driver, and the Realtek Audio Driver.  All succeeded with the exception of the SCU/SATA Management Utility.  Attempting to download and install this utility manually produced a warning that the version I was attempting to install was older than the current version, at which point I desisted.
2. Updated the BIOS to the latest version, A2JY58USA
3. Checked Windows Update, which found no new updates.
4. Deleted Open Hardware Monitor and its driver.
 
Subsequently I encountered another freeze requiring a hard reset, so I suspect the issue was not resolved by the above.
 
I will now begin the various memory/gpu/etc. tests that you've recommended and will report back when those are complete.  Some of them I have already tried in the past when I was trying to diagnose the issue myself (which is why Furmark was already installed on the system), but I'll re-run everything I did to be thorough.
 
Again, I appreciate your help.


#5 usasma

usasma

    Still visually handicapped (avatar is memory developed by my Dad


  • BSOD Kernel Dump Expert
  • 25,091 posts
  • OFFLINE
  •  
  • Gender:Male
  • Location:Southeastern CT, USA
  • Local time:02:52 AM

Posted 01 December 2016 - 05:53 AM

Often BSOD's are caused by the interaction of several problems.

In a lot of cases we end up seeing that the system will freeze even though the BSOD's have stopped.

 

I suggest continuing with the hardware diagnostics - as they are the most likey to reveal a problem (IMO)
You can also let the system freeze a couple of more times.  In those cases, continue to do the hard reset.
After you get 3 (or more) of them, please do the hard reset and then generate this report:

 

Please do the following:
- open Event Viewer (run eventvwr.msc from the "Run" dialog)
- expand the Custom Views category (left click on the > next to the words "Custom Views")
- right click on the "Administrative Events" heading
- select "Save all Events in Custom View as..."
- save the file as Admin.evtx
- zip up the file (right click on it, select "Send to", select "Compressed (zipped) folder")
- upload it with your next post (if it's too big, then upload it to a free file-hosting service and post a link here).

FYI - If we're looking for Event ID 41 errors (unexplained shutdowns), there's more info on that here:  http://support.microsoft.com/kb/2028504

While waiting for a reply, please monitor your temps with this free utility: 

SpeedFan v. 4.5.1 and later (free from here:  http://www.almico.com/sfdownload.php ) can log temperatures in a CSV file:
You may not need to log temps.  If uncertain, please ask.

To make it work (log the temps to a file) you have to do BOTH of the below:

1. Enable logging in general: Configure...Log...check "Enabled" then click on OK to save.

2. Enable logging for specific checks: Configure...Temperatures..left click on each sensor, then click on "Logged" at the bottom of the Window (for our purposes we want them all) then once you've selected "Logged" for all sensors, then click on OK to save.

The log will be located at C:\Program Files (x86)\SpeedFan

Naming: log files are named SFLogYYYYMMDD.csv, where YYYY is the year (four digits), MM is the month (2 digits, zero padded) and DD is the day (2 digits, zero padded). If a file already exists by that name, the file that already exists is renamed according to the following naming scheme: SFLogYYYYMMDD-CCCC.csv, where CCCC is a increasing number. The new file is then created with the standard file name scheme.

Notes: whenever you change the options related with logging, SpeedFan starts a new log file.

NOTE:You may want to turn logging off when we're done - as I don't know it's impact on performance or on the system.

The hard restarts will generate Critical errors in the log file (Event ID 41 errors) - and those will be the markers where we start looking for related entries/patterns in the Admin logfile.


My browser caused a flood of traffic, sio my IP address was banned. Hope to fix it soon. Will get back to posting as soon as Im able.

- John  (my website: http://www.carrona.org/ )**If you need a more detailed explanation, please ask for it. I have the Knack. **  If I haven't replied in 48 hours, please send me a message. My eye problems have recently increased and I'm having difficult reading posts. (23 Nov 2017)FYI - I am completely blind in the right eye and ~30% blind in the left eye.<p>If the eye problems get worse suddenly, I may not be able to respond.If that's the case and help is needed, please PM a staff member for assistance.

#6 Carp20

Carp20
  • Topic Starter

  • Members
  • 4 posts
  • OFFLINE
  •  
  • Local time:10:52 PM

Posted 02 December 2016 - 12:05 AM

I started the diagnostic list by running memtest86+ and letting it run overnight, and after coming home this evening I found that a number of errors had popped up on the 3rd pass.  (A total of six passes were complete by the time I terminated the test.)  Following usasma's instructions on his page, I'm now testing the RAM sticks one by one to try and establish which, if any, are problematic.

 

Is there any kind of error or output I should be looking for in particular with memtest86+, or am I really only looking for the presence of errors?



#7 JinXiang91

JinXiang91

  • Members
  • 97 posts
  • OFFLINE
  •  

Posted 02 December 2016 - 01:15 AM

I started the diagnostic list by running memtest86+ and letting it run overnight, and after coming home this evening I found that a number of errors had popped up on the 3rd pass.  (A total of six passes were complete by the time I terminated the test.)  Following usasma's instructions on his page, I'm now testing the RAM sticks one by one to try and establish which, if any, are problematic.
 
Is there any kind of error or output I should be looking for in particular with memtest86+, or am I really only looking for the presence of errors?

Hi,

Any presence of errors being found in memtest86+ is denote that your RAM is faulty and needs to be replace.

Once you had done the test thoroughly, do have the faulty RAM remove from the system and check whether the problem of your system remains.

Thanks.

#8 Carp20

Carp20
  • Topic Starter

  • Members
  • 4 posts
  • OFFLINE
  •  
  • Local time:10:52 PM

Posted 04 December 2016 - 07:05 AM

I tested all of the memory sticks in the computer individually, but after seven passes on each not one of them produced errors.  So I tried a different approach: I took out all the computer's RAM and replaced it with older (but known good) RAM from another computer.  The desktop which had experienced the freezes/restarts/BSODs has now been running for three days without a single problem.

 

So, on the one hand, it seems obvious that the sticks are the issue; on the other hand, each one of them tested good when running memtest on them individually.

 

At this point, all I can think of (short of just tossing all the sticks and buying all-new RAM) is to "test" the sticks by inserting them one at a time into the computer at an interval of several days/weeks between each stick and seeing when the problems start again.

 

In any case, thank you both for directing me to memtest and helping me locate the problem.



#9 usasma

usasma

    Still visually handicapped (avatar is memory developed by my Dad


  • BSOD Kernel Dump Expert
  • 25,091 posts
  • OFFLINE
  •  
  • Gender:Male
  • Location:Southeastern CT, USA
  • Local time:02:52 AM

Posted 05 December 2016 - 05:26 PM

Memtest is a software test of a hardware component.  It's designed to detect problems using software methods.
We have seen (both online and at my work) that sometimes you can spot memory errors with Prime95 that aren't seen with Memtest.

As such, it's my opinion that the software tests aren't always 100% accurate - and that they should be used to help select replacement components with this in mind.

 

My testing methods were developed (over the years) to speed up the resolution of BSOD errors.
As such, they sacrifice the testing of individual things (both hardware and software) in favor of a quick resolution. 

That's why I suggest multiple things in my posts to you.  If I didn't do it this way, we could take months to test each individual suspect.

 

I'd put the old RAM back in (being sure to use anti-static precautions) and see if it fails on Prime95's Blend test.

If it does, then test those sticks individually.

Good luck!


My browser caused a flood of traffic, sio my IP address was banned. Hope to fix it soon. Will get back to posting as soon as Im able.

- John  (my website: http://www.carrona.org/ )**If you need a more detailed explanation, please ask for it. I have the Knack. **  If I haven't replied in 48 hours, please send me a message. My eye problems have recently increased and I'm having difficult reading posts. (23 Nov 2017)FYI - I am completely blind in the right eye and ~30% blind in the left eye.<p>If the eye problems get worse suddenly, I may not be able to respond.If that's the case and help is needed, please PM a staff member for assistance.




0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users