Crash Dump Analysis - Santosh Kumar Singh.

Slides:



Advertisements
Similar presentations
Chapter 3 Process Description and Control
Advertisements

Daniel Pearson David Solomon Expert Seminars SVR302.
2 © 2004, Cisco Systems, Inc. All rights reserved. IT Essentials I v. 3 Module 4 Operating System Fundamentals.
计算机系 信息处理实验室 Lecture 5 Startup and Shutdown
MCITP Guide to Microsoft Windows Server 2008 Server Administration (Exam #70-646) Chapter 3 Configuring the Windows Server 2008 Environment.
ADM390 Microsoft® Windows® Crash Dump Analysis
2.1 © 2004 Pearson Education, Inc. Exam Managing and Maintaining a Microsoft® Windows® Server 2003 Environment Lesson 2: Installing Windows Server.
Chapter 14 Chapter 14: Server Monitoring and Optimization.
Advanced OS Chapter 3p2 Sections 3.4 / 3.5. Interrupts These enable software to respond to signals from hardware. The set of instructions to be executed.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 12: Managing and Implementing Backups and Disaster Recovery.
Cs238 Lecture 3 Operating System Structures Dr. Alan R. Davis.
Week:#14 Windows Recovery
1 Chapter Overview Installing New Hardware Installing Updates Managing Client Access Licenses Troubleshooting Boot Problems.
Driver Verifier Advancements In Windows 7 Daniel Mihai Principal Software Design Engineer Windows Engineering Tools.
Ch 11 Managing System Reliability and Availability 1.
Windows Debugging Demystified
W INDOWS BLUE SCREEN OF DEATH AFTER CRASH DEBUGGING Alex Mclean Amy Valley Derek Visch.
Administering Windows 7 Lesson 11. Objectives Troubleshoot Windows 7 Use remote access technologies Troubleshoot installation and startup issues Understand.
®® Microsoft Windows 7 for Power Users Tutorial 8 Troubleshooting Windows 7.
Hands-On Microsoft Windows Server 2008
The Functions of Operating Systems Interrupts. Learning Objectives Explain how interrupts are used to obtain processor time. Explain how processing of.
Operating Systems Lecture 7 OS Potpourri Adapted from Operating Systems Lecture Notes, Copyright 1997 Martin C. Rinard. Zhiqing Liu School of Software.
Chapter 33 Troubleshooting Windows Errors. STOP Errors  When Microsoft Windows XP encounters a serious problem  And the operating system can't continue.
Windows Crash Dump Analysis Daniel Pearson David Solomon Expert Seminars.
NT Kernel CS Spring Overview Interrupts and Exceptions: Trap Handler Interrupt Request Levels and IRT DPC’s, and APC’s System Service Dispatching.
Unit OS11: Performance Evaluation Lab Manual.
11 INSTALLING AND MANAGING HARDWARE Chapter 6. Chapter 6: Installing and Managing Hardware2 INSTALLING AND MANAGING HARDWARE  Install hardware in a Microsoft.
Interrupt driven I/O. MIPS RISC Exception Mechanism The processor operates in The processor operates in user mode user mode kernel mode kernel mode Access.
System Components ● There are three main protected modules of the System  The Hardware Abstraction Layer ● A virtual machine to configure all devices.
Windows Operating System Internals - by David A. Solomon and Mark E. Russinovich with Andreas Polze Unit OS3: Concurrency 3.3. Advanced Windows Synchronization.
Interrupt driven I/O Computer Organization and Assembly Language: Module 12.
1 © 2004 Cisco Systems, Inc. All rights reserved. Session Number Presentation_ID Cisco Technical Support Seminar Using the Cisco Technical Support Website.
Troubleshooting Windows Vista Lesson 11. Skills Matrix Technology SkillObjective DomainObjective # Troubleshooting Installation and Startup Issues Troubleshoot.
windows-10 /alexwaston14/fix-pc-error u/0/b/ /pages/Fix-PC- Error/
WORKING OF SCHEDULER IN OS
Benefits of a Virtual SIL
OPERATING SYSTEM CONCEPT AND PRACTISE
REAL-TIME OPERATING SYSTEMS
An Interrupt is either a Hardware generated CALL (externally derived from a hardware signal) OR A Software-generated CALL (internally derived from.
Interrupts and exceptions
Chapter Objectives In this chapter, you will learn:
MICROPROCESSOR BASED SYSTEM DESIGN
Operating Systems •The kernel is a program that constitutes the central core of a computer operating system. It has complete control over everything that.
Hands-On Microsoft Windows Server 2008

Chapter 2: System Structures
Process Description and Control
Steps to fix Windows 10 Flashing Screen Problem
Introduction to Operating System (OS)
Files Used in the Boot Process
Senior Software Engineering Student Robertas Sys
Real-time Software Design
CONFIGURING HARDWARE DEVICE & START UP PROCESS
Structure of Processes
Chapter 3: Windows7 Part 2.
Windows Internals Brown-Bag Seminar Chapter 1 – Concepts and Tools
Lesson #7 MCTS Cert Guide Microsoft Windows 7, Configuring Chapter 7 Configuring Devices and Updates.
Processor Fundamentals
Chapter 3: Windows7 Part 2.
CSE 451: Operating Systems Spring 2012 Module 6 Review of Processes, Kernel Threads, User-Level Threads Ed Lazowska 570 Allen.
Process Description and Control
Lecture Topics: 11/1 General Operating System Concepts Processes
Process Description and Control
Unit OS5: Memory Management
Architectural Support for OS
Multithreaded Programming
Process Description and Control
Chapter 2 Processes and Threads 2.1 Processes 2.2 Threads
Architectural Support for OS
In Today’s Class.. General Kernel Responsibilities Kernel Organization
Presentation transcript:

Crash Dump Analysis - Santosh Kumar Singh

Windows Crash The “blue screen of death.” Occurs when windows crashes, or stops executing, because of a catastrophic fault or an internal condition that prevents the system from continuing to run

Why windows crash A device driver or an operating system function running in kernel mode incurs an unhandled exception, such as a memory access violation. A call to a kernel support routine results in a reschedule, such as waiting for an unsignaled dispatcher object when the interrupt request level (IRQL) is DPC/dispatch level or higher.

Why windows crashes A page fault on memory backed by data in a paging file or a memory mapped file occurs at an IRQL of DPC/dispatch level or above A device driver or operating system function explicitly crashes the system (by calling the system function KeBugCheckEx) because it detects an internal condition that indicates either a corruption or some other situation that indicates the system can’t continue execution without risking data corruption

Why windows crashes A hardware error, such as a machine check or a nonmaskable interrupt (NMI), occurs.

The Blue Screen Regardless of the reason for a system crash, the function that actually performs the crash is KeBugCheckEx. This function takes a stop code (sometimes called a bug check code) and four parameters that are interpreted on a per–stop code basis. After KeBugCheckEx masks out all interrupts on all processors of the system, it switches the display into a low-resolution VGA graphics mode (one implemented by all Windows-supported video cards), paints a blue background, and then displays the stop code, followed by some text suggesting what the user can do. Finally, KeBugCheckEx calls any registered device driver bug check callbacks (registered by calling the KeRegisterBugCheckCallback function), allowing drivers an opportunity to stop their devices.

The Blue Screen

Possible Recovery F8 Key for last best configuration. Uninstalling last installed software. Drivers that cause problems can be identified and suitable action can be taken.

Crash Dump Files By default, all Windows systems are configured to attempt to record information about the state of the system when the system crashes. Three levels of information can be recorded on a system crash: Complete memory dump Kernel memory dump Small memory dump (Minidump)

Crash Dump Generation When the system boots, it checks the crash dump options configured by reading the registry value HKLM\System\CurrentControlSet\Control\CrashControl Error Reporting

Crash Dump Generation

Online Crash Analysis Dumprep generates an XML-formatted file containing a basic description of the system, including the operating system version, a list of drivers installed on the machine, and the list of Plug and Play drivers loaded on the system at the time of the crash The file and minidump is sent to http://Watson.Microsoft.Com

Online Crash Analysis The analysis generates a bucket ID, which identifies the particular type of crash. It queries the database for more information. If a hot fix or patch is available, a URL is sent that refers to http://oca.microsoft.com If its not found, an email is sent to the user.

NotmyFault You can use the Notmyfault utility from www.sysinternals.com/windowsinternals to generate the crashes described here. Notmyfault consists of an executable named Notmyfault.exe and a driver named Myfault.sys. When you run the Notmyfault executable, it loads the driver and presents the dialog box, which allows you to crash the system in various ways or to cause the driver to leak paged pool.

NotmyFault

Using Crash Troubleshooting Tools If there are one or more drivers you consider likely sources of the crashes, enable them for verification using the Driver Verifier and check all the verification options except for low resources simulation. Enable the same level of verification as in level 1 on all signed and unsigned drivers in the system.

Buffer overrun Pool corruption usually occurs when a driver suffers from a buffer overrun or buffer underrun bug that causes it to overwrite data past either the end or start of a buffer it has allocated from paged or nonpaged pool. This is usually hard to debug due to data corruption

Special Pool

Advanced Crash Dump Analysis Use the !process 0 0 debugger command to look at the processes running and make sure that you understand the purpose of each one. Use the lm command with the kv option to list the loaded kernel-mode drivers Use the !vm command to see whether the system has exhausted virtual memory, paged pool, or nonpaged pool.

Stack Trashes Stack overrun or stack trashing results from buffer overrun or underrun bugs. the target buffer is on the stack of the thread that executes the bug. Hung or Unresponsive Systems A device driver does not return from its interrupt service (ISR) routine or deferred procedure call (DPC) routine A high priority real-time thread preempts the windowing system driver’s input threads A deadlock (when two threads or processors hold resources each other wants and neither will yield what they have) occurs in kernel mode

- End of Presentation