Analyzing Monitoring Results
The baseline you develop establishes the typical counter values you should expect to see when your system is performing satisfactorily. The following section provides guidelines to help you interpret the counter values and eliminate false or misleading data that might cause you to set your own target values inappropriately.When you are collecting and evaluating data to establish a valid performance baseline, consider the following guidelines:
When monitoring processes of the same name, watch for unusually large values for one instance and not the other. This can occur because System Monitor sometimes misrepresents data for separate instances of processes of the same name by reporting the combined values of the instances as the value of a single instance. Tracking processes by process identifier can help you get around this problem. For information about monitoring processes, see "Analyzing Processor Activity" later in this book.When you are monitoring several threads and one of them stops, the data for one thread might appear to be reported for another. This is because of the way threads are numbered. For example, you begin monitoring and have three threads, numbered 0, 1, and 2. If one of them stops, all remaining threads are resequenced. That means that the original thread 0 is now gone and the original thread 1 is renamed to 0. As a result, data for the stopped thread 0 could be reported along with data for the running thread 1 because old thread 1 is now old thread 0. To get around this problem, you can include the thread identifiers of the process's threads in your log or display. Use the ThreadThread ID counter for this purpose.Do not give too much weight to occasional spikes in data. These might be due to startup of a process and are not an accurate reflection of counter values for that process over time. The effect of spikes can linger over time when using counters that average.For monitoring over an extended period of time, use graphs instead of reports or histograms because these views only show last values and averages. As a result, they might not give an accurate picture of values if you are looking for spikes.Unless you specifically want to include startup events in your baseline, exclude these events because the temporary high values tend to skew overall performance results.Investigate zero values or missing data. These can impede your ability to establish a meaningful baseline. There are several possible explanations for this. For more information, see "Troubleshooting Problems with Performance Tools" later in this chapter.
Identifying Potential Bottlenecks
Deviations from your baseline provide the best indicator of performance problems. However, as a secondary reference, Table 27.8 describes recommended thresholds for object counters. Use this table to help you identify when a performance problem is developing on your system. If the values listed are consistently reported on your system, consult additional chapters in the Performance Monitoring section of this book for how to investigate and correct the problems causing these values.Table 27.8 Recommended Thresholds for the Minimum Set of System Counters
Resource | ObjectCounter | Suggested Threshold | Comments |
---|---|---|---|
Disk | LogicalDisk% Free Space | 15 percent | None |
Disk | LogicalDisk% Disk Time | 90 percent | None |
Disk | PhysicalDisk Disk Reads/sec, PhysicalDisk Disk Writes/sec | Depends on manufacturer's specifications | Check the specified transfer rate for your disks to verify that this rate does not exceed the specifications. In general, Ultra Wide SCSI disks can handle 50 to 70 I/O operations per second. |
Disk | PhysicalDisk Current Disk Queue Length | Number of spindles plus 2 | This is an instantaneous counter; observe its value over several intervals. For an average over time, use PhysicalDiskAvg. Disk Queue Length. |
Memory | Memory Available Bytes | Less than 4 MB | Research memory usage and add memory if needed. |
Memory | Memory Pages/sec | 20 | Research paging activity. |
Network | Network Segment% Net Utilization | Depends on type of network | You must determine the threshold based on the type of network you are running. For Ethernet networks, for example, 30 percent is the recommended threshold. |
Paging File | Paging File% Usage | Above 70 percent | Review this value in conjunction with Available Bytes and Pages/sec to understand paging activity on your computer. |
Processor | Processor% Processor Time | 85 percent | Find the process that is using a high percentage of processor time. Upgrade to a faster processor or install an additional processor. |
Processor | Processor Interrupts/sec | Depends on processor; for current CPUs, use a threshold of 1500 interrupts per second | A dramatic increase in this counter value without a corresponding increase in system activity indicates a hardware problem. Identify the network adapter or disk controller card causing the interrupts. You might need to install an additional adapter or controller card. |
Server | ServerBytes Total/sec | If the sum of Bytes Total/sec for all servers is roughly equal to the maximum transfer rates of your network, you might need to segment the network. | |
Server | ServerWork Item Shortages | 3 | If the value reaches this threshold, consider tuning the InitWorkItems or MaxWorkItems entries in the registry (in HKEY_LOCAL_MACHINESYSTEMCurrentControlSetServiceslanmanserverParameters). For more information about MaxWorkItems, see the Microsoft Knowledge Base link on the Web Resources page at http://windows.microsoft.com/windows2000/reskit/webresources Caution: Do not use a registry editor to edit the registry directly unless you have no alternative. The registry editors bypass the standard safeguards provided by administrative tools. These safeguards prevent you from entering conflicting settings or settings that are likely to degrade performance or damage your system. Editing the registry directly can have serious, unexpected consequences that can prevent the system from starting and require that you reinstall Windows 2000. To configure or customize Windows 2000, use the programs in Control Panel or Microsoft Management Console whenever possible. |
Server | Server Work QueuesQueue Length | 4 | If the value reaches this threshold, there might be a processor bottleneck. This is an instantaneous counter; observe its value over several intervals. |
Multiple Processors | SystemProcessor Queue Length | 2 | This is an instantaneous counter; observe its value over several intervals. |