vSphere 6 ESXTOP quick Overview for Troubleshooting
ESXTOP Command overview
CPU c Fields: D F
For changing to the different views type:
m Memory
c CPU
n Network
f
V
2
8
i
d
u
Interrupts
Disk Adapter
Disk Device
for add/remove fields
show only virtual machine instances
highlight a row scrolling down
highlight a row scrolling up
spacebar:
s 2:
v
p
x
Disk VM
Power states
vsan
o
k
e
change field order
kill a world
expand/rollup (where available)
%USED: CPU Core cycles used by a VM. High values are an indicator for VMs causing performance
problems on ESXi Hosts.
%SYS: Percentage of time spent by system to process interrupts and to perform other system activities
on behalf of the world.
Possible cause: maybe caused by high I/O VM
refresh screen
refresh screen every two seconds
%RDY: Percentage of time a VM was waiting to be scheduled. If you note values
between five and ten percent take care.
Possible reasons: too many vCPUs, too many vSMP VMs or a CPU limit setting
(check %MLMTD)
Note: for SMP VMs with multiple vCPUs ESXTOP accumulate %rdy for all vCPUs,
resulting in higher values. If you want to see the values for each dedicated vCPU,
press e to Expand/Rollup CPU statistics and insert the GID of the VM you want to
analyse.
>10
>10
CPU load average for the last one, five and 15 minutes
Network n Fields: A B C D E F K L
%CSTP: This value is interesting if you are using vSMP virtual machines. It shows
the percentage of time a ready to run VM has spent in co-deschedule state.
_3
>
If value is >3 decrease the number of vCPUs from the VM concerned.
%MLMTD: Counter showing percentage of time a ready to run vCPU was not
scheduled because of a CPU limit setting. Remove limit for better performance.
%VMWAIT: percentage of time a VM was waiting for some VMkernel activity to complete (such as I/O)
before it can continue. Includes %SWPWT and blocked, but not IDLE Time (as %WAIT does).
Possible cause: Storage performance issue | latency to a device in the VM configuration eg. USB
device, serial pass-through device or parallel pass-through device
100
Used-by/Team-PNIC: provide information what physical NIC a VM is actually using.
%DRPTX, %DRPRX: Dropped Packages transmitted/Dropped Packages received. Values
larger 0 are a sign for high network utilization
_1
>
Memory m Fields: B D J K Q
>1
%SWPWT: Counter showing how long a VM has to wait for swapped pages read
from disk. A reason for this could be memory overcommitment. Pay attention if
%SWPWT is >5!
_5
>
Disk d Fields: A B G J
Memory State:
high
enough free memory available
(normal TPS cycles)
clear <100% of minFree: ESXi actively
calls TPS to collapse pages
soft
<64% of minFree: Host reclaims
memory by balloon driver + TPS
hard <32% of minFree: Host starts to
swap, compress + TPS / no more
ballooning
low
<16% of minFree: ESXi blocks
VMs from allocating more RAM +
average memory overcommitment for the last one, five and 15 minutes
MCTLSZ: Amount of guest physial memory (MB) the ESXi Host is reclaiming by ballon driver. A reason for this
is memory overcommitment.
_1
>
How to calculate minFree: minFree
depends on the host memory configuration: for the first 28 GB RAM minFree =
899 MB + 1% from the remaining RAM
Eg. a host with 100 GB RAM: 899 MB
+720 MB (1% of 72 GB RAM) = minFree
1619MB
SWCUR: Memory (in MB) that has been swapped by VMKernel.
Possible cause: memory overcommitment.
>1
CACHEUSD: Memory (in MB) compressed
by ESXi Host
_1
>
SWR/s, SWW/s: Rate at which the ESXi Host is writing to or reading from swapped memory.
Possible cause: memory overcommitment.
>1
ZIP/s: Values larger 0 indicate that the host
is actively compressing memory.
UNZIP/s: Values larger 0 indicate that the
host is accessing compressed memory.
Reason: memory overcommitment.
_1
>
DAVG: Latency at the device driver level
Indicator for storage performance troubles
ABRTS/s: Commands aborted per
second
If the storage system has not
responded within 60 seconds VMs
with an Windows Operating
_1
System will issue an abort.
>
Resets/s: number of commands
reset per second
_1
>
>25
KAVG: Latency caused by VMKernel
Possible cause: Queuing (wrong queue depth parameter or wrong failover policy)
GAVG: GAVG = DAVG + KAVG
>3
>25
NUMA m (change to memory view) Fields: D G
NLMEM: VM Memory (in MB) located at local Node
NMN: Numa Node where the VM is located
NRMEM: VM Memory (in MB) located at remote Node
N%L: Percentage of VM Memory located at the
local NUMA Node. If this value is less than 80
percent the VM will experience performance
issues.
<80
VSAN x
running-system.com
Copyright 2015 running-system.com
Designed By: Andi Lesslhumer | Version 6.01
@ lessi001
ROLE
READS/s
Name of VSAN DOM Role.
Number of read operations completed per second.
MBREAD/s
WRITE/s
Megabytes read per second.
Number of write operations completed per second.
MNWRITE/s
RECOWR/s
MBRECOWR/s
SDLA
AVGLAT
Megabytes written per second.
Number of recovery write operations completed per second.
Megabytes written per second for recovery
Standard deviation of latency in millisecs for read, write and recovery write.
Average latency in millisecs for read, write and recovery write.
>25