Sunday, July 10, 2016

How to troubleshoot disk-related issues in AIX?



If the disk is a local disk, run the commands below to troubleshoot the issue.


Commands                        Explanation
# lspv                          List the disk details.
# lsvg -p <vg_name>             Check whether any disk in the VG is in the missing state.
# varyonvg <vg_name>            Bring the disk from the missing state back to active.
# errpt                         Check the error details.
# errpt -N hdiskX               Check the error details for a particular disk.
# errpt -aj <identifier>        Show the detailed error output for the disk (by error identifier).
# lquerypv -h /dev/hdiskX       Dump the disk header; the output should not be all zeros.
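
The lsvg -p check above can be sketched as a small filter. This is only an illustration: the function name non_active_pvs and the volume group name datavg are made up here, and the filter assumes the usual lsvg -p layout (a "<vg_name>:" line, a column header line, then one line per disk with the PV state in the second column).

```shell
# Print every physical volume whose PV STATE is not "active".
# Reads `lsvg -p <vg_name>` output on stdin.
# Assumption: line 1 is "<vg_name>:", line 2 is the column header,
# and each later line is "<pv_name> <state> ...".
non_active_pvs() {
    awk 'NR > 2 && NF >= 2 && $2 != "active" { print $1 " " $2 }'
}

# Usage on an AIX host (datavg is a placeholder volume group name):
#   lsvg -p datavg | non_active_pvs
```

If the filter prints nothing, all disks in the VG are active; any line it prints names a disk that varyonvg may be able to bring back.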


If issues are found, log a case with IBM and replace the disk.
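
The "should not be all zero" check on the lquerypv output can be automated roughly as follows. The helper name header_has_data is hypothetical, and the sketch assumes each dump line has the form "offset word word word word |ascii|" with hexadecimal words.

```shell
# Succeed (exit 0) if the hex dump on stdin contains at least one
# non-zero data word; fail (exit 1) if it is empty or all zeros.
# Assumption: fields 2-5 of every line are the four hex data words.
header_has_data() {
    awk '
        {
            for (i = 2; i <= 5; i++)
                if ($i ~ /^[0-9A-Fa-f]+$/ && $i !~ /^0+$/)
                    found = 1
        }
        END { exit !found }
    '
}

# Usage on an AIX host (hdiskX is a placeholder):
#   lquerypv -h /dev/hdiskX | header_has_data || echo "header is blank or unreadable"
```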

======================================================================


If the disk is a virtual disk provided by a VIOS, run the commands below to troubleshoot the issue.


Commands                        Explanation
# lspv                          List the disk details.
# lsdev -Cc disk                Check whether the disk is local or virtual.
# errpt                         Check the error details.
# errpt -N hdiskX               Check the error details for a particular disk.
# lspath                        Check whether the disk has a single path or dual paths.
# lscfg -vpl hdiskX             Get the slot number (C#) of the disk (example: C110).

Run the commands below on the VIOS:

# lsdev -slots|grep -i CXX      Identify the vhost adapter that corresponds to the disk on the LPAR.
# lsmap -all|grep -p vhostX     Get the disk details that correspond to the disk on the LPAR.
# errpt                         Check the error details (run at the AIX prompt).
# errpt -N hdiskX               Check the error details for a particular disk (run at the AIX prompt).
# errpt -aj <identifier>        Show the detailed error output for the disk (run at the AIX prompt).
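
To make the single/dual path check easier to read, the lspath output can be summarized per disk. This is a sketch only: the function name path_summary is made up, and it assumes the default lspath output of "status disk parent" on each line.

```shell
# Summarize paths per disk from `lspath` output on stdin.
# Assumption: each line is "<status> <disk> <parent>",
# for example "Enabled hdisk0 vscsi0".
path_summary() {
    awk '
        { total[$2]++; if ($1 == "Enabled") ok[$2]++ }
        END {
            for (d in total)
                printf "%s paths=%d enabled=%d\n", d, total[d], ok[d]
        }
    ' | sort
}

# Usage on an AIX host:
#   lspath | path_summary
```

A disk reporting paths=1 is single-pathed, and a disk whose enabled count is lower than its path count has at least one path that is not in the Enabled state.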

If issues are found, log a case with IBM and replace the disk.
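
Pulling the C# out of the lscfg output by eye is error-prone, so a small helper can do it. The name slot_from_lscfg is hypothetical, and the pattern assumes a virtual SCSI location code of the form ...-V<n>-C<n>-T<n>-L..., which may differ on other device types.

```shell
# Print the slot number (for example C110) found in the location code
# of `lscfg -vpl hdiskX` output read from stdin.
# Assumption: the location code looks like U<...>-V3-C110-T1-L8100000000000000.
slot_from_lscfg() {
    sed -n 's/.*-V[0-9][0-9]*-\(C[0-9][0-9]*\)-T[0-9].*/\1/p' | head -n 1
}

# Usage (hdiskX is a placeholder):
#   slot=$(lscfg -vpl hdiskX | slot_from_lscfg)    # on the LPAR
#   lsdev -slots | grep -i "$slot"                 # on the VIOS
```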


======================================================================


If the disk is provided by any other (external) storage, run the commands below to troubleshoot the issue.



Commands                                   Explanation
# lsdev -Cc disk                           Check whether the disk is local, virtual, or from external storage.
# lsdev -Cl hdiskX -F parent               Identify the parent of the hard disk.
# lsdev -Cl fscsiX -F parent               Identify the parent FC adapter (fcsX).
# fcstat fcsX                              Check the FC card details.
# fcstat fcsX|awk '/Transmit/,/Words/'     If the transmit and receive counters do not change between runs, there is a connectivity problem.


If everything looks normal on the fcs adapter, the problem is most likely on the storage side; pass it to the storage team.
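
The "counters not changing" check on fcstat can be scripted by comparing two snapshots taken some seconds apart. This is a minimal sketch: the function names fc_frames and fc_traffic_moving and the /tmp file names are made up, and it assumes the fcstat output has a line of the form "Frames: <transmit> <receive>" under the Transmit/Receive Statistics header.

```shell
# Print "<transmit_frames> <receive_frames>" from fcstat output on stdin.
# Assumption: the first line whose first field is "Frames:" carries
# the transmit and receive frame counters.
fc_frames() {
    awk '$1 == "Frames:" { print $2, $3; exit }'
}

# Succeed (exit 0) if the frame counters differ between two saved
# snapshots, i.e. traffic is moving; fail (exit 1) if they are identical.
fc_traffic_moving() {
    [ "$(fc_frames < "$1")" != "$(fc_frames < "$2")" ]
}

# Usage on an AIX host (fcsX is a placeholder):
#   fcstat fcsX > /tmp/fc.before; sleep 30; fcstat fcsX > /tmp/fc.after
#   fc_traffic_moving /tmp/fc.before /tmp/fc.after || echo "counters unchanged"
```

Unchanged counters only point to a connectivity problem if the host was actually trying to do I/O through that adapter during the interval.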