🛠️ ESXi Host Troubleshooting Checklist (With Complete Log Locations)
✅ 1. Host Status & Connectivity
- Check host state in vCenter (Connected / Not Responding / Disconnected)
- Review CPU, RAM, and datastore usage
- Validate HA/DRS recommendations
✅ 2. Hardware Health
- Monitor hardware sensors (CPU, DIMMs, fans, PSU, RAID)
- Check RAID controller logs via vendor tools
- View hardware status in:
- vCenter → Monitor → Hardware Health
- DCUI → Hardware Status
✅ 3. Network Validation
- Verify vSwitches, Port Groups, NIC teaming, VLAN tagging
- Check vmkernel ports (mgmt, vMotion, iSCSI, vSAN)
- Test reachability using:Shell
- Validate physical NIC status and link speed
✅ 4. Storage & Datastore Checks
- Confirm datastore accessibility
- Rescan storage adapters (iSCSI/FC/NFS)
- Validate multipathing (Round Robin / Fixed / MRU)
- Check datastore latency with
esxtop
✅ **5. Critical Logs & Their Locations (Full List)
Here are key ESXi log files that every VMware admin should know:
🔹 Core ESXi Logs
| Log File | Path | Purpose |
|---|---|---|
| vmkernel.log | /var/log/vmkernel.log | Storage, HBA, NIC drivers, device errors, kernel events |
| hostd.log | /var/log/hostd.log | Host management service; VM operations, host config |
| vpxa.log | /var/log/vpxa.log | vCenter agent communication from ESXi side |
| vmkwarning.log | /var/log/vmkwarning.log | System warnings from kernel |
| messages | /var/log/messages | General OS-level events |
🔹 VM & Storage Specific
| Log File | Path | Purpose |
|---|---|---|
| vmfs.log | /var/log/vmfs.log | VMFS volumes, mount/unmount, metadata issues |
| fdm.log | /var/log/fdm.log | HA agent logs |
| vobd.log | /var/log/vobd.log | VOB notifications (system health, storage alerts) |
| nfs.log | /var/log/nfs.log | NFS datastore issues |
| iscsi.log | /var/log/iscsi.log | iSCSI storage adapter logs |
| sanfs.log | /var/log/sanfs.log | vSAN file service logs |
🔹 VM-Specific Log
Each VM has its own log file inside its folder:
/vmfs/volumes/<datastore>/<VM-Name>/vmware.log
🔹 ESXi System Services
| Log File | Path | Purpose |
|---|---|---|
| auth.log | /var/log/auth.log | SSH logins, authentication attempts |
| syslog.log | /var/log/syslog.log | System-level logging, forwarding info |
| shell.log | /var/log/shell.log | ESXi Shell/SSH activity |
✅ 6. Restart Management Services
If host becomes unresponsive to vCenter:
Restart management stack:
Shell
Restart individually:
Shell
Or via DCUI → Troubleshooting Options.
✅ 7. VM-Level Checks
- Check VM power state
- Validate VMware Tools
- Monitor for snapshot sprawl
- Review VM log file (
vmware.log)
✅ 8. Patch, Firmware & Compatibility
- Ensure latest ESXi patch level
- Verify firmware/driver/hardware support on VMware HCL
- Confirm compatibility with vCenter version
Comments
Post a Comment