Skip to main content

 In vSphere, Permanent Device Loss (PDL) and All Paths Down (APD) are two distinct storage connectivity issues. PDL indicates a permanent and unrecoverable loss of a storage device, while APD signifies a temporary loss of access, where the system expects the device to potentially return. VMware vSphere, including VMware Cloud Foundation (VCF), provides mechanisms to handle both PDL and APD to ensure virtual machine availability. 

Here's a more detailed explanation:

Permanent Device Loss (PDL):

Definition:

PDL occurs when a storage device is permanently lost, meaning it's unlikely to be recovered. This can be due to hardware failure, a device being removed without proper procedures, or other unrecoverable errors. 

vSphere Handling:

vSphere interprets certain SCSI sense codes from the storage array as indicators of PDL. Once a PDL is detected, the ESXi host stops retrying I/O to the affected device and the affected VMs are failed over if VM Component Protection (VMCP) is enabled. 

Example:

A storage controller reporting a failure or a LUN being permanently removed would be considered PDL. 

All Paths Down (APD):

Definition:

APD occurs when a storage device becomes unavailable to the ESXi host, and the host cannot determine if the loss is temporary or permanent. 

vSphere Handling:

The ESXi host will retry I/O to the affected device for a configured timeout period (default is 140 seconds). If the device recovers within that time, operations continue without interruption. If the device remains unavailable after the timeout, the host starts to fast-fail I/O operations to the device, but virtual machine I/O continues to be retried indefinitely. 

Example:

A network issue causing a storage array to become unreachable, or a host reboot, could lead to an APD condition. 

VMware Cloud Foundation (VCF) and PDL/APD: 

Integration:

VCF, which integrates vSphere, vSAN, and NSX, also provides mechanisms to handle PDL and APD events. 

Automated Remediation:

VCF's automation capabilities can be leveraged to orchestrate failover procedures and ensure virtual machines are migrated to healthy hosts in the event of PDL or APD. 

Consistency:

VCF aims to provide a consistent and reliable infrastructure, and its management capabilities play a crucial role in mitigating the impact of storage connectivity issues. 




Comments

Popular posts from this blog

Quick Guide to VCF Automation for VCD Administrators

  Quick Guide to VCF Automation for VCD Administrators VMware Cloud Foundation 9 (VCF 9) has been  released  and with it comes brand new Cloud Management Platform –  VCF Automation (VCFA)  which supercedes both Aria Automation and VMware Cloud Director (VCD). This blog post is intended for those people that know VCD quite well and want to understand how is VCFA similar or different to help them quickly orient in the new direction. It should be emphasized that VCFA is a new solution and not just rebranding of an old one. However it reuses a lot of components from its predecessors. The provider part of VCFA called Tenenat Manager is based on VCD code and the UI and APIs will be familiar to VCD admins, while the tenant part inherist a lot from Aria Automation and especially for VCD end-users will look brand new. Deployment and Architecture VCFA is generaly deployed from VCF Operations Fleet Management (former Aria Suite LCM embeded in VCF Ops. Fleet Management...
  Issue with Aria Automation Custom form Multi Value Picker and Data Grid https://knowledge.broadcom.com/external/article?articleNumber=345960 Products VMware Aria Suite Issue/Introduction Symptoms: Getting  error " Expected Type String but was Object ", w hen trying to use Complex Types in MultiValue Picker on the Aria for Automation Custom Form. Environment VMware vRealize Automation 8.x Cause This issue has been identified where the problem appears when a single column Multi Value Picker or Data Grid is used. Resolution This is a known issue. There is a workaround.  Workaround: As a workaround, try adding one empty column in the Multivalue picker without filling the options. So we can add one more column without filling the value which will be hidden(there is a button in the designer page that will hide the column). This way the end user will receive the same view.  
  "Cloud zone insights not available yet, please check after some time" message on Aria Automation https://knowledge.broadcom.com/external/article?articleNumber=314894 Products VMware Aria Suite Issue/Introduction Symptoms: The certificate for Aria operations has been replaced since it was initially added to Aria Automation as an integration. When accessing the Insights pane under  Cloud Assembly  ->  Infrastructure  ->  Cloud Zone  ->  Insights  the following message is displayed:   "Cloud zone insights not available yet, please check after some time." The  /var/log/services-logs/prelude/hcmp-service-app/file-logs/hcmp-service-app.log  file contains ssl errors similar to:   2022-08-25T20:06:43.989Z ERROR hcmp-service [host='hcmp-service-app-xxxxxxx-xxxx' thread='Thread-56' user='' org='<org_id>' trace='<trace_id>' parent='<parent_id>' span='<span_id>'] c.v.a.h.a.common.AlertEnu...