Skip to main content

 

Replacing a VMware Aria Automation appliance node


When a VMware Aria Automation appliance in a multiple-node, high availability (HA) configuration has failed, you might need to replace the faulty node.

Caution:Before proceeding, VMware recommends that you contact technical support to troubleshoot the HA issue and verify that the problem is isolated to one node.

If technical support determines that you need to replace the node, take the following steps.

  1. In vCenter, take backup snapshots of every appliance in the HA configuration.

    In the backup snapshots, don't include virtual machine memory.

  2. Shut down the faulty node.
  3. Make note of the faulty node VMware Aria Automation software build number, and network settings.

    Note the FQDN, IP address, gateway, DNS servers, and especially MAC address. Later, you assign the same values to the replacement node.

  4. Check the status of the primary database node. From a root command line on any healthy node, run the following:
    > kubectl get pod `vracli status | jq -r '.databaseNodes[] | select(.["Role"] == "primary") | .["Node name"]' | cut -d '.' -f 1` -n prelude -o wide --no-headers=true primary-db-node-name 1/1 Running 0 39h 12.123.2.14 vc-vm-224-84.company.com <none> <none>
    Important:The primary database node must be one of the healthy nodes.

    If the primary database node is faulty, contact technical support instead of proceeding.

  5. From the root command line of the healthy node, remove the faulty node.

    vracli cluster remove faulty-node-FQDN

  6. Use vCenter to deploy a new, replacement VMware Aria Automation node.

    Deploy the same VMware Aria Automation software build number, and apply the network settings from the faulty node. Include the FQDN, IP address, gateway, DNS servers, and especially MAC address that you noted earlier.

  7. Power on the replacement node.
  8. Log in as root to the command line of the replacement node.
  9. Verify that the initial boot sequence has finished by running the following command.

    vracli status first-boot

    Look for a First boot complete message.

  10. From the replacement node, join the VMware Aria Automation cluster.
    Note:If your VMware Aria Automation deployment is patched, refer to the workaround in KB 96619.

    vracli cluster join primary-DB-node-FQDN

  11. Log in as root to the command line of the primary database node.
  12. Deploy the repaired cluster by running the following script:

    /opt/scripts/deploy.sh

Comments

Popular posts from this blog

  Issue with Aria Automation Custom form Multi Value Picker and Data Grid https://knowledge.broadcom.com/external/article?articleNumber=345960 Products VMware Aria Suite Issue/Introduction Symptoms: Getting  error " Expected Type String but was Object ", w hen trying to use Complex Types in MultiValue Picker on the Aria for Automation Custom Form. Environment VMware vRealize Automation 8.x Cause This issue has been identified where the problem appears when a single column Multi Value Picker or Data Grid is used. Resolution This is a known issue. There is a workaround.  Workaround: As a workaround, try adding one empty column in the Multivalue picker without filling the options. So we can add one more column without filling the value which will be hidden(there is a button in the designer page that will hide the column). This way the end user will receive the same view.  

57 Tips Every Admin Should Know

Active Directory 1. To quickly list all the groups in your domain, with members, run this command: dsquery group -limit 0 | dsget group -members –expand 2. To find all users whose accounts are set to have a non-expiring password, run this command: dsquery * domainroot -filter “(&(objectcategory=person)(objectclass=user)(lockoutTime=*))” -limit 0 3. To list all the FSMO role holders in your forest, run this command: netdom query fsmo 4. To refresh group policy settings, run this command: gpupdate 5. To check Active Directory replication on a domain controller, run this command: repadmin /replsummary 6. To force replication from a domain controller without having to go through to Active Directory Sites and Services, run this command: repadmin /syncall 7. To see what server authenticated you (or if you logged on with cached credentials) you can run either of these commands: set l echo %logonserver% 8. To see what account you are logged on as, run this command: ...
  The Guardrails of Automation VMware Cloud Foundation (VCF) 9.0 has redefined private cloud automation. With full-stack automation powered by Ansible and orchestrated through vRealize Orchestrator (vRO), and version-controlled deployments driven by GitOps and CI/CD pipelines, teams can build infrastructure faster than ever. But automation without guardrails is a recipe for risk Enter RBAC and policy enforcement. This third and final installment in our automation series focuses on how to secure and govern multi-tenant environments in VCF 9.0 with role-based access control (RBAC) and layered identity management. VCF’s IAM Foundation VCF 9.x integrates tightly with enterprise identity providers, enabling organizations to define and assign roles using existing Active Directory (AD) groups. With its persona-based access model, administrators can enforce strict boundaries across compute, storage, and networking resources: Personas : Global Admin, Tenant Admin, Contributor, Viewer Projec...