Skip to main content

 ABX database grows large and causes postgres startup failure

https://knowledge.broadcom.com/external/article?articleNumber=325840


Products

VMwareVMware Aria Suite

Issue/Introduction

Symptoms:
  • You are using VMware vRealize Automation 8.8 - 8.11.1.
  • VM disk 2 (/data) continuously runs out of space.
  • /opt/scripts/deploy.sh script runs for very long time or times out while trying to start postgres pods.
  • Upgrade runs for very long time or times out during service startup in deploy.sh.

Verifying Symptoms

  1. SSH to one node in the Aria Automation cluster.
  2. Run the following command
    du -sh /data/db/live
  3. Run the following command to connect to the ABX database
    vracli dev psql abx-db
  4. Run the following query to check the size of the database and row counts
    SELECT pg_size_pretty(
        (cast(pg_total_relation_size('abx_action_content') AS decimal)
        / (SELECT COUNT(*) FROM abx_action_content))
        * (SELECT COUNT(*) FROM abx_action_content WHERE id NOT IN (
        	SELECT content_id FROM abx_action WHERE content_id IS NOT NULL)));
  5. The numbers from Step 4 are in the range of hundreds of gigabytes and is close to the number reported in Step #2.


Environment

VMware vRealize Automation SaltStack Config 8.11.x
VMware vRealize Automation 8.10.x
VMware vRealize Automation 8.8.x
VMware vRealize Automation 8.x
VMware vRealize Automation 8.9.x
VMware vRealize Automation 8.11.x
VMware Aria Automation 8.12.x
VMware Aria Automation Config 8.12.x

Cause

Upon each action update, ABX stores its content in the database. In some cases, an obsolete version of this content remains in the database causing a gradual growth with not needed records.

Resolution

This data records leak is resolved in VMware vRealize Automation 8.11.2 and above.

Clean-up logic is introduced in VMware Aria Automation 8.13.0 (formerly vRealize Automation).

See the Workaround section for additional information for versions 8.8. - 8.11.2.

Workaround:

Patch Procedure for 8.8 - 8.12.2

Prerequisites

  • Create simultaneous VM snapshots without memory (8.8 - 8.9) for each appliance in the cluster.
    • For versions 8.9. and above, use the quiesce option.

Procedure

  1. SSH into one appliance in the cluster.
  2. Run the following command:
    echo "[Unit]
Description=Clean-up service for ABX database for issue 48848

[Service]
Type=simple
ExecStart=/bin/bash -c 'echo "IyEvYmluL2Jhc2gKCiMKIyBDb3B5cmlnaHQgKGMpIDIwMjMgVk13YXJlLCBJbmMuICBBbGwgcmlnaHRzIHJlc2VydmVkLgojCgpzZXQgK3gKc2V0ICtlCgpsb2dEaXI9Ii92YXIvbG9nL3Ztd2FyZS9wcmVsdWRlIgp0aW1lc3RhbXA9JChkYXRlICcrJVktJW0tJWQtJUgtJU0tJVMnKQpwcj0iNDg4NDgiCkxPR19QQVRIPSIkbG9nRGlyL21haW50ZW5hbmNlLXBhdGNoLXByJHByLSR0aW1lc3RhbXAubG9nIgpleHBvcnQgS1VCRUNPTkZJRz0vZXRjL2t1YmVybmV0ZXMvYWRtaW4uY29uZgoKbWtkaXIgLXAgIiRsb2dEaXIiCgpmdW5jdGlvbiBsb2coKSB7CiAgbG9jYWwgbXNnPSIkMSIKICBsb2NhbCBsZXZlbD0iJDIiCiAgbG9jYWwgZHQ9JChkYXRlICcrJVktJW0tJWQgJUg6JU06JVMnKQogIGxvY2FsIGhvc3ROYW1lPSIkKGhvc3RuYW1lIC1mKSIKCiAgZWNobyAiWyRsZXZlbF1bJGR0XVskaG9zdE5hbWVdICRtc2ciIHwgdGVlIC1hICIkTE9HX1BBVEgiCn0KCmZ1bmN0aW9uIGxvZ19pbmZvKCkgewogIGxvZyAiJDEiICJJTkZPIgp9CgpmdW5jdGlvbiBsb2dfZXJyb3IoKSB7CiAgbG9nICIkMSIgIkVSUk9SIgp9CgpmdW5jdGlvbiBjaGVja19hcHBsaWFuY2VfdmVyc2lvbigpIHsKICBsb2NhbCB2YW1pVmVyPSQoL29wdC92bXdhcmUvYmluL3ZhbWljbGkgdmVyc2lvbiAtLWFwcGxpYW5jZSkKICBsb2NhbCBhcHBsVmVyPSQoZWNobyAiJHZhbWlWZXIiIHwgZ3JlcCAiVmVyc2lvbiAtICIgfCBhd2sgJ3twcmludCAkM30nKQoKICBsb2dfaW5mbyAiQXBwbGlhbmNlIHZlcnNpb246ICRhcHBsVmVyIgoKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC44LjAiICYmIHJldHVybiAwCiAgZWNobyAiJGFwcGxWZXIiIHwgZ3JlcCAtcSAiXjguOC4xIiAmJiByZXR1cm4gMAogIGVjaG8gIiRhcHBsVmVyIiB8IGdyZXAgLXEgIl44LjguMiIgJiYgcmV0dXJuIDAKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC45LjEiICYmIHJldHVybiAwCiAgZWNobyAiJGFwcGxWZXIiIHwgZ3JlcCAtcSAiXjguMTAuMCIgJiYgcmV0dXJuIDAKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC4xMC4xIiAmJiByZXR1cm4gMAogIGVjaG8gIiRhcHBsVmVyIiB8IGdyZXAgLXEgIl44LjEwLjIiICYmIHJldHVybiAwCiAgZWNobyAiJGFwcGxWZXIiIHwgZ3JlcCAtcSAiXjguMTEuMSIgJiYgcmV0dXJuIDAKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC4xMS4yIiAmJiByZXR1cm4gMAogIGVjaG8gIiRhcHBsVmVyIiB8IGdyZXAgLXEgIl44LjEyLjAiICYmIHJldHVybiAwCiAgZWNobyAiJGFwcGxWZXIiIHwgZ3JlcCAtcSAiXjguMTIuMSIgJiYgcmV0dXJuIDAKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC4xMi4yIiAmJiByZXR1cm4gMAoKICByZXR1cm4gMQp9CgpmdW5jdGlvbiBrOHNfcmVhZHkoKSB7CiAgbG9jYWwgbm9kZXNTdGF0dXM9JCh0aW1lb3V0IDMwMCAvdXNyL2xvY2FsL2Jpbi9rdWJlY3RsIGdldCBub2RlcyAtLW5vLWhlYWRlcnMgfCBhd2sgJ3twcmludCAkMn0nKQogIGxvY2FsIHJlc3VsdD0kPwoKICAjIFRoZXJlJ3MgUmVhZHkgc3RhdGUgaW4gdGhlIG5vZGVzIGxpc3QuCiAgW1sgJHJlc3VsdCAtZXEgMCBdXSAmJiB7IGVjaG8gIiRub2Rlc1N0YXR1cyIgfCBzb3J0IC11IHwgZ3JlcCAtcSAiUmVhZHkiICYmIHJlc3VsdD0wIHx8IHJlc3VsdD0yOyB9CgogICMgVGhlIG9ubHkgYXZhaWxhYmxlIG5vZGUgc3RhdGUgaXMgUmVhZHkuCiAgW1sgJHJlc3VsdCAtZXEgMCBdXSAmJiB7IFtbICQoZWNobyAiJG5vZGVzU3RhdHVzIiB8IHNvcnQgLXUgfCB3YyAtbCkgLWVxIDEgXV0gJiYgcmVzdWx0PTAgfHwgcmVzdWx0PTM7IH0KCiAgIyBDbHVzdGVyIHJlbW90ZSBjb21tdW5pY2F0aW9uIGlzIE9OIGUuZy4gYWxsIHJlbGV2YW50IHBvZHMgaW4ga3ViZS1zeXN0ZW0gYXJlIHVwLgogIFtbICRyZXN1bHQgLWVxIDAgXV0gJiYgeyB0aW1lb3V0IDMwMCB2cmFjbGkgY2x1c3RlciBleGVjIC0tIGJhc2ggLWMgJy91c3IvYmluL2RhdGUnICYmIHJlc3VsdD0wIHx8IHJlc3VsdD00OyB9CgogIHJldHVybiAkcmVzdWx0Cn0KCmZ1bmN0aW9uIHBzcWxfcmVhZHkoKSB7CiAgbG9jYWwgY291bnQ9JCgvdXNyL2xvY2FsL2Jpbi9rdWJlY3RsIC1uIHByZWx1ZGUgZ2V0IHBvZHMgfCBncmVwIHBvc3RncmVzIHwgZ3JlcCAiUnVubmluZyIgfCBncmVwICIxLzEiIHwgd2MgLWwpCgogIFtbICRjb3VudCAtZXEgMyBdXSAmJiByZXR1cm4gMCB8fCByZXR1cm4gMQp9CgpmdW5jdGlvbiBwc3FsX3ByaW1hcnkoKSB7CiAgbG9jYWwgLW4gcHJpbWFyeT0kMQogIHByaW1hcnk9IiQodnJhY2xpIHN0YXR1cyB8IGpxIC1yICcuZGF0YWJhc2VOb2Rlc1tdIHwgc2VsZWN0KC5Sb2xlPT0icHJpbWFyeSIpLiJOb2RlIG5hbWUiJyB8IGN1dCAtZCAnLicgLWYgMSkiCn0KCmZ1bmN0aW9uIHBzcWxfZGVsZXRlX2JhdGNoKCkgewogIGxvY2FsIHByaW1hcnk9IiQxIgogIGxvY2FsIHJvd3NBZmZlY3RlZD0iJCgvdXNyL2xvY2FsL2Jpbi9rdWJlY3RsIGV4ZWMgLW4gcHJlbHVkZSAiJHByaW1hcnkiIC0tIGNocHN0IC11IHBvc3RncmVzIHBzcWwgXAogICAgLWQgImFieC1kYiIgXAogICAgLWMgIgogICAgICBERUxFVEUKICAgICAgRlJPTSBhYnhfYWN0aW9uX2NvbnRlbnQKICAgICAgV0hFUkUgaWQgaW4gKAogICAgICAgIFNFTEVDVCBpZAogICAgICAgIEZST00gYWJ4X2FjdGlvbl9jb250ZW50CiAgICAgICAgV0hFUkUgaWQgbm90IElOICgKICAgICAgICAgIFNFTEVDVCBhY3Rpb24tPj4nY29udGVudElkJyBBUyBjb250ZW50X2lkCiAgICAgICAgICBGUk9NIHB1YmxpYy5hYnhfYWN0aW9uX3ZlcnNpb24KICAgICAgICAgIFdIRVJFIGFjdGlvbi0+Pidjb250ZW50SWQnIElTIE5PVCBOVUxMCiAgICAgICAgICBVTklPTgogICAgICAgICAgU0VMRUNUIGNvbnRlbnRfaWQKICAgICAgICAgIEZST00gcHVibGljLmFieF9hY3Rpb24KICAgICAgICAgIFdIRVJFIGNvbnRlbnRfaWQgSVMgTk9UIE5VTEwKICAgICAgICAgICkgTElNSVQgMTAKICAgICAgICApCiAgICAgICIpIgogIGxvY2FsIHJlc3VsdD0kPwogIFtbICRyZXN1bHQgLW5lIDAgXV0gJiYgcmV0dXJuIDIKICBlY2hvICIkcm93c0FmZmVjdGVkIiB8IGdyZXAgLXEgIkRFTEVURSAwIiAmJiByZXR1cm4gMAogIHJldHVybiAxCn0KCmZ1bmN0aW9uIHBzcWxfcmVwb3J0X3JlbWFpbmluZygpIHsKICBsb2NhbCBwcmltYXJ5PSIkMSIKCiAgL3Vzci9sb2NhbC9iaW4va3ViZWN0bCBleGVjIC1uIHByZWx1ZGUgIiRwcmltYXJ5IiAtLSBjaHBzdCAtdSBwb3N0Z3JlcyBwc3FsIFwKICAgIC1kICJhYngtZGIiIFwKICAgIC1jICIKICAgICBTRUxFQ1QgQ09VTlQoKikgRlJPTSBhYnhfYWN0aW9uX2NvbnRlbnQgV0hFUkUgaWQgTk9UIElOICgKICAgICAgICAgICAgU0VMRUNUIGNvbnRlbnRfaWQgRlJPTSBhYnhfYWN0aW9uIFdIRVJFIGNvbnRlbnRfaWQgSVMgTk9UIE5VTEwpOwogICAgICAiID4+ICIkTE9HX1BBVEgiCn0KCmZ1bmN0aW9uIHBzcWxfdmFjdXVtKCkgewogIGxvY2FsIHByaW1hcnk9IiQxIgogIC91c3IvbG9jYWwvYmluL2t1YmVjdGwgZXhlYyAtbiBwcmVsdWRlICIkcHJpbWFyeSIgLS0gY2hwc3QgLXUgcG9zdGdyZXMgcHNxbCAtZCAiYWJ4LWRiIiAtYyAiVkFDVVVNIEZVTEwgYWJ4X2FjdGlvbl9jb250ZW50OyIKfQoKZnVuY3Rpb24gc3lzdGVtZF9yZW1vdmVfc2VydmljZSgpIHsKICBzeXN0ZW1jdGwgc3RvcCBhYngtZGItY2xlYW51cC00ODg0OC5zZXJ2aWNlCiAgcm0gLWYgL2V0Yy9zeXN0ZW1kL3N5c3RlbS9hYngtZGItY2xlYW51cC00ODg0OC5zZXJ2aWNlCiAgcm0gLWYgL3Zhci9ydW4vYWJ4LWRiLWNsZWFudXAtNDg4NDguc2gKICBzeXN0ZW1jdGwgZGFlbW9uLXJlbG9hZAp9CgojIFZlcnNpb24gY2hlY2sKY2hlY2tfYXBwbGlhbmNlX3ZlcnNpb24KcmVzdWx0PSQ/CgppZiBbICRyZXN1bHQgLW5lIDAgXQp0aGVuCiAgbG9nX2Vycm9yICJUaGUgbWFpbnRlbmFuY2UgcGF0Y2ggaXMgbm90IGNvbXBhdGlibGUgd2l0aCB0aGlzIHByb2R1Y3QgdmVyc2lvbi4iCiAgc3lzdGVtZF9yZW1vdmVfc2VydmljZQoKICBleGl0IDAKZmkKCndoaWxlIFsgMSAtZXEgMSBdCmRvCiAgc2xlZXAgMTAKCiAgbG9nX2luZm8gIkNoZWNraW5nIGF2YWlsYWJpbGl0eSBvZiBLdWJlcm5ldGVzLiIKICBrOHNfcmVhZHkKICByZXN1bHQ9JD8KICBpZiBbICRyZXN1bHQgLW5lIDAgXQogIHRoZW4KICAgIGxvZ19lcnJvciAiS3ViZXJuZXRlcyBub3QgcmVhZHkuIFJldHJ5aW5nIGluIDEwIHNlY29uZHMgLi4uIgogICAgY29udGludWUKICBmaQoKICBsb2dfaW5mbyAiQ2hlY2tpbmcgYXZhaWxhYmlsaXR5IG9mIFBvc3RncmVzLiIKICBwc3FsX3JlYWR5CiAgcmVzdWx0PSQ/CiAgaWYgWyAkcmVzdWx0IC1uZSAwIF0KICB0aGVuCiAgICBsb2dfZXJyb3IgIlBvc3RncmVzIG5vdCByZWFkeS4gUmV0cnlpbmcgaW4gMTAgc2Vjb25kcyAuLi4iCiAgICBjb250aW51ZQogIGZpCgogIGxvZ19pbmZvICJTZWFjaGluZyBmb3IgcG9zdGdyZXMgcHJpbWFyeS4iCiAgcHNxbF9wcmltYXJ5IHBzcWxfcG9kX3ByaW1hcnkKICBpZiBbIC16ICIkcHNxbF9wb2RfcHJpbWFyeSIgXQogIHRoZW4KICAgIGxvZ19lcnJvciAiUG9zdGdyZXMgcHJpbWFyeSBub3QgZm91bmQuIFJldHJ5aW5nIGluIDEwIHNlY29uZHMgLi4uIgogICAgY29udGludWUKICBmaQoKICBsb2dfaW5mbyAiUmVtYWluaW5nIG9ycGhhbmVkIHJlY29yZHMgZm9yIHJlbW92YWw6IgogIHBzcWxfcmVwb3J0X3JlbWFpbmluZyAiJHBzcWxfcG9kX3ByaW1hcnkiCgogIGlkeD0wCiAgd2hpbGUgWyAxIC1lcSAxIF0KICBkbwogICAgbG9nX2luZm8gIkRlbGV0aW5nIGEgYmF0Y2ggb2Ygb3JwaGFuZWQgZGF0YSByZWNvcmRzIC4uLiIKICAgIHBzcWxfZGVsZXRlX2JhdGNoICIkcHNxbF9wb2RfcHJpbWFyeSIKICAgIHJlc3VsdD0kPwoKICAgIGlmIFsgJHJlc3VsdCAtZXEgMSBdCiAgICB0aGVuCiAgICAgIHNsZWVwIDIKCiAgICAgICgoaWR4KyspKQogICAgICBpZiBbICQoKGlkeCAlIDUwKSkgLWVxIDAgXQogICAgICB0aGVuCiAgICAgICAgbG9nX2luZm8gIlJlbWFpbmluZyBvcnBoYW5lZCByZWNvcmRzIGZvciByZW1vdmFsczoiCiAgICAgICAgcHNxbF9yZXBvcnRfcmVtYWluaW5nICIkcHNxbF9wb2RfcHJpbWFyeSIKICAgICAgZmkKICAgICAgY29udGludWUKICAgIGVsaWYgWyAkcmVzdWx0IC1lcSAyIF0KICAgIHRoZW4KICAgICAgbG9nX2Vycm9yICJBbiBlcnJvciBvY2N1cnJlZCB3aGlsZSBkZWxldGluZyBvcnBoYW5lZCBkYXRhIHJlY29yZHMiCiAgICAgIGJyZWFrCiAgICBlbHNlCiAgICAgIGxvZ19pbmZvICJCYXRjaCBkZWxldGVzIGNvbXBsZXRlLiIKCiAgICAgIGlmIFsgJGlkeCAtZ3QgMSBdCiAgICAgIHRoZW4KICAgICAgICBsb2dfaW5mbyAiVmFjdXVtaW5nIC4uLiIKICAgICAgICBwc3FsX3ZhY3V1bSAiJHBzcWxfcG9kX3ByaW1hcnkiCiAgICAgIGZpCiAgICAgIGJyZWFrCiAgICBmaQogIGRvbmUKCiAgaWYgWyAkcmVzdWx0IC1lcSAwIF0KICB0aGVuCiAgICBsb2dfaW5mbyAiQ2xlYW4tdXAgY29tcGxldGVkIHN1Y2Nlc3NmdWxseS4iCiAgICBicmVhawogIGZpCmRvbmUKCiMgU2VsZiBkZWxldGUKc3lzdGVtZF9yZW1vdmVfc2VydmljZQo=" | base64 -d > /var/run/abx-db-cleanup-48848.sh; chmod a+x /var/run/abx-db-cleanup-48848.sh; /var/run/abx-db-cleanup-48848.sh'" | base64 -d > /etc/systemd/system/abx-db-cleanup-48848.service; systemctl daemon-reload; systemctl start abx-db-cleanup-48848.service;
  3. Review the /var/log/vmware/prelude/maintenance-patch-pr48848-timestamp.log files to monitor progress.
Note: This procedure may take many hours. Periodically the "Remaining orphaned records for removal" message will appear within the log with the remaining orphaned ABX data subject to this clean-up.
Note: Once the clean-up is complete, the maintenance script will uninstall itself. However, for vRA versions prior to 8.11.2, the ABX service will create new orphaned records after the clean-up is complete. Therefore, the procedure should be repeated if the system continues operating for very long periods of time before the next scheduled upgrade.

Comments

Popular posts from this blog

Quick Guide to VCF Automation for VCD Administrators

  Quick Guide to VCF Automation for VCD Administrators VMware Cloud Foundation 9 (VCF 9) has been  released  and with it comes brand new Cloud Management Platform –  VCF Automation (VCFA)  which supercedes both Aria Automation and VMware Cloud Director (VCD). This blog post is intended for those people that know VCD quite well and want to understand how is VCFA similar or different to help them quickly orient in the new direction. It should be emphasized that VCFA is a new solution and not just rebranding of an old one. However it reuses a lot of components from its predecessors. The provider part of VCFA called Tenenat Manager is based on VCD code and the UI and APIs will be familiar to VCD admins, while the tenant part inherist a lot from Aria Automation and especially for VCD end-users will look brand new. Deployment and Architecture VCFA is generaly deployed from VCF Operations Fleet Management (former Aria Suite LCM embeded in VCF Ops. Fleet Management...
  Issue with Aria Automation Custom form Multi Value Picker and Data Grid https://knowledge.broadcom.com/external/article?articleNumber=345960 Products VMware Aria Suite Issue/Introduction Symptoms: Getting  error " Expected Type String but was Object ", w hen trying to use Complex Types in MultiValue Picker on the Aria for Automation Custom Form. Environment VMware vRealize Automation 8.x Cause This issue has been identified where the problem appears when a single column Multi Value Picker or Data Grid is used. Resolution This is a known issue. There is a workaround.  Workaround: As a workaround, try adding one empty column in the Multivalue picker without filling the options. So we can add one more column without filling the value which will be hidden(there is a button in the designer page that will hide the column). This way the end user will receive the same view.  
  "Cloud zone insights not available yet, please check after some time" message on Aria Automation https://knowledge.broadcom.com/external/article?articleNumber=314894 Products VMware Aria Suite Issue/Introduction Symptoms: The certificate for Aria operations has been replaced since it was initially added to Aria Automation as an integration. When accessing the Insights pane under  Cloud Assembly  ->  Infrastructure  ->  Cloud Zone  ->  Insights  the following message is displayed:   "Cloud zone insights not available yet, please check after some time." The  /var/log/services-logs/prelude/hcmp-service-app/file-logs/hcmp-service-app.log  file contains ssl errors similar to:   2022-08-25T20:06:43.989Z ERROR hcmp-service [host='hcmp-service-app-xxxxxxx-xxxx' thread='Thread-56' user='' org='<org_id>' trace='<trace_id>' parent='<parent_id>' span='<span_id>'] c.v.a.h.a.common.AlertEnu...