ABX database grows large and causes postgres startup failure
https://knowledge.broadcom.com/external/article?articleNumber=325840
Products
VMwareVMware Aria Suite
Issue/Introduction
Symptoms:
- You are using VMware vRealize Automation 8.8 - 8.11.1.
- VM disk 2 (/data) continuously runs out of space.
- /opt/scripts/deploy.sh script runs for very long time or times out while trying to start postgres pods.
- Upgrade runs for very long time or times out during service startup in deploy.sh.
Verifying Symptoms
- SSH to one node in the Aria Automation cluster.
- Run the following command
du -sh /data/db/live
- Run the following command to connect to the ABX database
vracli dev psql abx-db
- Run the following query to check the size of the database and row counts
- The numbers from Step 4 are in the range of hundreds of gigabytes and is close to the number reported in Step #2.
Environment
VMware vRealize Automation SaltStack Config 8.11.x
VMware vRealize Automation 8.10.x
VMware vRealize Automation 8.8.x
VMware vRealize Automation 8.x
VMware vRealize Automation 8.9.x
VMware vRealize Automation 8.11.x
VMware Aria Automation 8.12.x
VMware Aria Automation Config 8.12.x
VMware vRealize Automation 8.10.x
VMware vRealize Automation 8.8.x
VMware vRealize Automation 8.x
VMware vRealize Automation 8.9.x
VMware vRealize Automation 8.11.x
VMware Aria Automation 8.12.x
VMware Aria Automation Config 8.12.x
Cause
Upon each action update, ABX stores its content in the database. In some cases, an obsolete version of this content remains in the database causing a gradual growth with not needed records.
Resolution
This data records leak is resolved in VMware vRealize Automation 8.11.2 and above.
Clean-up logic is introduced in VMware Aria Automation 8.13.0 (formerly vRealize Automation).
See the Workaround section for additional information for versions 8.8. - 8.11.2.
Workaround:
Clean-up logic is introduced in VMware Aria Automation 8.13.0 (formerly vRealize Automation).
See the Workaround section for additional information for versions 8.8. - 8.11.2.
Workaround:
Patch Procedure for 8.8 - 8.12.2
Prerequisites
- Create simultaneous VM snapshots without memory (8.8 - 8.9) for each appliance in the cluster.
- For versions 8.9. and above, use the quiesce option.
Procedure
- SSH into one appliance in the cluster.
- Run the following command:
echo "[Unit]
Description=Clean-up service for ABX database for issue 48848

[Service]
Type=simple
ExecStart=/bin/bash -c 'echo "IyEvYmluL2Jhc2gKCiMKIyBDb3B5cmlnaHQgKGMpIDIwMjMgVk13YXJlLCBJbmMuICBBbGwgcmlnaHRzIHJlc2VydmVkLgojCgpzZXQgK3gKc2V0ICtlCgpsb2dEaXI9Ii92YXIvbG9nL3Ztd2FyZS9wcmVsdWRlIgp0aW1lc3RhbXA9JChkYXRlICcrJVktJW0tJWQtJUgtJU0tJVMnKQpwcj0iNDg4NDgiCkxPR19QQVRIPSIkbG9nRGlyL21haW50ZW5hbmNlLXBhdGNoLXByJHByLSR0aW1lc3RhbXAubG9nIgpleHBvcnQgS1VCRUNPTkZJRz0vZXRjL2t1YmVybmV0ZXMvYWRtaW4uY29uZgoKbWtkaXIgLXAgIiRsb2dEaXIiCgpmdW5jdGlvbiBsb2coKSB7CiAgbG9jYWwgbXNnPSIkMSIKICBsb2NhbCBsZXZlbD0iJDIiCiAgbG9jYWwgZHQ9JChkYXRlICcrJVktJW0tJWQgJUg6JU06JVMnKQogIGxvY2FsIGhvc3ROYW1lPSIkKGhvc3RuYW1lIC1mKSIKCiAgZWNobyAiWyRsZXZlbF1bJGR0XVskaG9zdE5hbWVdICRtc2ciIHwgdGVlIC1hICIkTE9HX1BBVEgiCn0KCmZ1bmN0aW9uIGxvZ19pbmZvKCkgewogIGxvZyAiJDEiICJJTkZPIgp9CgpmdW5jdGlvbiBsb2dfZXJyb3IoKSB7CiAgbG9nICIkMSIgIkVSUk9SIgp9CgpmdW5jdGlvbiBjaGVja19hcHBsaWFuY2VfdmVyc2lvbigpIHsKICBsb2NhbCB2YW1pVmVyPSQoL29wdC92bXdhcmUvYmluL3ZhbWljbGkgdmVyc2lvbiAtLWFwcGxpYW5jZSkKICBsb2NhbCBhcHBsVmVyPSQoZWNobyAiJHZhbWlWZXIiIHwgZ3JlcCAiVmVyc2lvbiAtICIgfCBhd2sgJ3twcmludCAkM30nKQoKICBsb2dfaW5mbyAiQXBwbGlhbmNlIHZlcnNpb246ICRhcHBsVmVyIgoKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC44LjAiICYmIHJldHVybiAwCiAgZWNobyAiJGFwcGxWZXIiIHwgZ3JlcCAtcSAiXjguOC4xIiAmJiByZXR1cm4gMAogIGVjaG8gIiRhcHBsVmVyIiB8IGdyZXAgLXEgIl44LjguMiIgJiYgcmV0dXJuIDAKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC45LjEiICYmIHJldHVybiAwCiAgZWNobyAiJGFwcGxWZXIiIHwgZ3JlcCAtcSAiXjguMTAuMCIgJiYgcmV0dXJuIDAKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC4xMC4xIiAmJiByZXR1cm4gMAogIGVjaG8gIiRhcHBsVmVyIiB8IGdyZXAgLXEgIl44LjEwLjIiICYmIHJldHVybiAwCiAgZWNobyAiJGFwcGxWZXIiIHwgZ3JlcCAtcSAiXjguMTEuMSIgJiYgcmV0dXJuIDAKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC4xMS4yIiAmJiByZXR1cm4gMAogIGVjaG8gIiRhcHBsVmVyIiB8IGdyZXAgLXEgIl44LjEyLjAiICYmIHJldHVybiAwCiAgZWNobyAiJGFwcGxWZXIiIHwgZ3JlcCAtcSAiXjguMTIuMSIgJiYgcmV0dXJuIDAKICBlY2hvICIkYXBwbFZlciIgfCBncmVwIC1xICJeOC4xMi4yIiAmJiByZXR1cm4gMAoKICByZXR1cm4gMQp9CgpmdW5jdGlvbiBrOHNfcmVhZHkoKSB7CiAgbG9jYWwgbm9kZXNTdGF0dXM9JCh0aW1lb3V0IDMwMCAvdXNyL2xvY2FsL2Jpbi9rdWJlY3RsIGdldCBub2RlcyAtLW5vLWhlYWRlcnMgfCBhd2sgJ3twcmludCAkMn0nKQogIGxvY2FsIHJlc3VsdD0kPwoKICAjIFRoZXJlJ3MgUmVhZHkgc3RhdGUgaW4gdGhlIG5vZGVzIGxpc3QuCiAgW1sgJHJlc3VsdCAtZXEgMCBdXSAmJiB7IGVjaG8gIiRub2Rlc1N0YXR1cyIgfCBzb3J0IC11IHwgZ3JlcCAtcSAiUmVhZHkiICYmIHJlc3VsdD0wIHx8IHJlc3VsdD0yOyB9CgogICMgVGhlIG9ubHkgYXZhaWxhYmxlIG5vZGUgc3RhdGUgaXMgUmVhZHkuCiAgW1sgJHJlc3VsdCAtZXEgMCBdXSAmJiB7IFtbICQoZWNobyAiJG5vZGVzU3RhdHVzIiB8IHNvcnQgLXUgfCB3YyAtbCkgLWVxIDEgXV0gJiYgcmVzdWx0PTAgfHwgcmVzdWx0PTM7IH0KCiAgIyBDbHVzdGVyIHJlbW90ZSBjb21tdW5pY2F0aW9uIGlzIE9OIGUuZy4gYWxsIHJlbGV2YW50IHBvZHMgaW4ga3ViZS1zeXN0ZW0gYXJlIHVwLgogIFtbICRyZXN1bHQgLWVxIDAgXV0gJiYgeyB0aW1lb3V0IDMwMCB2cmFjbGkgY2x1c3RlciBleGVjIC0tIGJhc2ggLWMgJy91c3IvYmluL2RhdGUnICYmIHJlc3VsdD0wIHx8IHJlc3VsdD00OyB9CgogIHJldHVybiAkcmVzdWx0Cn0KCmZ1bmN0aW9uIHBzcWxfcmVhZHkoKSB7CiAgbG9jYWwgY291bnQ9JCgvdXNyL2xvY2FsL2Jpbi9rdWJlY3RsIC1uIHByZWx1ZGUgZ2V0IHBvZHMgfCBncmVwIHBvc3RncmVzIHwgZ3JlcCAiUnVubmluZyIgfCBncmVwICIxLzEiIHwgd2MgLWwpCgogIFtbICRjb3VudCAtZXEgMyBdXSAmJiByZXR1cm4gMCB8fCByZXR1cm4gMQp9CgpmdW5jdGlvbiBwc3FsX3ByaW1hcnkoKSB7CiAgbG9jYWwgLW4gcHJpbWFyeT0kMQogIHByaW1hcnk9IiQodnJhY2xpIHN0YXR1cyB8IGpxIC1yICcuZGF0YWJhc2VOb2Rlc1tdIHwgc2VsZWN0KC5Sb2xlPT0icHJpbWFyeSIpLiJOb2RlIG5hbWUiJyB8IGN1dCAtZCAnLicgLWYgMSkiCn0KCmZ1bmN0aW9uIHBzcWxfZGVsZXRlX2JhdGNoKCkgewogIGxvY2FsIHByaW1hcnk9IiQxIgogIGxvY2FsIHJvd3NBZmZlY3RlZD0iJCgvdXNyL2xvY2FsL2Jpbi9rdWJlY3RsIGV4ZWMgLW4gcHJlbHVkZSAiJHByaW1hcnkiIC0tIGNocHN0IC11IHBvc3RncmVzIHBzcWwgXAogICAgLWQgImFieC1kYiIgXAogICAgLWMgIgogICAgICBERUxFVEUKICAgICAgRlJPTSBhYnhfYWN0aW9uX2NvbnRlbnQKICAgICAgV0hFUkUgaWQgaW4gKAogICAgICAgIFNFTEVDVCBpZAogICAgICAgIEZST00gYWJ4X2FjdGlvbl9jb250ZW50CiAgICAgICAgV0hFUkUgaWQgbm90IElOICgKICAgICAgICAgIFNFTEVDVCBhY3Rpb24tPj4nY29udGVudElkJyBBUyBjb250ZW50X2lkCiAgICAgICAgICBGUk9NIHB1YmxpYy5hYnhfYWN0aW9uX3ZlcnNpb24KICAgICAgICAgIFdIRVJFIGFjdGlvbi0+Pidjb250ZW50SWQnIElTIE5PVCBOVUxMCiAgICAgICAgICBVTklPTgogICAgICAgICAgU0VMRUNUIGNvbnRlbnRfaWQKICAgICAgICAgIEZST00gcHVibGljLmFieF9hY3Rpb24KICAgICAgICAgIFdIRVJFIGNvbnRlbnRfaWQgSVMgTk9UIE5VTEwKICAgICAgICAgICkgTElNSVQgMTAKICAgICAgICApCiAgICAgICIpIgogIGxvY2FsIHJlc3VsdD0kPwogIFtbICRyZXN1bHQgLW5lIDAgXV0gJiYgcmV0dXJuIDIKICBlY2hvICIkcm93c0FmZmVjdGVkIiB8IGdyZXAgLXEgIkRFTEVURSAwIiAmJiByZXR1cm4gMAogIHJldHVybiAxCn0KCmZ1bmN0aW9uIHBzcWxfcmVwb3J0X3JlbWFpbmluZygpIHsKICBsb2NhbCBwcmltYXJ5PSIkMSIKCiAgL3Vzci9sb2NhbC9iaW4va3ViZWN0bCBleGVjIC1uIHByZWx1ZGUgIiRwcmltYXJ5IiAtLSBjaHBzdCAtdSBwb3N0Z3JlcyBwc3FsIFwKICAgIC1kICJhYngtZGIiIFwKICAgIC1jICIKICAgICBTRUxFQ1QgQ09VTlQoKikgRlJPTSBhYnhfYWN0aW9uX2NvbnRlbnQgV0hFUkUgaWQgTk9UIElOICgKICAgICAgICAgICAgU0VMRUNUIGNvbnRlbnRfaWQgRlJPTSBhYnhfYWN0aW9uIFdIRVJFIGNvbnRlbnRfaWQgSVMgTk9UIE5VTEwpOwogICAgICAiID4+ICIkTE9HX1BBVEgiCn0KCmZ1bmN0aW9uIHBzcWxfdmFjdXVtKCkgewogIGxvY2FsIHByaW1hcnk9IiQxIgogIC91c3IvbG9jYWwvYmluL2t1YmVjdGwgZXhlYyAtbiBwcmVsdWRlICIkcHJpbWFyeSIgLS0gY2hwc3QgLXUgcG9zdGdyZXMgcHNxbCAtZCAiYWJ4LWRiIiAtYyAiVkFDVVVNIEZVTEwgYWJ4X2FjdGlvbl9jb250ZW50OyIKfQoKZnVuY3Rpb24gc3lzdGVtZF9yZW1vdmVfc2VydmljZSgpIHsKICBzeXN0ZW1jdGwgc3RvcCBhYngtZGItY2xlYW51cC00ODg0OC5zZXJ2aWNlCiAgcm0gLWYgL2V0Yy9zeXN0ZW1kL3N5c3RlbS9hYngtZGItY2xlYW51cC00ODg0OC5zZXJ2aWNlCiAgcm0gLWYgL3Zhci9ydW4vYWJ4LWRiLWNsZWFudXAtNDg4NDguc2gKICBzeXN0ZW1jdGwgZGFlbW9uLXJlbG9hZAp9CgojIFZlcnNpb24gY2hlY2sKY2hlY2tfYXBwbGlhbmNlX3ZlcnNpb24KcmVzdWx0PSQ/CgppZiBbICRyZXN1bHQgLW5lIDAgXQp0aGVuCiAgbG9nX2Vycm9yICJUaGUgbWFpbnRlbmFuY2UgcGF0Y2ggaXMgbm90IGNvbXBhdGlibGUgd2l0aCB0aGlzIHByb2R1Y3QgdmVyc2lvbi4iCiAgc3lzdGVtZF9yZW1vdmVfc2VydmljZQoKICBleGl0IDAKZmkKCndoaWxlIFsgMSAtZXEgMSBdCmRvCiAgc2xlZXAgMTAKCiAgbG9nX2luZm8gIkNoZWNraW5nIGF2YWlsYWJpbGl0eSBvZiBLdWJlcm5ldGVzLiIKICBrOHNfcmVhZHkKICByZXN1bHQ9JD8KICBpZiBbICRyZXN1bHQgLW5lIDAgXQogIHRoZW4KICAgIGxvZ19lcnJvciAiS3ViZXJuZXRlcyBub3QgcmVhZHkuIFJldHJ5aW5nIGluIDEwIHNlY29uZHMgLi4uIgogICAgY29udGludWUKICBmaQoKICBsb2dfaW5mbyAiQ2hlY2tpbmcgYXZhaWxhYmlsaXR5IG9mIFBvc3RncmVzLiIKICBwc3FsX3JlYWR5CiAgcmVzdWx0PSQ/CiAgaWYgWyAkcmVzdWx0IC1uZSAwIF0KICB0aGVuCiAgICBsb2dfZXJyb3IgIlBvc3RncmVzIG5vdCByZWFkeS4gUmV0cnlpbmcgaW4gMTAgc2Vjb25kcyAuLi4iCiAgICBjb250aW51ZQogIGZpCgogIGxvZ19pbmZvICJTZWFjaGluZyBmb3IgcG9zdGdyZXMgcHJpbWFyeS4iCiAgcHNxbF9wcmltYXJ5IHBzcWxfcG9kX3ByaW1hcnkKICBpZiBbIC16ICIkcHNxbF9wb2RfcHJpbWFyeSIgXQogIHRoZW4KICAgIGxvZ19lcnJvciAiUG9zdGdyZXMgcHJpbWFyeSBub3QgZm91bmQuIFJldHJ5aW5nIGluIDEwIHNlY29uZHMgLi4uIgogICAgY29udGludWUKICBmaQoKICBsb2dfaW5mbyAiUmVtYWluaW5nIG9ycGhhbmVkIHJlY29yZHMgZm9yIHJlbW92YWw6IgogIHBzcWxfcmVwb3J0X3JlbWFpbmluZyAiJHBzcWxfcG9kX3ByaW1hcnkiCgogIGlkeD0wCiAgd2hpbGUgWyAxIC1lcSAxIF0KICBkbwogICAgbG9nX2luZm8gIkRlbGV0aW5nIGEgYmF0Y2ggb2Ygb3JwaGFuZWQgZGF0YSByZWNvcmRzIC4uLiIKICAgIHBzcWxfZGVsZXRlX2JhdGNoICIkcHNxbF9wb2RfcHJpbWFyeSIKICAgIHJlc3VsdD0kPwoKICAgIGlmIFsgJHJlc3VsdCAtZXEgMSBdCiAgICB0aGVuCiAgICAgIHNsZWVwIDIKCiAgICAgICgoaWR4KyspKQogICAgICBpZiBbICQoKGlkeCAlIDUwKSkgLWVxIDAgXQogICAgICB0aGVuCiAgICAgICAgbG9nX2luZm8gIlJlbWFpbmluZyBvcnBoYW5lZCByZWNvcmRzIGZvciByZW1vdmFsczoiCiAgICAgICAgcHNxbF9yZXBvcnRfcmVtYWluaW5nICIkcHNxbF9wb2RfcHJpbWFyeSIKICAgICAgZmkKICAgICAgY29udGludWUKICAgIGVsaWYgWyAkcmVzdWx0IC1lcSAyIF0KICAgIHRoZW4KICAgICAgbG9nX2Vycm9yICJBbiBlcnJvciBvY2N1cnJlZCB3aGlsZSBkZWxldGluZyBvcnBoYW5lZCBkYXRhIHJlY29yZHMiCiAgICAgIGJyZWFrCiAgICBlbHNlCiAgICAgIGxvZ19pbmZvICJCYXRjaCBkZWxldGVzIGNvbXBsZXRlLiIKCiAgICAgIGlmIFsgJGlkeCAtZ3QgMSBdCiAgICAgIHRoZW4KICAgICAgICBsb2dfaW5mbyAiVmFjdXVtaW5nIC4uLiIKICAgICAgICBwc3FsX3ZhY3V1bSAiJHBzcWxfcG9kX3ByaW1hcnkiCiAgICAgIGZpCiAgICAgIGJyZWFrCiAgICBmaQogIGRvbmUKCiAgaWYgWyAkcmVzdWx0IC1lcSAwIF0KICB0aGVuCiAgICBsb2dfaW5mbyAiQ2xlYW4tdXAgY29tcGxldGVkIHN1Y2Nlc3NmdWxseS4iCiAgICBicmVhawogIGZpCmRvbmUKCiMgU2VsZiBkZWxldGUKc3lzdGVtZF9yZW1vdmVfc2VydmljZQo=" | base64 -d > /var/run/abx-db-cleanup-48848.sh; chmod a+x /var/run/abx-db-cleanup-48848.sh; /var/run/abx-db-cleanup-48848.sh'" | base64 -d > /etc/systemd/system/abx-db-cleanup-48848.service; systemctl daemon-reload; systemctl start abx-db-cleanup-48848.service;
- Review the /var/log/vmware/prelude/maintenance-patch-pr48848-timestamp.log files to monitor progress.
Note: This procedure may take many hours. Periodically the "Remaining orphaned records for removal" message will appear within the log with the remaining orphaned ABX data subject to this clean-up.
Note: Once the clean-up is complete, the maintenance script will uninstall itself. However, for vRA versions prior to 8.11.2, the ABX service will create new orphaned records after the clean-up is complete. Therefore, the procedure should be repeated if the system continues operating for very long periods of time before the next scheduled upgrade.
Note: Once the clean-up is complete, the maintenance script will uninstall itself. However, for vRA versions prior to 8.11.2, the ABX service will create new orphaned records after the clean-up is complete. Therefore, the procedure should be repeated if the system continues operating for very long periods of time before the next scheduled upgrade.
Comments
Post a Comment