Skip to main content

 Aria Orchestrator Automation /var/log is 100% full

https://knowledge.broadcom.com/external/article?articleNumber=378276

Products

VMware Aria Suite

Issue/Introduction

  • VMware Aria Orchestrator/Automation (vRA/vRO) appliance /var/log file system shows as 100% used.

Environment

  • VMware Aria Orchestrator/Automation 8.x

Cause

  • Two main scenarios that can cause the filling of the /var/log/ files system in are:
    1. Improper configuration of log file rotation: If not configured properly, the log rotation can cause large-sized files and also numerous log files, which over time, can consume all the available space. 
    2. Creation of large Java Memory dump (.hprof ) files:
      • Aria Orchestrator/Automation can run into an OutOfMemory exception based upon configuration and system usage which will dump an hprof file to disk.  This can subsequently cause a full drive.

Resolution

  • We can use the below methods to clear and reclaim space:  (take snapshots of the appliance before executing the steps)

1. Identify Large Log Files:

    • Start by identifying the largest files consuming space in /var/log:
      du -sh /var/log/* | sort -rh | head -20
    • This command will list the largest files and directories, allowing you to pinpoint the culprits.

2. Clear or Compress Large Log Files:

    • Delete old logs: If certain log files are no longer needed (e.g., older rotated logs), you can safely delete them:
      sudo rm -f /var/log/<logfile>.gz
    • Truncate large log files: For logs that are currently being written to, you can truncate the file to clear its contents without deleting it:
      sudo truncate -s 0 /var/log/<logfile>

3. Use Log Rotation:

    • Ensure log rotation is working: Log files should be rotated and compressed automatically by logrotate. Check the configuration in /etc/logrotate.conf or /etc/logrotate.d/.
    • If necessary, manually trigger log rotation:
      sudo logrotate -f /etc/logrotate.conf

4. Examine Specific VMware Logs:

    • Aria Orchestrator and vRA logs: These logs may include large files. You can check:
      du -sh /var/log/vmware/* | sort -rh | head -20
    • vRO specific logs: Aria Orchestrator logs are typically found under /var/log/vmware/vco/ Clean unnecessary logs from there.
    • If /var/log/journal is large, check step 8

5. Remove Old Core Dumps or Unused Packages:

    • Check for core dumps or unused package files that may be consuming space:

      sudo find /var/crash -type f -exec rm -f {} \;
      sudo apt-get clean   # For Ubuntu-based systems
      sudo yum clean all   # For RHEL/CentOS-based systems

6. Check for Orphaned Docker Images (Optional):

    • check for unused Docker images, containers, and volumes that may consume space:
      docker system prune -a

7. Delete  /service-logs/prelude/vco-app/file-logs/vco.hprof heap dump files and write a script file to delete the vco.hprof files periodically.

Note: hprof files are heap dumps in binary format, which can be used for detailed analysis of memory-related problems in the Java stack.  These are safe to delete and are not actively in use by a healthy running system.
Note: A cron job can be created to run on a schedule to check for large *hprof files on the system to be removed.

8. Retain only the last X MB of journal logs with  journalctl --vacuum-size=XMB , e.g. to retain only the last 200MB of journal logs: journalctl --vacuum-size=200MB

Note: you may also opt to retain the last X days of logs with: journalctl --vacuum-time=Xd, e.g. to retain the last 2 days of journal logs: journalctl --vacuum-time=2d

Comments

Popular posts from this blog

Quick Guide to VCF Automation for VCD Administrators

  Quick Guide to VCF Automation for VCD Administrators VMware Cloud Foundation 9 (VCF 9) has been  released  and with it comes brand new Cloud Management Platform –  VCF Automation (VCFA)  which supercedes both Aria Automation and VMware Cloud Director (VCD). This blog post is intended for those people that know VCD quite well and want to understand how is VCFA similar or different to help them quickly orient in the new direction. It should be emphasized that VCFA is a new solution and not just rebranding of an old one. However it reuses a lot of components from its predecessors. The provider part of VCFA called Tenenat Manager is based on VCD code and the UI and APIs will be familiar to VCD admins, while the tenant part inherist a lot from Aria Automation and especially for VCD end-users will look brand new. Deployment and Architecture VCFA is generaly deployed from VCF Operations Fleet Management (former Aria Suite LCM embeded in VCF Ops. Fleet Management...
  Issue with Aria Automation Custom form Multi Value Picker and Data Grid https://knowledge.broadcom.com/external/article?articleNumber=345960 Products VMware Aria Suite Issue/Introduction Symptoms: Getting  error " Expected Type String but was Object ", w hen trying to use Complex Types in MultiValue Picker on the Aria for Automation Custom Form. Environment VMware vRealize Automation 8.x Cause This issue has been identified where the problem appears when a single column Multi Value Picker or Data Grid is used. Resolution This is a known issue. There is a workaround.  Workaround: As a workaround, try adding one empty column in the Multivalue picker without filling the options. So we can add one more column without filling the value which will be hidden(there is a button in the designer page that will hide the column). This way the end user will receive the same view.  
  "Cloud zone insights not available yet, please check after some time" message on Aria Automation https://knowledge.broadcom.com/external/article?articleNumber=314894 Products VMware Aria Suite Issue/Introduction Symptoms: The certificate for Aria operations has been replaced since it was initially added to Aria Automation as an integration. When accessing the Insights pane under  Cloud Assembly  ->  Infrastructure  ->  Cloud Zone  ->  Insights  the following message is displayed:   "Cloud zone insights not available yet, please check after some time." The  /var/log/services-logs/prelude/hcmp-service-app/file-logs/hcmp-service-app.log  file contains ssl errors similar to:   2022-08-25T20:06:43.989Z ERROR hcmp-service [host='hcmp-service-app-xxxxxxx-xxxx' thread='Thread-56' user='' org='<org_id>' trace='<trace_id>' parent='<parent_id>' span='<span_id>'] c.v.a.h.a.common.AlertEnu...