Open OnDemand Troubleshooting

From UFRC
Revision as of 14:27, 26 September 2023 by Moskalenko (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Open OnDemand is a web platform for accessing and managing HPC resources. If you're encountering issues while using Open OnDemand, here are some common troubleshooting steps you can take:

Check the Logs

The first thing you should do when encountering an issue with Open OnDemand is to check the logs. The logs can provide useful information about what went wrong and why.

You can find the logs in the ~/ondemand/data/sys/dashboard/batch_connect/sys/APPLICATION directory where APPLICATION is the name of the tool you are running. Inside that directory there will be a number of subdirectories with unique names matching the 'Session ID's from your OOD Session List. Change into the appropriate directory and page through the 'output.log' log. The error is likely to be listed at the bottom of the file and say something obvious like 'out of disk space' or 'Some of your processes may have been killed by the cgroup out-of-memory handler'.

Full Home Directory

If you are unable to access your files or launch jobs, it may be due to your home directory being full. Check your disk usage with the command du -sh ~/* or use the 'ncdu' tool from the 'ufrc' environment module in the terminal. If you find that your home directory is full, you remove unnecessary files. Make sure to move all data used in jobs to the Blue filesystem to avoid filling up the home directory and violating the RC storage policy.

Proxy Errors

If you are unable to connect to Open OnDemand, you may be experiencing issues with your network proxy settings. Try configuring your proxy settings to allow connections to the ondemand.rc.ufl.edu domain. If you are still unable to connect, please contact us for assistance.

Clear Browser Cache

If you're encountering issues with the OnDemand web interface, try clearing your browser cache. This can often resolve issues with outdated or corrupted cached files.

Check Network Connectivity

Ensure that your computer is connected to the network and that you can reach the OnDemand server. You can use the ping command to test network connectivity:

ping <ondemand_server_ip>

Replace <ondemand_server_ip> with the IP address of the OnDemand server.