Open OnDemand Troubleshooting

From UFRC
Revision as of 20:18, 18 March 2023 by Israel.herrera (talk | contribs)
Jump to navigation Jump to search

Open OnDemand is a web platform for accessing and managing HPC resources. If you're encountering issues while using Open OnDemand, here are some common troubleshooting steps you can take:

Full Home Directory

If you are unable to access your files or launch jobs, it may be due to your home directory being full. Check your disk usage with the command du -sh ~/* in the terminal. If you find that your home directory is full, you can try removing unnecessary files or requesting additional disk space.

Proxy Errors

If you are unable to connect to Open OnDemand, you may be experiencing issues with your network proxy settings. Try configuring your proxy settings to allow connections to the ondemand.rc.ufl.edu domain. If you are still unable to connect, please contact us for assistance.

Check File and Directory Permissions

Make sure that the OnDemand files and directories have the correct permissions. You can check the permissions by running the following command:

ls -l /opt/ood

Make sure that the files and directories are owned by the correct user and group, and that the permissions are set correctly.

Check Configuration Files

Make sure that the OnDemand configuration files are set up correctly. You can check the configuration files by running the following command:

ls /etc/ood/config

Make sure that the files exist and are set up correctly. If you need to modify

Example Example

Clear Browser Cache

If you're encountering issues with the OnDemand web interface, try clearing your browser cache. This can often resolve issues with outdated or corrupted cached files.

Check the Logs

The first thing you should do when encountering an issue with Open OnDemand is to check the logs. The logs can provide useful information about what went wrong and why.

You can find the logs in the /var/log/ondemand-nginx directory. Look for log files that correspond to the time when you encountered the issue. The logs are typically named access.log and error.log. You can use the tail command to view the last few lines of a log file:

tail -f /var/log/ondemand-nginx/access.log
tail -f /var/log/ondemand-nginx/error.log

Check the OnDemand Status

You can check the status of the OnDemand services by running the following command:

systemctl status httpd24-httpd ondemand-nginx

This will show you whether the services are running or not. If a service is not running, you can start it by running the following command:

systemctl start <service_name>

Replace <service_name> with the name of the service that is not running.

Restart the OnDemand Services

Sometimes restarting the OnDemand services can resolve issues. You can restart the services by running the following commands:

systemctl restart httpd24-httpd ondemand-nginx

Check Network Connectivity

Ensure that your computer is connected to the network and that you can reach the OnDemand server. You can use the ping command to test network connectivity:

ping <ondemand_server_ip>

Replace <ondemand_server_ip> with the IP address of the OnDemand server.

Check Firewall Rules

Make sure that the firewall on the OnDemand server is not blocking incoming traffic. You can check the firewall rules by running the following command:

iptables -L

If you need to open a port, you can do so by running the following command:

iptables -A INPUT -p tcp --dport <port_number> -j ACCEPT

Replace <port_number> with the number of the port that you want to open.

Check SELinux Settings

If you're encountering issues with permissions, check the SELinux settings on the OnDemand server. You can check the SELinux status by running the following command:

sestatus

If SELinux is enabled, you may need to modify the settings to allow the OnDemand services to access the necessary files and directories.