WebMO - Computational chemistry on the WWW
Recent news

WebMO 17.0 is now available for free download!

WebMO 17.0 Pro and Enterprise have a variety of additional features and is available for purchase.

The WebMO app is now available for iOS and Android.

August 17, 2019

WebMO PBS qsub: cannot connect to server Log Out | Topics | Search
Moderators | Edit Profile

WebMO Support Forum » Running WebMO Jobs » Job Manager » WebMO PBS qsub: cannot connect to server « Previous Next »

Author Message
Mike Conner
Unregistered guest
Posted on Thursday, October 26, 2017 - 3:38 pm:   

We've been running WebMO on a small cluster with torque/maui for some time without issues. Recently we added some compute nodes to the cluster (and upgraded to infiniband) and now all jobs submitted through WebMO fail with the error:

socket_connect_unix failed: 15137
qsub: cannot connect to server (null) (errno=15137) could not connect to trqauthd


The webmo user can submit jobs to the batch queue on the command line. trqauthd is running on the master and compute nodes. All other jobs (submitted outside of WebMO to the cluster) are running fine.

I need some help knowing where to look for a misconfiguration.
JR Schmidt
Moderator
Username: Schmidt

Post Number: 570
Registered: 11-2006
Posted on Thursday, October 26, 2017 - 9:27 pm:   

This is an insidious issue that is the result of some Apache defaults in recent distributions. The newest build of WebMO 17 actually check for it in diagnose.pl. Briefly:

By default the new “systemd” daemon creates a “private” /tmp directory for services, including Apache, which breaks the Torque qsub/qstat commands. This must be disabled as follows:

Edit /usr/lib/systemd/system/httpd.service (CentOS, Debian, Ubuntu) or /etc/systemd/system/httpd.service (SuSe) and set
PrivateTmp=false
Restart the daemons
$ sudo systemctl daemon-reload
$ sudo systemctl restart httpd
linux345
Unregistered guest
Posted on Wednesday, November 29, 2017 - 5:49 pm:   

Did you ever find the answer to this - fixing the httpd did not wwork
JR Schmidt
Moderator
Username: Schmidt

Post Number: 581
Registered: 11-2006
Posted on Wednesday, November 29, 2017 - 10:03 pm:   

The PrivateTmp issue is certainly the cause. Disabling the option varies a bit by distribution and requires restarting the httpd service. Make sure to call:

$ sudo systemctl daemon-reload
$ sudo systemctl restart httpd

Restart your server if you are unsure.

Add Your Message Here
Post:
Username: Posting Information:
This is a public posting area. Enter your username and password if you have an account. Otherwise, enter your full name as your username and leave the password blank. Your e-mail address is optional.
Password:
E-mail:
Options: Post as "Anonymous"
Enable HTML code in message
Automatically activate URLs in message
Action:

Topics | Last Day | Last Week | Tree View | Search | Help/Instructions | Program Credits Administration