|Posted on Tuesday, December 01, 2009 - 9:55 am: |
We are using webmo pro version 9.1.002p. In the last week we have had the queue stop working three times. Specifically, the queue is working fine and submitting jobs normally and then suddenly stops working, leaving a large number of calculations queued indefinitely. The only way we have been able to fix this is by stopping the queue and daemon, deleting all of the compute nodes, and then adding them again. This process is quite time consuming and making it difficult to deliver our course in an effective manner. Is there a way to resolve this, or even monitor the queue more closely to identify the origin of these problems?