Why am I getting this error when I try to submit jobs on the cluster?
sbatch: error: Batch job submission failed: Invalid qos specification
There are several reasons you might receive this error but it is mostly likely that your group's access has expired. Every resource at CCR is accessed through an allocation. Allocations have expiration dates. It is the responsibility of the faculty member in charge of the group's account to renew the allocation before it expires. Faculty are sent email reminders starting at 60 days before the allocation expires. When an allocation expires, the users in the group are removed from the resource and will no longer be able to login to it.
The faculty group leader can have the group's cluster access reinstated at any time by requesting a new allocation to the resource. Instructions for this process can be found here
How can you tell if this is the case for your group?
All users have access to the allocation management portal, Coldfront. Login using your CCR username and password and you will see the details of your project and what resources you have access to.
In this example, you see there are no active allocations being displayed. Click on the name of the project, to see all the details of it including all the users in the group and any allocations you have access to. In this example, you have access to the ub-hpc (academic) cluster and that allocation expired on December 10, 2018:
If the PI has purchased any additional storage or has given you access to a private cluster, you will see that under the list of allocations as well.
What do I do if my allocation is expired?
The only one who can request a new allocation is the faculty member who is responsible for it. Please contact that person and ask them to do so so that you can continue running jobs on the cluster. If the allocation has not yet expired, the faculty member can renew it. However, once the allocation has expired they must request a new allocation by clicking on the "Request Resource Allocation" button and proceeding through that process.