How to manage and monitor local cluster resource allocation?

April 11, 2018
Clustering Development Ubuntu

We are setting up two GPU server rigs (https://in.pcpartpicker.com/list/GJstm8). These servers should be accessible from anywhere over SSH and will be used for deep learning. We want to allocate 25% of the GPU resources (out of the total 24GB) to each user at a time. On request, we will assign more if it is free, up to a maximum of 50%. How can we manage user authentication and roles?

1 Answer

There are tools for managing SSH access across multiple machines, such as KeyBox, which provides a central store for SSH key access.

I am not aware of any way to impose per-user resource limits on GPUs under Linux. If this is currently possible, I would recommend asking on a cryptocurrency mining forum, as that is the most likely place this type of configuration would be used.
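
Enforcement aside, the quota policy described in the question (25% by default, up to 50% when free, out of 24GB) can at least be tracked in software. Below is a minimal bookkeeping sketch under some assumptions: the 24GB is treated as a single pool, and the names `GpuQuotaBook`, `request`, and `release` are hypothetical, not part of any existing tool. It only records grants and does not limit anything on the GPU itself.

```python
from dataclasses import dataclass, field

TOTAL_GB = 24.0        # total GPU memory, taken from the question
DEFAULT_SHARE = 0.25   # share granted on a user's first request
MAX_SHARE = 0.50       # upper limit per user, granted only if free


@dataclass
class GpuQuotaBook:
    """Hypothetical ledger of GPU memory grants (bookkeeping only)."""
    total_gb: float = TOTAL_GB
    grants: dict = field(default_factory=dict)  # user name -> GB granted

    def free_gb(self) -> float:
        # Memory not yet granted to anyone.
        return self.total_gb - sum(self.grants.values())

    def request(self, user: str) -> float:
        """Grant the default share first; later requests top up
        towards the maximum share if memory is free.
        Returns the user's total grant in GB after the request."""
        current = self.grants.get(user, 0.0)
        share = DEFAULT_SHARE if current == 0.0 else MAX_SHARE
        target = self.total_gb * share
        extra = min(target - current, self.free_gb())
        if extra <= 0:
            return current  # nothing free, or already at the limit
        self.grants[user] = current + extra
        return self.grants[user]

    def release(self, user: str) -> None:
        # Return all of the user's memory to the pool.
        self.grants.pop(user, None)
```

With this sketch, a first `request("alice")` records 6GB (25% of 24GB) and a second one raises it to 12GB (50%) if that much is still free. Actual enforcement would still have to happen elsewhere, for example inside each user's deep learning framework or a job scheduler.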
