Hi all,
I'm currently in the process of setting up a GPU node to accept multiple jobs per GPU (by using a shared device memory pool).
This works well so far, but I was wondering if there is an intended way to set the default value of a custom machine resource (`GPUMemoryMB` in my case).
It seems to default to 0 if `request_GPUMemoryMB` is not set in the submit file, but I need it to default to the maximum value instead (32000 in this case).
As we have multiple nodes with different types of GPUs, the default also needs to be configurable for each machine individually.
Some resources do seem to have such defaults (for example `JOB_DEFAULT_REQUESTDISK`), but I think those are exceptions.
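To make the situation concrete, here is a minimal sketch of a submit file that sets the request explicitly. The executable name is a placeholder for illustration only and is not taken from my actual setup. If the `request_GPUMemoryMB` line is left out, the request seems to fall back to 0, which is the behaviour I would like to change.

```
# Hypothetical submit file; "my_gpu_job.sh" is a placeholder name.
executable = my_gpu_job.sh

# Explicit request for the custom machine resource.
# 32000 is the maximum value on this particular node.
request_GPUMemoryMB = 32000

queue
```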
I've attached the relevant configs I'm currently using and would appreciate any advice on this topic.

Cheers and thanks,
Tim Voigtländer
Attachment: config.config