[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] running executable that exists only in the container



Hi Greg,

First, thanks for noting this flag! I figured there had to be something like it.

So the job didn't run properly due to a shadow exception.

001 (006.000.000) 2023-10-02 14:31:11 Job executing on host: <127.0.0.1:9618?addrs=127.0.0.1-9618&alias=minicondor&noUDP&sock=startd_989_f341>
ÂÂÂÂÂÂÂ SlotName: slot1@minicondor
ÂÂÂÂÂÂÂ CondorScratchDir = "/var/lib/condor/execute/dir_114758"
ÂÂÂÂÂÂÂ Cpus = 1
ÂÂÂÂÂÂÂ Disk = 12381279
ÂÂÂÂÂÂÂ Memory = 1918
...
007 (006.000.000) 2023-10-02 14:31:12 Shadow exception!
ÂÂÂÂÂÂÂ Error from slot1@minicondor: Singularity test failed:FATAL:ÂÂ While checking image: could not open image /var/lib/condor/execute/dir_114758/lolcow_latest.sif: failed to retrieve path for /var/lib/condor/execute/dir_114758/lolcow_latest.sif: lstat /var/lib/condor/execute/dir_114758/lolcow_latest.sif: no such
ÂÂÂÂÂÂÂ 0Â -Â Run Bytes Sent By Job
ÂÂÂÂÂÂÂ 0Â -Â Run Bytes Received By Job

The submit file is below:

universeÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ = container
container_imageÂÂÂÂÂÂÂÂ = ./lolcow_latest.sif
transfer_executableÂÂÂ = false
executableÂÂÂÂÂ = cowsay
argumentsÂÂÂ Â = moo
outputÂÂÂÂÂÂÂÂÂÂÂÂ = job.out
errorÂÂÂÂÂÂÂÂÂÂÂÂÂÂ = job.err
logÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ = job.log
request_cpusÂÂ = 1
request_memory = 1024M
request_diskÂÂ = 1024M

queue 1

What sort of read/write/execute permissions does a container image need to have in order for them to be used by the cluster?

Cheers,
Matt

Matthew T. West
DevOps & HPC SysAdmin
University of Exeter, Research IT
http://www.exeter.ac.uk/research/researchcomputing/support/researchit
57 Laver Building, North Park Road, Exeter, EX4 4QE, United Kingdom

On 01/10/2023 20:26, Greg Thain via HTCondor-users wrote:
CAUTION: This email originated from outside of the organisation. Do not click links or open attachments unless you recognise the sender and know the content is safe.


On 10/1/23 1:13 PM, Matthew T West via HTCondor-users wrote:
Evening all,

I am testing out a minicondor instance on a RHEL9 VM with Singularity
(3.11.5-1.el9) installed. For a simple test, I am trying to run
`cowsay moo` in a local copy of lolcow_latest.sif. How do I specify
that the executable I want to run is in the container itself, not the
submit directory?
https://docs.sylabs.io/guides/3.5/user-guide/quick_start.html#interact-with-images

Hi Matt:

Try setting


transfer_executable = false

or setting the executable = /absolute/path/to/coway


-greg

_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/