Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] hanging file transfers
- Date: Wed, 21 Aug 2019 19:33:55 +0000
- From: John M Knoeller <johnkn@xxxxxxxxxxx>
- Subject: Re: [HTCondor-users] hanging file transfers
> The manual has JobStatus 6 for "transferring output" but none transferring input, is that correct?
yes. but there is a Boolean attribute that is TRUE while a job is transferring input, and a second one
that indicates if it is waiting in the queue or actually transferring, so the expression
TransferringInput && ! TransferQueued
Will evaluate to true if the job is actually transferring input data.
By the way, there is also a TransferringOutput Boolean, in addition to the job status.
-tj
-----Original Message-----
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> On Behalf Of Dimitri Maziuk via HTCondor-users
Sent: Wednesday, August 21, 2019 1:47 PM
To: htcondor-users@xxxxxxxxxxx
Cc: Dimitri Maziuk <dmaziuk@xxxxxxxxxxxxx>
Subject: [HTCondor-users] hanging file transfers
Hi all,
every once in a while we get jobs stuck on http transfers. E.g.
> # ps -AF | grep condor
...
> bbee 13689 13688 0 41582 5868 0 Jul29 ? 00:01:14 /usr/libexec/condor/curl_plugin http://proxy.chtc.wisc.edu/SQUID/bmrb/3.8/combined.tgz.enc /var/lib/condor/execute/dir_13685/combined.tgz.enc
...
> bbee 13817 13816 0 41582 5876 0 Jul29 ? 00:01:17 /usr/libexec/condor/curl_plugin http://proxy.chtc.wisc.edu/SQUID/bmrb/3.8/combined.tgz.enc /var/lib/condor/execute/dir_13813/combined.tgz.enc
Is there a knob I could use to auto-kill these? The manual has JobStatus
6 for "transferring output" but none transferring input, is that correct?
This is v. 8.8.4-1.el7.x86_64
TIA,
--
Dimitri Maziuk
Programmer/sysadmin
BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu