Hello Greg, Thank you! Of course, I have attached one of today’s logs with Unix/Windows end of the line. Best, Siarhei. From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx>
On Behalf Of Greg Thain
On 1/11/21 1:45 PM, Vaurynovich, Siarhei wrote:
Can you share with us your job log files, which might give some idea into why condor is preempting these jobs?
-greg ............................................................................ |
000 (676395.000.000) 01/10 23:48:35 Job submitted from host: <10.82.184.49:9618?addrs=10.82.184.49-9618+[--1]-9618&noUDP&sock=3463746_3b81_4> DAG Node: 20201210_US_120_ri1_WBA ... 001 (676395.000.000) 01/11 00:23:41 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 006 (676395.000.000) 01/11 00:23:51 Image size of job updated: 17260 17 - MemoryUsage of job (MB) 17260 - ResidentSetSize of job (KB) ... 006 (676395.000.000) 01/11 00:28:51 Image size of job updated: 2501404 2443 - MemoryUsage of job (MB) 2501068 - ResidentSetSize of job (KB) ... 006 (676395.000.000) 01/11 01:18:56 Image size of job updated: 2501408 2443 - MemoryUsage of job (MB) 2501068 - ResidentSetSize of job (KB) ... 001 (676395.000.000) 01/11 01:20:30 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 02:20:48 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 03:21:58 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 04:15:49 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 05:11:34 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 06:00:36 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 06:55:27 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 07:52:15 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 08:49:41 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 09:52:17 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 11:02:24 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 12:04:34 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 13:02:33 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 13:42:04 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 14:44:20 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ...
000 (676395.000.000) 01/10 23:48:35 Job submitted from host: <10.82.184.49:9618?addrs=10.82.184.49-9618+[--1]-9618&noUDP&sock=3463746_3b81_4> DAG Node: 20201210_US_120_ri1_WBA ... 001 (676395.000.000) 01/11 00:23:41 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 006 (676395.000.000) 01/11 00:23:51 Image size of job updated: 17260 17 - MemoryUsage of job (MB) 17260 - ResidentSetSize of job (KB) ... 006 (676395.000.000) 01/11 00:28:51 Image size of job updated: 2501404 2443 - MemoryUsage of job (MB) 2501068 - ResidentSetSize of job (KB) ... 006 (676395.000.000) 01/11 01:18:56 Image size of job updated: 2501408 2443 - MemoryUsage of job (MB) 2501068 - ResidentSetSize of job (KB) ... 001 (676395.000.000) 01/11 01:20:30 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 02:20:48 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 03:21:58 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 04:15:49 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 05:11:34 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 06:00:36 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 06:55:27 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 07:52:15 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 08:49:41 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 09:52:17 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 11:02:24 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 12:04:34 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 13:02:33 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 13:42:04 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ... 001 (676395.000.000) 01/11 14:44:20 Job executing on host: <10.82.176.49:9618?addrs=10.82.176.49-9618+[--1]-9618&noUDP&sock=81524_42f8_3> ...