Hi Carles This may be related to an issue weâre seeing here with capturing resource usage. See e.g. the following: JOB_ID Username Class CMD Finished CPUS CPuse MEMREQ RAM MEM ST CPU_TIME LWALL_TIME WorkerNode 946757.0 jtho long data-theorie-jthoe 11/21 07:19 1 0.999 32.0 GB 732.4 MB 732.4 MB C 13:03:43 13:10:42 wn-lot-045 946741.0 jtho long data-theorie-jthoe 11/21 06:55 1 1.000 32.0 GB 732.4 MB 732.4 MB C 13:03:53 13:04:36 wn-pijl-007 946581.0 jtho long data-theorie-jthoe 11/21 05:59 1 1.000 32.0 GB 732.4 MB 732.4 MB C 15:59:24 15:59:40 wn-lot-002 946889.0 jtho long data-theorie-jthoe 11/21 05:59 1 1.000 32.0 GB 9.8 MB 9.8 MB C 0 10:38:29 wn-lot-060 946732.0 jtho long data-theorie-jthoe 11/21 05:45 1 0.999 32.0 GB 732.4 MB 732.4 MB C 12:20:45 12:21:21 wn-pijl-004 946842.0 jtho long data-theorie-jthoe 11/21 05:23 1 0.997 32.0 GB 1.2 GB 1.4 GB C 10:38:52 10:41:09 wn-pijl-001 946440.0 jtho long data-theorie-jthoe 11/21 05:04 1 0.999 32.0 GB 1.2 GB 1.4 GB C 17:29:34 17:30:26 wn-pijl-006 You can see that for one of these lines, the CPU_TIME is zero, and the memory usage is significantly lower. Iâve seen this with my own test jobs, and looking at what the test jobs themselves (internally) report, they have the normal usage - HTCondor is somehow not always getting the right usage numbers. JT
|