Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] ASSIGN_CPU_AFFINITY does not work as expected

Date: Tue, 9 Dec 2014 17:54:13 +0000
From: qing <gang.qin@xxxxxxxxxxxxx>
Subject: Re: [HTCondor-users] ASSIGN_CPU_AFFINITY does not work as expected

Hi:

The previous limit I mentioned is actually done by condor cgroups instead of 'ASSIGN_CPU_AFFINITY = True' . Tests shows that condor cgroups make the restriction quite well.

Cheers,Gang

On 09/12/2014 15:56, qing wrote:

Hi, Anthony:

[root@node008 ~]# ps -aeF | grep stress
scotg001 5317 5315 0 1629   436    1 15:51 ?        00:00:00 stress --vm 4 --vm-bytes 800M --timeout 600s
scotg001 5318 5317 99 206430 487584 1 15:51 ?        00:01:07 stress --vm 4 --vm-bytes 800M --timeout 600s
scotg001 5319 5317 99 206430 778400 3 15:51 ?        00:01:07 stress --vm 4 --vm-bytes 800M --timeout 600s
scotg001 5320 5317 99 206430 399520 0 15:51 ?        00:01:07 stress --vm 4 --vm-bytes 800M --timeout 600s
scotg001 5321 5317 99 206430 655520 2 15:51 ?        00:01:07 stress --vm 4 --vm-bytes 800M --timeout 600s

So the core used here are 0,1,2,3.

Can you mention more details about 'use taskset to determine cpu affinity' ?

Cheers,Gang

On 09/12/2014 15:43, Anthony Tiradani wrote:
Your top output doesn't show which core the processes are using. what does ps -aeF show? The PSR column will show the last core that the process ran on. You can also use taskset to determine cpu affinity.

Thanks,

Anthony Tiradani

On 12/09/2014 08:15 AM, qing wrote:

Dear Condor Expert:

   I am using condor 8.2.2 and from the manual I see that if we set
'ASSIGN_CPU_AFFINITY = True' for the STARTD daemon, then a
multi-threaded job will not use more cores than the number it requests.
I made a quick test but seems 'ASSIGN_CPU_AFFINITY = True' does not work:

1. I added 'ASSIGN_CPU_AFFINITY = True' for the STARTD daemon, after
that I restarted the condor daemon on node008.
[root@node008 ~]# condor_config_val -dump | grep AFFINI
ASSIGN_CPU_AFFINITY = True

2. I submit a 4 threads job to node008, but I assign 'RequestCpus = 1'
in the submission file:

[scotg001@node003 test]$ condor_submit submit.stress
Submitting job(s).
1 job(s) submitted to cluster 41.

[scotg001@node003 test]$ cat submit.stress
Universe       = vanilla
Executable     = stress.sh
Output         = stress.out
Log            = stress.log

RequestMemory = 1000
RequestCpus = 1
requirements = ((Arch == "INTEL" || ARCH == "X86_64") && (machine
=="node008"))

Queue 1

[scotg001@node003 test]$ cat stress.sh
#!/bin/bash
echo $HOSTNAME
echo `date`
stress --vm 4 --vm-bytes 800M --timeout 60s

3. Condor shows that this job requires 1 cpu:

[scotg001@node003 test]$ condor_history 41.0 -af RequestCpus
1

but on node008 with top I can see the job is actually occupying 4 cores.
5232 scotg001 30 10 806m 767m 140 R 94.7 9.8   0:07.60 stress
5233 scotg001 30 10 806m 651m 140 R 94.7 8.3   0:07.48 stress
5230 scotg001 30 10 806m 15m 140 R 92.9 0.2   0:07.46 stress
5231 scotg001 30 10 806m 201m 140 R 92.9 2.6   0:07.54 stress

Any idea why 'ASSIGN_CPU_AFFINITY = True' does not work as expected?
I use partitionable slots but in 8.2.2 the previous bug should already
be fixed.

   Cheers,Gang
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/

References:
- [HTCondor-users] ASSIGN_CPU_AFFINITY does not work as expected
  - From: qing
- Re: [HTCondor-users] ASSIGN_CPU_AFFINITY does not work as expected
  - From: Anthony Tiradani
- Re: [HTCondor-users] ASSIGN_CPU_AFFINITY does not work as expected
  - From: qing

Prev by Date: Re: [HTCondor-users] ASSIGN_CPU_AFFINITY does not work as expected
Next by Date: Re: [HTCondor-users] Inconsistencies: hold versus abort
Previous by thread: Re: [HTCondor-users] ASSIGN_CPU_AFFINITY does not work as expected
Next by thread: [HTCondor-users] condor_status nodename shows nothing when HOSTNAME=shortname in /etc/sysconfig/network
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [HTCondor-users] ASSIGN_CPU_AFFINITY does not work as expected