Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] HTC and HPC

Date: Fri, 05 Dec 2014 11:28:21 -0500
From: Gary Jackson <garyj@xxxxxxxxxx>
Subject: Re: [HTCondor-users] HTC and HPC

"HPC" is not magic pixie dust. If you don't actually havetightly-coupled parallel applications that require high performancecomputing resources, then installing scheduling software that supports"HPC" isn't going to do anything useful for you. Unless you have thosespecific needs, HTCondor is going to do a lot more for you than thoseother schedulers.

As I understand it, the reason you'd use a purpose-built HPC batchscheduler is because HTCondor's scheduling algorithm isn't as flexiblefor parallel jobs. SLURM and Torque give the administrator a lot oftools for tuning parallel scheduling performance to maximize utilizationor minimize turnaround time. For instance, they support plugging in abackfill scheduler for running jobs out of priority order when a lowerpriority job won't interfere with a higher priority job. Backfill inHTCondor, though still useful, doesn't work the same way and isn'tuseful for tightly-coupled parallel jobs.

Obviously, HTCondor is capable of scheduling and running parallel jobs,and you can use that if your parallel scheduling needs do not exceedwhat HTCondor can provide. HTCondor can start an OpenMPI job just aseasily as SLURM.

On the other hand, you probably wouldn't use SLURM or Torque for thesame sort of high throughput computing you do with HTCondor. HTCondor isa very sophisticated program that covers a lot more use cases than thosetwo batch schedulers. For example, HTCondor has support for:


* transparent checkpointing
* running jobs on desktop machines with low impact on end users
* using cloud resources

There's no reason you can't use both HTCondor and a purpose-builtparallel batch scheduler at the same time. Locally, we've used bothTorque and HTCondor on our HPC clusters for many years. When Torque jobsrun, they preempt any HTCondor jobs and the nodes leave the pool for theduration of the parallel job. It's worked out well.


On 12/4/14, 6:10 AM, marrodriguez wrote:

Hi

Hi
I have interest to implement condor on my site, but I have a doubt, Why
condor is consider a HTC  batch system and not HPC. it have some
disadvantage on HPC field respect Slurm, PBS vs SGE?

Thanks in advanced
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/



--
Gary

Follow-Ups:
- Re: [HTCondor-users] HTC and HPC
  - From: marrodriguez

References:
- [HTCondor-users] HTC and HPC
  - From: marrodriguez

Prev by Date: Re: [HTCondor-users] HTC and HPC
Next by Date: Re: [HTCondor-users] default host ranking
Previous by thread: Re: [HTCondor-users] HTC and HPC
Next by thread: Re: [HTCondor-users] HTC and HPC
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [HTCondor-users] HTC and HPC