Answers below in-line.
On 11/30/19 3:41 PM, Sunghyun Park wrote:
NOTE: You will have to fill in the host name of your central manager.Hi, Tim.ÂThank you so much for the reference.ÂI'm looking at the slides and it seems like I need to add some config files at /etc/condor/config.d/.After the packaged installation, my config.d/ directory is currently empty.Â
Slides say that I need to add the following files:
- /etc/condor/config.d/49-common (all nodes)
- CONDOR_HOST = fermicloud176.fnal.gov
Don't forget the security configuration file on the next page. You will also have to change the host names mentioned in that file as well. You also need to store the pool password on each node. You can create it condor_store_cred command. Either run the command on each node, or copy the /etc/condor/password.d/POOL file to each node.
- /etc/condor/config.d/51-role-cm (one node)
- use ROLE: CentralManager
- /etc/condor/config.d/51-role-submit (some set of nodes)
- use ROLE: Submit
- /etc/condor/config.d/51-role-exec (some set of nodes)
- use ROLE: Execute
The numbers examples. They merely determine the order that the files are interpreted. Convention has them numbered from 00 to 99. Stay away from files that begin with 0, those are often used by the HTCondor team or distributions. The idea is that you can override a setting that is processed after an earlier file.I understand what it is trying to do but cannot fully follow the details. Here are my questions.
- Should I use the exactly same file names or certain naming convention? If so, whereÂdoes the numbers (49, 51, ..) come from?
In the example, these file have a single line. It is not important that they have a single line, For example, you may want to add configuration to a specific type of node in that file.
- Are those only contents inside each file? If so, I want to make sure that each file will have a single line of config.
Once you have established the configuration, you can start the HTCondor daemons and submit jobs. It is a good place to start.
- Is this the only setup to start with condor? I'm wondering if there is an additional step before I can submit jobs at the cm/submit node to execute node.
Using the powerful ClassAd mechanism you can specify the requirements of the job. Usually, the requirements are general, such as the memory and disk required. However, you can make the requirements so specific that that job would only match a single machine.
- I'm also curious if we can specify "execute" node we want to use when we submit the task. I went through the documents, but couldn't find anything relevant yet.
Thank you so much for answering these baby-step questions.ÂHave a nice weekend!
On Fri, Nov 29, 2019 at 2:05 PM Tim Theisen <tim@xxxxxxxxxxx> wrote:
Hello Sung,
I am sorry that our manuals need more work on installation. The condor_configure command is for use in the tarball installations. With the packaged installations, you need to add a few configuration files.
I presented a talk at HTCondor Week last May that has a helpful example. Take a look at the example "3 node pool" in my presentation. If you have further questions, please ask.
Here is a link to the talk:
ÂÂÂ https://agenda.hep.wisc.edu/event/1325/session/16/contribution/41
Let me know how it goes.
Regards, ...Tim
On 11/27/19 3:04 PM, Sunghyun Park wrote:
Hi, all. I'm a newbie who is trying to install condor at the cluster environment.Since I'm not familiar with cluster setup, I'm having trouble to install/configure condor.I'm trying to have a separate machine that has a role of "central manger" and "submit" while having a multiple slave nodes that only "execute".
My machines have ubuntu 18.04 so I successfully installed ubuntu package as the installation using tarball wasn't recommended on the online doc. https://htcondor.readthedocs.io/en/v8_8_6/admin-manual/installation-startup-shutdown-reconfiguration.html#unix-installation-from-a-repository
Here's the steps I think I should follow:--
- Install the package on all machines ( a machine for "manager/submit" + all slave machines for "execute")
- Use condor_configure to configure the type of each machine.
Is this the right approach?
I'm having errors at the second step providing right directories for release/local/install ...
Any suggestion/advice will be greatly helpful.Thank you!
Best, Sung
_______________________________________________ HTCondor-users mailing list To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a subject: Unsubscribe You can also unsubscribe by visiting https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users The archives can be found at: https://lists.cs.wisc.edu/archive/htcondor-users/-- Tim Theisen Release Manager HTCondor & Open Science Grid Center for High Throughput Computing Department of Computer Sciences University of Wisconsin - Madison 4261 Computer Sciences and Statistics 1210 W Dayton St Madison, WI 53706-1685 +1 608 265 5736
--
Best, Sung
-- Tim Theisen Release Manager HTCondor & Open Science Grid Center for High Throughput Computing Department of Computer Sciences University of Wisconsin - Madison 4261 Computer Sciences and Statistics 1210 W Dayton St Madison, WI 53706-1685 +1 608 265 5736