reproduce-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[task #15737] slurm - openmpi - (PMIx+libevent+hwloc)


From: Mohammad Akhlaghi
Subject: [task #15737] slurm - openmpi - (PMIx+libevent+hwloc)
Date: Thu, 30 Jul 2020 06:48:11 -0400 (EDT)
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:79.0) Gecko/20100101 Firefox/79.0

Follow-up Comment #1, task #15737 (project reproduce):

It would be great if you can post your experiences here as you experiment with
various solutions. I haven't had much time to actually test these yet in
Maneage.

One thing that does come to my mind and can probably be helpful is the
preparation phase (which is organized in 'top-prepare.mk'). In that phase, the
project can get basic settings of the host (and necessary analysis), and using
those settings to optimize the Make rules of 'top-make.sh' for them.

For example the 'X.sh' can manually add a file (as a configuration file),
listing the set of independent targets that './project make' should produce
for that particular submission (with 'srun' or 'sbatch'). 

For example if you have 1000 jobs, and the cluster has 100 computers you would
want each computer to do 10 of the independent jobs. So each 'X.sh' can define
a variable, listing the final targets for its './project make' command to
build. All the 'X.sh' submissions will then use the same Maneaged software
environment and raw datasets, but do their jobs independently. In the end you
can add one extra 'X.sh' submission to merge the results of all into one final
result/paper for example. 

I will hopefully start using slurm with Maneage more in the coming months, and
will add the low-level structure to facilitate it. But until then please go
ahead with testing and post the results here for us to also learn from ;-).

    _______________________________________________________

Reply to this item at:

  <https://savannah.nongnu.org/task/?15737>

_______________________________________________
  Message sent via Savannah
  https://savannah.nongnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]