[Tgcmgroup] Recommendations for bluevista

Ben Foster foster at hao.ucar.edu
Wed Nov 29 12:34:04 MST 2006


Hi tgcmgroup:

I have two recommendations for all TIEGCM/TIMEGCM runs on bluevista.
These do not apply to bluesky, but will apply to blueice, when it
becomes available.

I have added these recommendations to the default .job files for two 
versions of tiegcm and timegcm at both HAO and SCD, so you can refer
to them, or copy and modify as needed for your jobs:

$TGCMROOT/tiegcm1.8/scripts/tiegcm-ibm.job 
$TGCMROOT/timegcm1.2/scripts/timegcm-ibm.job 

Thanks to Siddhartha Ghosh for making these suggestions and assisting in
testing and ongoing performance evaluation.

#-----------------------------------------------------------------------

The following environment variable settings are recommended on
all TGCM models run on bluevista:

#
# 11/29/06: add env vars for bluevista (suggested by Siddhartha)
#
if ($nodename =~ bv*) then # bluevista
  setenv MP_STDINMODE 0
  setenv MP_SHARED_MEMORY yes
  setenv MEMORY_AFFINITY MCM
  setenv AIXTHREAD_SCOPE S
  setenv MALLOCMULTIHEAP true
endif

This can be added in your .job script, before the #BSUB commands
(just after the similiar set of env vars for bluesky).

When the new blueice system is available (and bluesky is retired), 
we can remove the conditional on the nodename.

#-----------------------------------------------------------------------

Simultaneous Multi-Threading (SMT), available on the IBM POWER5 architecture,
adds a virtual processor to each physical processor, doubling the number 
of active threads on a node. So when using SMT you can request 16 processors 
per node rather than the physical 8, using the #BSUB -R option.

See also: http://www.cisl.ucar.edu/docs/bluevista/run.html#smt_use

To take advantage of SMT on bluevista, try one of the following 2 
examples (in your .job scripts):

#
#BSUB -n 48                 # 48 procs
#BSUB -R "span[ptile=16]"   # 3 nodes w/ SMT (Symmetric Multi-Threading)
#
##BSUB -n 32                # 32 procs
##BSUB -R "span[ptile=16]"  # 2 nodes w/ SMT
#

The 2.5 degree timegcm model scales to 48 processors, as set above, but 
I have not tested the other models, so please experiment with these and 
other combinations.

#-----------------------------------------------------------------------

Most of the performance improvement will come from setting the env vars
rather than SMT.

Although the above applies only to bluevista, a job script with the above 
changes should continue to run on bluesky, because bluesky recognizes only 
LoadLeveler #@ directives, ignoring the LSF #BSUB directives.

When bluesky is retired early next year, we can remove all #@ directives,
since LoadLeveler is being retired, and all scd systems will run LSF.



-----------------------------------------------------------------------
Ben Foster		      	High Altitude Observatory (HAO)
foster at ucar.edu			phone: 303-497-1595  fax: 303-497-1589  
Nat. Center for Atmos. Res.     P.O. Box 3000 Boulder CO 80307 USA
-----------------------------------------------------------------------



More information about the tgcmgroup mailing list