[mpas-developers] 1/10 degree problems

Mathew Maltrud maltrud at lanl.gov
Fri Apr 16 14:16:36 MDT 2010


Hi Michael and Todd--

i've been trying to run the 1/10 dipole POP grid in the sw  
configuration and am getting something i haven't seen before.  all  
appears normal--all mpi processes are going, etc.  the *.err files say  
it is looping over timesteps, though clearly nothing is being done  
(happening too fast).  there's no output.nc file.  here are examples  
of the log.0000.* files (running on 64 cores):

mm@cy-2.lanl.gov {10}% tail log.0000.err
  Doing timestep           11
  Doing timestep           12
  Doing timestep           13
  Doing timestep           14
  Doing timestep           15
  Doing timestep           16
  Doing timestep           17
  Doing timestep           18
  Doing timestep           19
  Doing timestep           20
mm@cy-2.lanl.gov {11}% tail log.0000.out

   TIMINGS (process:event,running,cpu,wall,100*(wall/total wall))
      0 : total time          F        0.00000      196.55210

      0 : initialize          F        0.00000       67.82460   34.51
      0 : time integration    F        0.00000       11.05870    5.63

so the 'F' is a clue, but i don't know what it means.  note that the  
grid.nc file looks ok, and i successfully ran the 4/10 version of this  
grid earlier this week.
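
for reference, going only by the header line printed above the timings (process:event,running,cpu,wall,100*(wall/total wall)), the 'F' lines up with the "running" column, so it may simply be a logical flag saying the timer was not running when the summary was printed. that reading is an assumption based on the header, not something confirmed from the mpas source. a minimal python sketch that splits the lines that way and recomputes the percentage column (the regex and the parse_timing helper are hypothetical, written just for this log excerpt):

```python
import re

# Column layout assumed from the header line in log.0000.out:
#   process : event, running, cpu, wall, 100*(wall/total wall)
# The last column is optional (the "total time" row does not have it).
LINE = re.compile(
    r"^\s*(\d+)\s*:\s*(.+?)\s+([TF])\s+([\d.]+)\s+([\d.]+)(?:\s+([\d.]+))?\s*$"
)

def parse_timing(line):
    """Return one timing row as a dict, or None if the line does not match."""
    m = LINE.match(line)
    if m is None:
        return None
    proc, event, running, cpu, wall, pct = m.groups()
    return {
        "process": int(proc),
        "event": event,
        "running": running == "T",  # assumed to be a T/F logical flag
        "cpu": float(cpu),
        "wall": float(wall),
        "percent": float(pct) if pct is not None else None,
    }

# Lines copied from the log.0000.out excerpt above.
sample = """\
      0 : total time          F        0.00000      196.55210

      0 : initialize          F        0.00000       67.82460   34.51
      0 : time integration    F        0.00000       11.05870    5.63
"""

rows = [r for r in map(parse_timing, sample.splitlines()) if r]
total_wall = rows[0]["wall"]  # first row is "total time"
for r in rows:
    # Recompute the last column as 100 * wall / total wall.
    print(r["event"], r["running"], round(100 * r["wall"] / total_wall, 2))
```

under that reading the cpu column is zero for every event, while initialize and time integration together account for only about 40% of the total wall time.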

any hints?  maybe not enough memory?  there are about 6 million cells...

thanks...
-mat

