[mpas-developers] 1/10 degree problems
Mathew Maltrud
maltrud at lanl.gov
Fri Apr 16 14:16:36 MDT 2010
Hi Michael and Todd--
i've been trying to run the 1/10 degree dipole POP grid in the sw
configuration and am getting something i haven't seen before. everything
appears normal: all MPI processes are running, etc. the *.err files say
it is looping over timesteps, though clearly no work is being done
(the steps go by too fast), and no output.nc file is written. here are
examples of the log.0000.* files (running on 64 cores):
mm at cy-2.lanl.gov {10}% tail log.0000.err
Doing timestep 11
Doing timestep 12
Doing timestep 13
Doing timestep 14
Doing timestep 15
Doing timestep 16
Doing timestep 17
Doing timestep 18
Doing timestep 19
Doing timestep 20
mm at cy-2.lanl.gov {11}% tail log.0000.out
TIMINGS (process:event,running,cpu,wall,100*(wall/total wall))
0 : total time          F   0.00000   196.55210
0 : initialize          F   0.00000    67.82460   34.51
0 : time integration    F   0.00000    11.05870    5.63
so the 'F' is a clue, but i don't know what it means. note that the
grid.nc file looks ok, and i successfully ran the 4/10 version of this
grid earlier this week.
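
for reference, here is a rough sketch of how the TIMINGS rows could be read against their header. it assumes the fields appear in header order (process:event, running, cpu, wall, percent) and that the T/F token is a Fortran-style logical for the 'running' column. that reading is an assumption and has not been checked against the timer source; the function name parse_timing_line is made up for illustration. the last column does at least match 100*(wall/total wall) for the rows above.

```python
def parse_timing_line(line):
    """Split one TIMINGS row into named fields, following the header order.

    Assumption: the lone T/F token is a Fortran logical for the 'running' column.
    """
    process, rest = line.split(":", 1)
    tokens = rest.split()
    # The event name may contain spaces, so locate the T/F flag first.
    flag_idx = next(i for i, t in enumerate(tokens) if t in ("T", "F"))
    numbers = [float(t) for t in tokens[flag_idx + 1:]]
    return {
        "process": int(process),
        "event": " ".join(tokens[:flag_idx]),
        "running": tokens[flag_idx] == "T",
        "cpu": numbers[0],
        "wall": numbers[1],
        # The 'total time' row carries no percentage column.
        "percent": numbers[2] if len(numbers) > 2 else None,
    }


total = parse_timing_line("0 : total time F 0.00000 196.55210")
step = parse_timing_line("0 : time integration F 0.00000 11.05870 5.63")
# Recompute the percentage from the wall times to check the column mapping.
recomputed = round(100 * step["wall"] / total["wall"], 2)
```

under that reading, 'F' would only say that the timer was not running when the summary was printed, and the zero in the cpu column would be the more unusual value.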
any hints? maybe not enough memory? there are about 6 million cells...
thanks...
-mat