[Wrf-users] wrfv3 & mpi
Jan Ploski
Jan.Ploski at offis.de
Wed Aug 27 14:29:40 MDT 2008
Rodrigo Delgado Urzúa wrote:
>
> Cesar Hernandez Baño wrote:
>> Hi
>>
>> I'm using WRFV3 with MPICH 1.2.7p1. The problem is that the more
>> CPUs/nodes I use in the cluster, the slower the test runs.
>>
>> I have 4 PCs (Pentium 4), with four CPUs in each server. I run
>> test/em_real/wrf.exe with mpirun.
>>
>> The network cards are Gigabit. With 2 processes, network traffic is
>> approx. 70 MBit; with 12 processes, it is 150 MBit.
>> With 2 processes, each CPU is approx. 80-90% busy. With 3 processes,
>> each CPU is 50-60% busy.
>>
>> What's the problem?
>>
>> Cheers
>>
>
> Hi Cesar,
>
> I think that's normal. The explanation is simple: more processes
> on different machines means more communication is needed to perform the
> computation. This happens with relatively slow interconnects (such as
> FastE or GigE) as well as with faster ones (e.g., InfiniBand). The
> difference is that fast interconnects have more bandwidth and lower
> latency than the slow ones, so you can add CPUs to the simulation without
> compromising performance. But even the fastest interconnects can become
> saturated when you add enough CPUs to the simulation, resulting in a
> loss of performance.
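To make the trade-off above concrete, here is a toy model, only a sketch and not something measured on any cluster. It assumes the compute work divides evenly across processes while communication cost grows linearly with the number of processes. The function name and the numbers (120 s of compute, 10 s of communication per extra process) are made up for illustration; real WRF communication patterns are more complicated.

```python
def modeled_time(procs, compute_seconds, comm_seconds_per_proc):
    """Toy wall-clock model: perfectly divided compute plus a
    communication cost that grows with each additional process.
    Both parameters are hypothetical, not measurements."""
    return compute_seconds / procs + comm_seconds_per_proc * (procs - 1)


if __name__ == "__main__":
    # Hypothetical numbers chosen only to show the shape of the curve.
    for procs in (1, 2, 3, 4, 6, 12):
        t = modeled_time(procs, compute_seconds=120.0, comm_seconds_per_proc=10.0)
        print(f"{procs:2d} processes: {t:6.1f} s")
```

With these made-up inputs the modeled run time first drops and then climbs again as processes are added, which is the qualitative behaviour described above. A faster interconnect corresponds to a smaller communication term, which moves the turning point to a higher process count.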
Here's one scalability comparison we did - 03 (OpenMP) vs. 05 (MPI over
GigE) vs. 19 (MPI over InfiniBand):
https://srvgrid01.offis.uni-oldenburg.de/wrf-benchmark/case.pl?case=conus12km_2001
Up to 8 processes you should get a good speedup. Have you compared
running multiple MPI processes on a single machine with distributing
them across machines? Have you tried a different MPI implementation?
(Open MPI is very easy to compile and install, in my experience.)
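For the comparison suggested above, a small helper like the following could turn wall-clock times into speedup and parallel efficiency. It is a minimal sketch: the function name is hypothetical, and the timings in the example are placeholders to be replaced with your own measurements for each process placement.

```python
def speedup_table(timings):
    """Compute speedup and parallel efficiency.

    timings: dict mapping process count -> wall-clock seconds.
    The run with the fewest processes is used as the baseline.
    Returns a list of (procs, speedup, efficiency) tuples.
    """
    base_procs = min(timings)
    base_time = timings[base_procs]
    rows = []
    for procs in sorted(timings):
        speedup = base_time / timings[procs]
        # Efficiency of 1.0 means perfect scaling relative to the baseline.
        efficiency = speedup * base_procs / procs
        rows.append((procs, speedup, efficiency))
    return rows


if __name__ == "__main__":
    # Placeholder timings in seconds, NOT real measurements.
    single_machine = {1: 100.0, 2: 50.0, 4: 40.0}
    for procs, speedup, efficiency in speedup_table(single_machine):
        print(f"{procs} procs: speedup {speedup:.2f}, efficiency {efficiency:.2f}")
```

Running it once with timings from a single machine and once with timings from processes spread across machines shows whether the efficiency drop comes from the network or is already present within one node.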
Regards,
Jan Ploski