[Wrf-users] WRF parallel problem

Ever Daniel Barreto Rojas everdaniel at gmail.com
Fri Jun 9 16:03:35 MDT 2017


Hi Elliot,

the errors make it sounds like it's a permission error, for example:

> [0]  starting wrf task            0  of           28
> [0] error_dup: cannot open rsl.out.nnnn: Permission denied

this means the process can't open the rsl.out.nnnn file because it doesn't have the permissions to do so, further down below, it sounds like the wrf.exe process can't even open the namelist.input file:

> [0] -------------- FATAL CALLED ---------------
> [0] FATAL CALLED FROM FILE:  <stdin>  LINE:    8432
> [0] ERROR OPENING namelist.output

I would make sure that all model files, both on your master node and slave nodes all belong to a specific user, you can do this with:

sudo chown -R user:group /path/to/model

Or you can just find which files are owned by another user (maybe root) and change the ownership of those files.

---
Ever Daniel Barreto Rojas
everdaniel at gmail dot com

> On Jun 9, 2017, at 2:54 PM, Tardif, Elliot M <elliot.tardif at ncdenr.gov> wrote:
> 
> Good afternoon all,
>  
> My colleagues and I are attempting to get WRF running on a new high-performance computing cluster that we recently purchased. It is compiled using intel/fortran compilers and DMPAR. When simply typing “./wrf.exe”, it runs without a problem. However, when we try to run it in parallel (“time mpirun -hostfile machines_1 -l -n 28 ./wrf.exe”), we get the following errors. (please see message transcript below). Any proposed help to resolve this problem would be greatly appreciated. Thanks for your help.
>  
> -Elliot
>  
> [0]  starting wrf task            0  of           28
> [0] error_dup: cannot open rsl.out.nnnn: Permission denied
> [0] ...sending output to standard output and continuing.
> [9]  starting wrf task            9  of           28
> [9] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [9] ...sending output to standard output and continuing.
> [0]  module_io_quilt_old.F        2931 F
> [11]  starting wrf task           11  of           28
> [11] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [11] ...sending output to standard output and continuing.
> [10]  starting wrf task           10  of           28
> [10] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [10] ...sending output to standard output and continuing.
> [6]  starting wrf task            6  of           28
> [6] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [6] ...sending output to standard output and continuing.
> [8]  starting wrf task            8  of           28
> [8] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [8] ...sending output to standard output and continuing.
> [8]  module_io_quilt_old.F        2931 F
> [9]  module_io_quilt_old.F        2931 F
> [10]  module_io_quilt_old.F        2931 F
> [11]  module_io_quilt_old.F        2931 F
> [23]  starting wrf task           23  of           28
> [23] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [23] ...sending output to standard output and continuing.
> [3]  starting wrf task            3  of           28
> [3] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [3] ...sending output to standard output and continuing.
> [25]  starting wrf task           25  of           28
> [14]  starting wrf task           14  of           28
> [14] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [14] ...sending output to standard output and continuing.
> [25] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [25] ...sending output to standard output and continuing.
> [19]  starting wrf task           19  of           28
> [19] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [19] ...sending output to standard output and continuing.
> [27]  starting wrf task           27  of           28
> [27] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [27] ...sending output to standard output and continuing.
> [4]  starting wrf task            4  of           28
> [4] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [4] ...sending output to standard output and continuing.
> [4]  module_io_quilt_old.F        2931 F
> [5]  starting wrf task            5  of           28
> [5] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [5] ...sending output to standard output and continuing.
> [5]  module_io_quilt_old.F        2931 F
> [6]  module_io_quilt_old.F        2931 F
> [21]  starting wrf task           21  of           28
> [21] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [21] ...sending output to standard output and continuing.
> [18]  starting wrf task           18  of           28
> [18] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [18] ...sending output to standard output and continuing.
> [2]  starting wrf task            2  of           28
> [2] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [2] ...sending output to standard output and continuing.
> [2]  module_io_quilt_old.F        2931 F
> [3]  module_io_quilt_old.F        2931 F
> [24]  starting wrf task           24  of           28
> [7]  starting wrf task            7  of           28
> [24] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [24] ...sending output to standard output and continuing.
> [16]  starting wrf task           16  of           28
> [7] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [7] ...sending output to standard output and continuing.
> [7]  module_io_quilt_old.F        2931 F
> [16] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [16] ...sending output to standard output and continuing.
> [16]  module_io_quilt_old.F        2931 F
> [18]  module_io_quilt_old.F        2931 F
> [19]  module_io_quilt_old.F        2931 F
> [24]  module_io_quilt_old.F        2931 F
> [25]  module_io_quilt_old.F        2931 F
> [1]  starting wrf task            1  of           28
> [1] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [1] ...sending output to standard output and continuing.
> [1]  module_io_quilt_old.F        2931 F
> [15]  starting wrf task           15  of           28
> [15] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [15] ...sending output to standard output and continuing.
> [12]  starting wrf task           12  of           28
> [26]  starting wrf task           26  of           28
> [12] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [12] ...sending output to standard output and continuing.
> [12]  module_io_quilt_old.F        2931 F
> [14]  module_io_quilt_old.F        2931 F
> [15]  module_io_quilt_old.F        2931 F
> [26] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [26] ...sending output to standard output and continuing.
> [26]  module_io_quilt_old.F        2931 F
> [27]  module_io_quilt_old.F        2931 F
> [13]  starting wrf task           13  of           28
> [13] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [13] ...sending output to standard output and continuing.
> [13]  module_io_quilt_old.F        2931 F
> [17]  starting wrf task           17  of           28
> [17] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [17] ...sending output to standard output and continuing.
> [17]  module_io_quilt_old.F        2931 F
> [20]  starting wrf task           20  of           28
> [20] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [20] ...sending output to standard output and continuing.
> [20]  module_io_quilt_old.F        2931 F
> [21]  module_io_quilt_old.F        2931 F
> [22]  starting wrf task           22  of           28
> [22] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [22] ...sending output to standard output and continuing.
> [22]  module_io_quilt_old.F        2931 F
> [23]  module_io_quilt_old.F        2931 F
> [4] Quilting with   1 groups of   0 I/O tasks.
> [5] Quilting with   1 groups of   0 I/O tasks.
> [7] Quilting with   1 groups of   0 I/O tasks.
> [11] Quilting with   1 groups of   0 I/O tasks.
> [1] Quilting with   1 groups of   0 I/O tasks.
> [2] Quilting with   1 groups of   0 I/O tasks.
> [3] Quilting with   1 groups of   0 I/O tasks.
> [6] Quilting with   1 groups of   0 I/O tasks.
> [8] Quilting with   1 groups of   0 I/O tasks.
> [9] Quilting with   1 groups of   0 I/O tasks.
> [10] Quilting with   1 groups of   0 I/O tasks.
> [12] Quilting with   1 groups of   0 I/O tasks.
> [13] Quilting with   1 groups of   0 I/O tasks.
> [14] Quilting with   1 groups of   0 I/O tasks.
> [15] Quilting with   1 groups of   0 I/O tasks.
> [16] Quilting with   1 groups of   0 I/O tasks.
> [17] Quilting with   1 groups of   0 I/O tasks.
> [18] Quilting with   1 groups of   0 I/O tasks.
> [19] Quilting with   1 groups of   0 I/O tasks.
> [20] Quilting with   1 groups of   0 I/O tasks.
> [21] Quilting with   1 groups of   0 I/O tasks.
> [22] Quilting with   1 groups of   0 I/O tasks.
> [23] Quilting with   1 groups of   0 I/O tasks.
> [24] Quilting with   1 groups of   0 I/O tasks.
> [25] Quilting with   1 groups of   0 I/O tasks.
> [26] Quilting with   1 groups of   0 I/O tasks.
> [27] Quilting with   1 groups of   0 I/O tasks.
> [2] Quilting with   1 groups of   0 I/O tasks.
> [4] Quilting with   1 groups of   0 I/O tasks.
> [5] Quilting with   1 groups of   0 I/O tasks.
> [7] Quilting with   1 groups of   0 I/O tasks.
> [10] Quilting with   1 groups of   0 I/O tasks.
> [11] Quilting with   1 groups of   0 I/O tasks.
> [12] Quilting with   1 groups of   0 I/O tasks.
> [13] Quilting with   1 groups of   0 I/O tasks.
> [15] Quilting with   1 groups of   0 I/O tasks.
> [16] Quilting with   1 groups of   0 I/O tasks.
> [18] Quilting with   1 groups of   0 I/O tasks.
> [20] Quilting with   1 groups of   0 I/O tasks.
> [21] Quilting with   1 groups of   0 I/O tasks.
> [23] Quilting with   1 groups of   0 I/O tasks.
> [25] Quilting with   1 groups of   0 I/O tasks.
> [27] Quilting with   1 groups of   0 I/O tasks.
> [1] Quilting with   1 groups of   0 I/O tasks.
> [3] Quilting with   1 groups of   0 I/O tasks.
> [6] Quilting with   1 groups of   0 I/O tasks.
> [8] Quilting with   1 groups of   0 I/O tasks.
> [9] Quilting with   1 groups of   0 I/O tasks.
> [14] Quilting with   1 groups of   0 I/O tasks.
> [17] Quilting with   1 groups of   0 I/O tasks.
> [19] Quilting with   1 groups of   0 I/O tasks.
> [22] Quilting with   1 groups of   0 I/O tasks.
> [24] Quilting with   1 groups of   0 I/O tasks.
> [26] Quilting with   1 groups of   0 I/O tasks.
> [0] Quilting with   1 groups of   0 I/O tasks.
> [0] Quilting with   1 groups of   0 I/O tasks.
> [0] -------------- FATAL CALLED ---------------
> [0] FATAL CALLED FROM FILE:  <stdin>  LINE:    8432
> [0] ERROR OPENING namelist.output
> [0] -------------------------------------------
> [0] -------------- FATAL CALLED ---------------
> [0] FATAL CALLED FROM FILE:  <stdin>  LINE:    8432
> [0] ERROR OPENING namelist.output
> [0] -------------------------------------------
> [0] [cli_0]: aborting job:
> [0] application called MPI_Abort(MPI_COMM_WORLD, 1) - process 0
>  
> ===================================================================================
> =   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
> =   PID 57376 RUNNING AT compute-01
> =   EXIT CODE: 1
> =   CLEANING UP REMAINING PROCESSES
> =   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
> ===================================================================================
>  
> Elliot Tardif, Meteorologist II
> NC DEQ, Division of Air Quality
> Planning Section, Attainment Planning Branch
> 1641 Mail Service Center
> Raleigh, NC 27699-1641
> Phone/Fax: 919-707-8483
> Email:  elliot.tardif at ncdenr.gov
> http://www.ncair.org
> <image001.png>
> Email correspondence to and from this address is subject to the
> North Carolina Public Records Law and may be disclosed to third parties.
>  
> _______________________________________________
> Wrf-users mailing list
> Wrf-users at ucar.edu
> http://mailman.ucar.edu/mailman/listinfo/wrf-users



More information about the Wrf-users mailing list