[Wrf-users] WRF parallel problem
Ever Daniel Barreto Rojas
everdaniel at gmail.com
Fri Jun 9 16:03:35 MDT 2017
Hi Elliot,
The errors look like a permissions problem. For example:
> [0] starting wrf task 0 of 28
> [0] error_dup: cannot open rsl.out.nnnn: Permission denied
This means the process can't open the rsl.out.nnnn file because it doesn't have permission to do so. Further down, the wrf.exe process can't even open the namelist.output file, which it writes at startup:
> [0] -------------- FATAL CALLED ---------------
> [0] FATAL CALLED FROM FILE: <stdin> LINE: 8432
> [0] ERROR OPENING namelist.output
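One way to confirm this is to test whether the run directory is actually writable by your user on each node. Most ranks in your log report "Read-only file system" rather than "Permission denied", so it is worth checking the mount as well as the ownership. The following is only a minimal sketch: check_writable and the probe file name are names I made up for illustration, not part of WRF or MPI.

```shell
# check_writable DIR: report whether the current user can create a file in DIR.
# (Hypothetical helper for illustration; not part of WRF.)
check_writable() {
    dir=$1
    probe="$dir/.write_probe.$$"   # temporary file name, unique per shell process
    # Try to create the probe in a subshell so a failed redirection
    # cannot abort the calling shell.
    if ( : > "$probe" ) 2>/dev/null; then
        rm -f "$probe"
        echo "$(uname -n): $dir is writable by uid $(id -u)"
    else
        echo "$(uname -n): cannot write to $dir as uid $(id -u)"
        return 1
    fi
}
```

Running check_writable /path/to/model on the master node and on each compute node should show which nodes cannot write to the run directory.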
I would make sure that all model files, on both your master node and your slave nodes, belong to the same user. You can do this with:
sudo chown -R user:group /path/to/model
Or you can just find which files are owned by another user (maybe root) and change the ownership of those files.
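To find those files, something along these lines should work. It is a sketch under the assumption that you run it as the user who launches wrf.exe; list_foreign_files is just an illustrative name.

```shell
# list_foreign_files DIR: print every path under DIR that is NOT owned
# by the user running this command. (Hypothetical helper for illustration.)
list_foreign_files() {
    find "$1" ! -user "$(id -u)" -print
}
```

For example, list_foreign_files /path/to/model prints nothing if everything already belongs to you; any path it does print is a candidate for chown.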
---
Ever Daniel Barreto Rojas
everdaniel at gmail dot com
> On Jun 9, 2017, at 2:54 PM, Tardif, Elliot M <elliot.tardif at ncdenr.gov> wrote:
>
> Good afternoon all,
>
> My colleagues and I are attempting to get WRF running on a new high-performance computing cluster that we recently purchased. It is compiled using intel/fortran compilers and DMPAR. When simply typing “./wrf.exe”, it runs without a problem. However, when we try to run it in parallel (“time mpirun -hostfile machines_1 -l -n 28 ./wrf.exe”), we get the following errors. (please see message transcript below). Any proposed help to resolve this problem would be greatly appreciated. Thanks for your help.
>
> -Elliot
>
> [0] starting wrf task 0 of 28
> [0] error_dup: cannot open rsl.out.nnnn: Permission denied
> [0] ...sending output to standard output and continuing.
> [9] starting wrf task 9 of 28
> [9] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [9] ...sending output to standard output and continuing.
> [0] module_io_quilt_old.F 2931 F
> [11] starting wrf task 11 of 28
> [11] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [11] ...sending output to standard output and continuing.
> [10] starting wrf task 10 of 28
> [10] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [10] ...sending output to standard output and continuing.
> [6] starting wrf task 6 of 28
> [6] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [6] ...sending output to standard output and continuing.
> [8] starting wrf task 8 of 28
> [8] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [8] ...sending output to standard output and continuing.
> [8] module_io_quilt_old.F 2931 F
> [9] module_io_quilt_old.F 2931 F
> [10] module_io_quilt_old.F 2931 F
> [11] module_io_quilt_old.F 2931 F
> [23] starting wrf task 23 of 28
> [23] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [23] ...sending output to standard output and continuing.
> [3] starting wrf task 3 of 28
> [3] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [3] ...sending output to standard output and continuing.
> [25] starting wrf task 25 of 28
> [14] starting wrf task 14 of 28
> [14] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [14] ...sending output to standard output and continuing.
> [25] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [25] ...sending output to standard output and continuing.
> [19] starting wrf task 19 of 28
> [19] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [19] ...sending output to standard output and continuing.
> [27] starting wrf task 27 of 28
> [27] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [27] ...sending output to standard output and continuing.
> [4] starting wrf task 4 of 28
> [4] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [4] ...sending output to standard output and continuing.
> [4] module_io_quilt_old.F 2931 F
> [5] starting wrf task 5 of 28
> [5] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [5] ...sending output to standard output and continuing.
> [5] module_io_quilt_old.F 2931 F
> [6] module_io_quilt_old.F 2931 F
> [21] starting wrf task 21 of 28
> [21] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [21] ...sending output to standard output and continuing.
> [18] starting wrf task 18 of 28
> [18] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [18] ...sending output to standard output and continuing.
> [2] starting wrf task 2 of 28
> [2] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [2] ...sending output to standard output and continuing.
> [2] module_io_quilt_old.F 2931 F
> [3] module_io_quilt_old.F 2931 F
> [24] starting wrf task 24 of 28
> [7] starting wrf task 7 of 28
> [24] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [24] ...sending output to standard output and continuing.
> [16] starting wrf task 16 of 28
> [7] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [7] ...sending output to standard output and continuing.
> [7] module_io_quilt_old.F 2931 F
> [16] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [16] ...sending output to standard output and continuing.
> [16] module_io_quilt_old.F 2931 F
> [18] module_io_quilt_old.F 2931 F
> [19] module_io_quilt_old.F 2931 F
> [24] module_io_quilt_old.F 2931 F
> [25] module_io_quilt_old.F 2931 F
> [1] starting wrf task 1 of 28
> [1] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [1] ...sending output to standard output and continuing.
> [1] module_io_quilt_old.F 2931 F
> [15] starting wrf task 15 of 28
> [15] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [15] ...sending output to standard output and continuing.
> [12] starting wrf task 12 of 28
> [26] starting wrf task 26 of 28
> [12] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [12] ...sending output to standard output and continuing.
> [12] module_io_quilt_old.F 2931 F
> [14] module_io_quilt_old.F 2931 F
> [15] module_io_quilt_old.F 2931 F
> [26] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [26] ...sending output to standard output and continuing.
> [26] module_io_quilt_old.F 2931 F
> [27] module_io_quilt_old.F 2931 F
> [13] starting wrf task 13 of 28
> [13] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [13] ...sending output to standard output and continuing.
> [13] module_io_quilt_old.F 2931 F
> [17] starting wrf task 17 of 28
> [17] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [17] ...sending output to standard output and continuing.
> [17] module_io_quilt_old.F 2931 F
> [20] starting wrf task 20 of 28
> [20] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [20] ...sending output to standard output and continuing.
> [20] module_io_quilt_old.F 2931 F
> [21] module_io_quilt_old.F 2931 F
> [22] starting wrf task 22 of 28
> [22] error_dup: cannot open rsl.out.nnnn: Read-only file system
> [22] ...sending output to standard output and continuing.
> [22] module_io_quilt_old.F 2931 F
> [23] module_io_quilt_old.F 2931 F
> [4] Quilting with 1 groups of 0 I/O tasks.
> [5] Quilting with 1 groups of 0 I/O tasks.
> [7] Quilting with 1 groups of 0 I/O tasks.
> [11] Quilting with 1 groups of 0 I/O tasks.
> [1] Quilting with 1 groups of 0 I/O tasks.
> [2] Quilting with 1 groups of 0 I/O tasks.
> [3] Quilting with 1 groups of 0 I/O tasks.
> [6] Quilting with 1 groups of 0 I/O tasks.
> [8] Quilting with 1 groups of 0 I/O tasks.
> [9] Quilting with 1 groups of 0 I/O tasks.
> [10] Quilting with 1 groups of 0 I/O tasks.
> [12] Quilting with 1 groups of 0 I/O tasks.
> [13] Quilting with 1 groups of 0 I/O tasks.
> [14] Quilting with 1 groups of 0 I/O tasks.
> [15] Quilting with 1 groups of 0 I/O tasks.
> [16] Quilting with 1 groups of 0 I/O tasks.
> [17] Quilting with 1 groups of 0 I/O tasks.
> [18] Quilting with 1 groups of 0 I/O tasks.
> [19] Quilting with 1 groups of 0 I/O tasks.
> [20] Quilting with 1 groups of 0 I/O tasks.
> [21] Quilting with 1 groups of 0 I/O tasks.
> [22] Quilting with 1 groups of 0 I/O tasks.
> [23] Quilting with 1 groups of 0 I/O tasks.
> [24] Quilting with 1 groups of 0 I/O tasks.
> [25] Quilting with 1 groups of 0 I/O tasks.
> [26] Quilting with 1 groups of 0 I/O tasks.
> [27] Quilting with 1 groups of 0 I/O tasks.
> [2] Quilting with 1 groups of 0 I/O tasks.
> [4] Quilting with 1 groups of 0 I/O tasks.
> [5] Quilting with 1 groups of 0 I/O tasks.
> [7] Quilting with 1 groups of 0 I/O tasks.
> [10] Quilting with 1 groups of 0 I/O tasks.
> [11] Quilting with 1 groups of 0 I/O tasks.
> [12] Quilting with 1 groups of 0 I/O tasks.
> [13] Quilting with 1 groups of 0 I/O tasks.
> [15] Quilting with 1 groups of 0 I/O tasks.
> [16] Quilting with 1 groups of 0 I/O tasks.
> [18] Quilting with 1 groups of 0 I/O tasks.
> [20] Quilting with 1 groups of 0 I/O tasks.
> [21] Quilting with 1 groups of 0 I/O tasks.
> [23] Quilting with 1 groups of 0 I/O tasks.
> [25] Quilting with 1 groups of 0 I/O tasks.
> [27] Quilting with 1 groups of 0 I/O tasks.
> [1] Quilting with 1 groups of 0 I/O tasks.
> [3] Quilting with 1 groups of 0 I/O tasks.
> [6] Quilting with 1 groups of 0 I/O tasks.
> [8] Quilting with 1 groups of 0 I/O tasks.
> [9] Quilting with 1 groups of 0 I/O tasks.
> [14] Quilting with 1 groups of 0 I/O tasks.
> [17] Quilting with 1 groups of 0 I/O tasks.
> [19] Quilting with 1 groups of 0 I/O tasks.
> [22] Quilting with 1 groups of 0 I/O tasks.
> [24] Quilting with 1 groups of 0 I/O tasks.
> [26] Quilting with 1 groups of 0 I/O tasks.
> [0] Quilting with 1 groups of 0 I/O tasks.
> [0] Quilting with 1 groups of 0 I/O tasks.
> [0] -------------- FATAL CALLED ---------------
> [0] FATAL CALLED FROM FILE: <stdin> LINE: 8432
> [0] ERROR OPENING namelist.output
> [0] -------------------------------------------
> [0] -------------- FATAL CALLED ---------------
> [0] FATAL CALLED FROM FILE: <stdin> LINE: 8432
> [0] ERROR OPENING namelist.output
> [0] -------------------------------------------
> [0] [cli_0]: aborting job:
> [0] application called MPI_Abort(MPI_COMM_WORLD, 1) - process 0
>
> ===================================================================================
> = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
> = PID 57376 RUNNING AT compute-01
> = EXIT CODE: 1
> = CLEANING UP REMAINING PROCESSES
> = YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
> ===================================================================================
>
> Elliot Tardif, Meteorologist II
> NC DEQ, Division of Air Quality
> Planning Section, Attainment Planning Branch
> 1641 Mail Service Center
> Raleigh, NC 27699-1641
> Phone/Fax: 919-707-8483
> Email: elliot.tardif at ncdenr.gov
> http://www.ncair.org
> Email correspondence to and from this address is subject to the
> North Carolina Public Records Law and may be disclosed to third parties.
>
> _______________________________________________
> Wrf-users mailing list
> Wrf-users at ucar.edu
> http://mailman.ucar.edu/mailman/listinfo/wrf-users