<p><b>weiwang</b> 2008-05-13 11:53:47 -0600 (Tue, 13 May 2008)</p><p>update var from Dale<br>

</p><hr noshade><pre><font color="gray">Modified: trunk/wrf/technote/var.tex

===================================================================

--- trunk/wrf/technote/var.tex        2008-05-13 01:09:49 UTC (rev 67)

+++ trunk/wrf/technote/var.tex        2008-05-13 17:53:47 UTC (rev 68)

@@ -113,6 +113,13 @@

 \section{Improvements to the WRF-Var Algorithm}

 \label{var-upgrade}

+The latest version of WRF-Var (V3.0) contains a number of improvements relative

+to that described in the MM5 3DVAR technical note (\citet{barker03}). These are described below.

+It should also be noted that the public release of WRF-Var V3.0 contains only a subset of

+the capabilities of the full WRF-Var system. In particular, the direct assimilation of radiances, 

+hybrid variational/ensemble data assimilation technique, and 4D-Var will be released once

+funding to support these complex algorithms is available.

+

 \subsection{Improved vertical interpolation}

 The original WRF 3D-Var system described in \citet{barker04} used height 

@@ -142,11 +149,11 @@

 in subsequent outer loops. The assimilation of nearby observations in previous iterations 

 essentially provides a ``buddy check&quot; to the observation in question.

-\subsection{Flexible choice of control variables}

+\subsection{Choice of control variables}

 \label{var-cvs}

-In practical variational data assimilation schemes, the background error covariance 

-matrix ${\bf B}$ is computed not in model space ${\bf x}': u, v, T, q, p_{s}$, but in a 

+A major change that users of previous versions of WRF-Var will notice, is the simplification

+of the background error covariance model used within WRF-Var. As before, the background error covariance matrix ${\bf B}$ is computed not in model space ${\bf x}': u, v, T, q, p_{s}$, but in a 

 control variable space ${\bf v}$ related to model space via the control variable transform ${\rm U}$, 

 i.e.,

@@ -160,129 +167,11 @@

 ${\rm U}_{p}$.

 The components of ${\bf v}$ are chosen so that their error cross-correlations are negligible, 

-thus permitting the matrix ${\bf B}$ to be block-diagonalized. The many varying applications 

-(high/low resolution, polar/tropical, etc.) of WRF-Var require a flexible 

-choice of background error model. This is achieved via a namelist option 

-``cv$\_$options&quot; as defined in Table \ref{var-cvtable}.

+thus permitting the matrix ${\bf B}$ to be block-diagonalized. The major change in WRF-Var V3.0

+is to simplify the control variable transform ${\rm U_p}$ to perform a simple statistical regression as described in subsection 

+(\ref{var-b}) below. Testing in numerous applications has shown 

+a general improvement of forecasts scores using this definition of balance, as compared to the dynamical geostrophic//cyclostrophic balance constraint defined in \citet{barker03}.

-\begin{table}[h]

-\begin{center}

-\begin{tabular}{|l|l|l|l|l|l|}

-\hline

-% Line 1

- { } &amp;

- \ &amp;

- \multicolumn{1}{c|}{2} &amp;

- \multicolumn{1}{c|}{3} &amp; 

- \multicolumn{1}{c|}{4} &amp;

- \multicolumn{1}{c|}{5} \\

- \raisebox{1.5ex}[0cm] {cv$\_$options} &amp; 

- \ &amp;

- \multicolumn{1}{c|}{(original MM5)} &amp;

- \multicolumn{1}{c|}{(NCEP)} &amp;

- \multicolumn{1}{c|}{(Global)} &amp;

- \multicolumn{1}{c|}{(Regional)}\\

- \hline

-% Line 2

- { } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{4}{c|}{ }\\

- \raisebox{1.5ex}[0cm] {Analysis} &amp; 

- \multicolumn{1}{c|}{${\bf x}'$} &amp;

- \multicolumn{4}{c|}{$u'$,$v'$,$T'$,$q'$,${p_s}'(i,j,k)$}\\

- \raisebox{1.5ex}[0cm] {Increment} &amp; 

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{4}{c|}{ }\\

- \hline

-% Line 3

- { } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{3}{c|}{ }\\

- \raisebox{1.5ex}[0cm] {Change of} &amp; 

- \multicolumn{1}{c|}{${\rm U}_p$} &amp;

- \multicolumn{1}{c|}{$\psi'$,$\chi'$,$p_u'$,$q'$} &amp;

- \multicolumn{3}{c|}{$\psi'$,$\chi_u'$,$T_u'$,$r'$,$p_{su}'$}\\

- \raisebox{1.5ex}[0cm] {Variable} &amp; 

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{3}{c|}{ }\\

- \hline

-% Line 4

- { } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{2}{c|}{ }\\

- \raisebox{1.5ex}[0cm] {Vertical} &amp;

- \multicolumn{1}{c|}{${\rm U}_v$} &amp;

- \multicolumn{1}{c|}{${\bf B}={\bf E}{\Lambda}{\bf E}^{T}$} &amp;

- \multicolumn{1}{c|}{RF} &amp;

- \multicolumn{2}{c|}{${\bf B}={\bf E}{\Lambda}{\bf E}^{T}$} \\

- \raisebox{1.5ex}[0cm] {Covariances} &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{2}{c|}{ }\\

- \hline

-% Line 5

- { } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{2}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ }\\

- \raisebox{1.5ex}[0cm] {Horizontal} &amp;

- \multicolumn{1}{c|}{${\rm U}_h$} &amp;

- \multicolumn{2}{c|}{RF} &amp;

- \multicolumn{1}{c|}{Spectral} &amp;

- \multicolumn{1}{c|}{RF}\\

- \raisebox{1.5ex}[0cm] {Correlations} &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{2}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ }\\

- \hline

-% Line 6

- { } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{2}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ }\\

- \raisebox{1.5ex}[0cm] {Control} &amp;

- \multicolumn{1}{c|}{${\bf v}$ } &amp;

- \multicolumn{1}{c|}{${\bf v}(i,j,m)$} &amp;

- \multicolumn{1}{c|}{${\bf v}(i,j,k)$} &amp;

- \multicolumn{1}{c|}{${\bf v}(l,n,m)$} &amp;

- \multicolumn{1}{c|}{${\bf v}(i,j,m)$}\\

- \raisebox{1.5ex}[0cm] {Variables} &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{2}{c|}{ } &amp;

- \multicolumn{1}{c|}{ } &amp;

- \multicolumn{1}{c|}{ }\\

- \hline

-\end{tabular}

-\end{center}

-\caption{The definitions of the various stages of the control

-     variable transform given by (\ref{var-cv}) for the unified global/regional

-     WRF-Var system. Indices $(i,j,k)$ refer to grid-point

-     space, index $m$ to vertical mode, and $l$, $n$ to global spectral mode.

-     The variables are: $u, v$: velocity components; $T$: temperature; $q$: mixing ratio;

-     $p_s$: surface pressure; $\psi$: streamfunction; $\chi$: velocity potential;

-     $r$: relative humidity. The subscript $u$ indicates an unbalanced field. The acronym RF stands for recursive filter.}

-\label{var-cvtable}

-\end{table}

-

-Table \ref{var-cvtable} indicates that the only difference between global (cv$\_$options=4) and WRF 

-regional (cv$\_$options=5) versions of the WRF-Var control variable 

-transform is in the horizontal error correlations ${\rm U}_{h}$. 

-Note also, the only difference between the old MM5 

-background error model (cv$\_$options=2) and WRF regional (cv$\_$options=5) is in the 

-${\rm U}_{p}$ transform. The former imposes a dynamical balance constraint via an 

-unbalanced pressure control variable \citep{barker04}, whereas in the new regional 

-covariance model, balance is imposed via statistical regression (see Section \ref{var-be} for 

-details). This choice of control variables is considered more appropriate for the 

-mass-based ARW solver.

-

 \subsection{First Guess at Appropriate Time (FGAT)}

 A First Guess at Appropriate Time (FGAT) procedure has been implemented in 

@@ -405,7 +294,7 @@

 required to specify and implement flow-dependent error covariances in 3/4D-Var is 

 significant.

-The NMC-method code developed for MM5 3D-Var \citep{barker04} is nearing the 

+The NMC-method code developed for MM5 3D-Var \citet{barker04} is nearing the 

 end of its useful life. The development of a unified global/regional WRF-Var system, and 

 its application to a variety of models (e.g., ARW, MM5, KMA global model, 

 Taiwan's Nonhydrostatic Forecast System [NFS]) has 

@@ -552,7 +441,7 @@

 control variable transform. This calculation involves the projection

 of 3D fields on model-levels onto empirical orthogonal functions

 (EOFs) of the vertical component of background error covariances

-\citep{barker04}. For each 3D control variable ($\psi$, $\chi_u$,

+\citet{barker04}. For each 3D control variable ($\psi$, $\chi_u$,

 $T_u$, and $r$), the vertical component of ${\bf B}$, is calculated

 and an eigenvector decomposition performed. The resulting eigenvectors

 ${\bf E}$ and eigenvalues $\Lambda$ are saved for use in WRF-Var.

@@ -570,7 +459,7 @@

 statistics are included in the dataset supplied to WRF-Var, allowing

 the choice between homogeneous (domain-averaged) or local

 (inhomogeneous) background error variances and vertical correlations

-to be chosen at run time \citep{barker04}.

+to be chosen at run time \citet{barker04}.

 Having calculated and stored eigenvectors and eigenvalues, the final

 part of {\it gen$\_$be$\_$stage3} is to project the entire sequence of

 3D control variable fields into EOF space ${\bf v_v}=U_{v}^{-1}{\bf

@@ -594,3 +483,45 @@

 as described in \citet{barker04} to provide correlation lengthscales

 for use in the recursive filter algorithm.

+\section{WRF-Var V3.0 Software Engineering Improvements}

+\label{se}

+

+A major overhall of the WRF-Var software has been performed for V3.0. The following

+is a summary:

+

+\subsection{Memory improvements}

+

+The WRF-Var registry had become bloated with WRF and U4D-Var 2d and 3d state variables that were unused in 

+3D-Var applications. These variables were allocated but uninitialised, and written to the analysis files. The removal 

+of these dummy variable has resulted in a significant (10-50\% depending on application) reduction in the memory 

+requirements of WRF-Var.

+

+\subsection{Four-Byte I/O}

+

+The WRF-Var algorithm requires eight-byte precision internally. However, it only needs to read and write 4-byte files. 

+Switching from 8-byte to 4-byte output in V3.0 has improved I/O performance and halves file sizes.

+

+\subsection{Switch from RSL to RSL\_LITE}

+

+The switch from RSL to RSL\_LITE has been made in V3.0 as the latter possesses 

+a simpler &quot;lighter&quot; communications layer, and has been shown to be scalable 

+to arbitrary domain sizes (largest to date: 4500x4500) and numbers of processors 

+(largest to date: 64K processors on Blue Gene). RSL\_LITE supports all capabilities of WRF, 

+including Halo and periodic boundary exchanges, Distributed I/O, Nesting and moving nests, 

+and parallel transposes. RSL\_LITE has also been simplified by dropping irregular decomposition, 

+load balancing, and ragged edge nesting, and the initialisation techniques improved 

+to avoid recalculation and give better scaling at higher processor counts.

+

+\subsection{Reorganisation of observation structures}

+

+The F90 derived data types used for observations have been rewritten to permit batches 

+of observations to be passed to subroutine calls, especially interpolation ones and 

+subsequently makes better use of cache memory.

+

+\subsection{Radar reflectivity operators redesigned}

+

+The efficiency of the coding of the radar observation operators has been improved. 

+Previously, routines were called once per observation, which would then recalculate common 

+factors before performing what was often just a one-line calculation. Re-writing the code 

+in V3.0 to move the calculations inside loops in the calling routine allows them to work 

+on batches of observations at a time, vastly improving cache hit rates and eliminating recalculation.

</font>

</pre>