<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

<html>

  <head>

    <meta content="text/html; charset=windows-1252"

      http-equiv="Content-Type">

  </head>

  <body bgcolor="#ffffff" text="#000000">

    <font face="Times New Roman">Hi all,<br>

      <br>

      I've been reading all the emails on this topic, but I haven't had

      a moment to respond.    I am not at all sure it is necessary for

      the DRS for observations to correspond with that for models; 

      afterall we're trying to describe stuff with different

      "attributes".  I think that ESG should be able to make it easy for

      users to find observations that are useful for CMIP5 without

      having to fit the observations into exactly the same structure. 

      So my general advice is to devise something that makes sense (and

      will not be too limiting in the future), rather than attempting to

      perfectly parallel CMIP5 DRS.<br>

      <br>

      I've also been in contact with the CMIP modeling panel, and the

      opinions received so far is to avoid direct mention of CMIP5

      anywhere in the observation metadata.  CMIP does not want to

      appear to endorse any particular data.   On the other hand, the

      panel has expressed strong support for putting observations in a

      form that makes it easy to compare the models against.  They are

      very enthusiastic about the efforts of this (and any other

      similar) groups.    We plan to include on the CMIP5 website a list

      of observational products that have been prepared using CMOR (or

      with software that produces a similar result), and indicate that

      this makes it easier to use these products.  At the same time we

      will have to make it clear that CMIP is not necessarily endorsing

      these products as superior to alternative products (which might be

      more difficult to use but could conceivably be based on more

      reliable observations).  <br>

      <br>

      As for users searching for data that can be compared to CMIP5, I

      think we'll be o.k. as long as the following are included in the

      DRS (and are consistent with the vocabulary used to describe the

      models):<br>

      <br>

      variable name<br>

      modeling realm<br>

      frequency<br>

      <br>

      Other search categories shouldn't have to be consistent with CMIP5

      and could be omitted if inappropriate.  Similarly others could be

      added that are not used for models, and others could be modified

      somewhat, if necessary.<br>

      <br>

      Hope this makes designing a rational DRS for observational

      products easier.<br>

      <br>

      Best regards,<br>

      Karl<br>

    </font><br>

    On 1/31/11 1:25 PM, Lynnes, Christopher S. (GSFC-6102) wrote:

    <blockquote cite="mid:E17A3CA0-3215-482B-BEE2-E2C0D5E9DF5F@nasa.gov"

      type="cite">

      <pre wrap="">

On Jan 31, 2011, at 1:20 PM, Steve Hankin wrote:

</pre>

      <blockquote type="cite">

        <pre wrap="">Hi Luca,

Just below is a version of your email of this morning which has minor editorial changes (in red) to reflect a broader outlook on the term "observations", and an initial proposal for gridded in situ observations.    I think if we adopt language like this we will be glad of it in the long term:

    - Steve

Open Questions

        • Agree on basic DRS-like structure for observations and meaning of each field

                • Use activity="cmip5" or activity="cmip54obs", "obs4models" ?

                • DRS hierarchy for models (partial): "institute" &gt; "model" &gt; "experiment" &gt; "ensemble"

                • Current proposed hierarchy for gridded remote sensed observations (partial): "institute" (="agency") &gt; "mission" &gt; "instrument" &gt; ("processing level" + "other specifiers")

                • Alternative proposal for gridded remote sensed observations (partial): "institute" (="agency") &gt; "instrument" &gt; "processing level" &gt; ?

</pre>

      </blockquote>

      <pre wrap="">

Gridded remote sensed observations are by definition Level 3 (in NASA terminology at least), so processing level is mostly redundant in this context.

Processing level is also not especially analogous to "ensemble", perhaps making it a bit non-intuitive. (But does that matter to this user community?)  

So, would it be completely out of place to put an identifier for the processing algorithm and version in at this point?  For instance, there are some variables (e.g. Ozone) that are computed from the same instrument but using two different algorithms. And of course, the processing code version (or data collection version) can also have multiple instances for the same variable.

</pre>

      <blockquote type="cite">

        <pre wrap="">                • Need a proposal for gridded in situ observations, say:  "institute" (="agency") &gt; "program" &gt; "resolution"&gt;                 "variant" ?

                        • e.g. NOAA/NCAR / ICOADS / 1degree / equatorial / 

                • Should we even attempt to describe in situ (point and trajectory) observations through DRS?  (can we succeed at this?)

        • Decide on whether to have one single CMOR table for observations (currently "obsSites"), or more than one depending on types of observational data: 

                • remote sensed (grids and swaths)

                • in-situ stations (time series and profiles)

                • trajectory-based observations

                • in-situ gridded products

        • Decide on whether to encode global attributes for data source in netcdf files ("source", "source_datastream", "source_url", "source_reference")

Action Items

        • Populate CV for observations (decide on upper/lower case)

        • Produce some reference datasets

        • Develop snippet of "esg.ini" (ESG publisher) configuration for processing observations

On 1/31/2011 6:58 AM, Cinquini, Luca (3880) wrote:

</pre>

        <blockquote type="cite">

          <pre wrap="">Hi all,

 thanks to everybody for the lively discussion... I just wanted to summarize what I think is the status so far - I posted this also on the wiki. Please keep the discussion going...

thanks, Luca

Open Questions

        • Agree on basic DRS-like structure for observations and meaning of each field

                • Use activity="cmip5" or activity="cmip54obs", "obs4models" ?

                • DRS hierarchy for models (partial): "institute" &gt; "model" &gt; "experiment" &gt; "ensemble"

                • Current proposed hierarchy for observations (partial): "institute" (="agency") &gt; "mission" &gt; "instrument" &gt; ("processing level" + "other specifiers")

                • Alternative proposal for observations (partial): "institute" (="agency") &gt; "instrument" &gt; "processing level" &gt; ?

        • Decide on wether to have one single CMOR table for observations (currently "obsSites"), or more than one depending on types of observational data: 

                • remote sensed (grids and swaths)

                • in-situ stations (time series and profiles)

                • trajectory-based observations

                • in-situ gridded products

        • Decide on wether to encode global attributes for data source in netcdf files ("source", "source_datastream", "source_url", "source_reference")

Action Items

        • Populate CV for observations (decide on upper/lower case)

        • Produce some reference datasets

        • Develop snippet of "esg.ini" (ESG publisher) configuration for processing observations

On Jan 31, 2011, at 2:42 AM, Bryan Lawrence wrote:

</pre>

          <blockquote type="cite">

            <pre wrap="">

Hi Folks

Sorry I've come late to this discussion.

I've got just two simple points to make (for now).

Firstly, I think this activity (organising observational data to support 

CMIP5) is not the same as cmip5 itself.  Sooner or later things break in 

the DRS heirarchy doing this, and so I think it would be helpful to all 

consumers if this was dealt with at the top level of the DRS. 

Additionally, remember obs are timeless, models are not. This data will 

be useful beyond cmip5 (e.g. cordex). So I recommend *not*  shoehorning 

all the obs data under cmip5 in the DRS.

The DRS allows you to define new activiites, and I'd do so, something 

like obs4models or (if you must) cmip5obs ....

Otherwise I'm fine with the approach. 

Secondly, wrt to Steve's list below: there is of course swath data as 

well ... but otherwise I rather agree that 3 and 1 can be handled the 

same.

Cheers

Bryan

</pre>

          </blockquote>

          <pre wrap="">

</pre>

        </blockquote>

      </blockquote>

      <pre wrap="">

--

Dr. Christopher Lynnes     NASA/GSFC, Code 610.2    phone: 301-614-5185

</pre>

    </blockquote>

  </body>

</html>