[Go-essp-tech] CMIP5 data archive size estimate

Karl Taylor taylor13 at llnl.gov
Wed Dec 9 01:05:56 MST 2009


Dear all,

I promised to send these spreadsheets to you today, but I don't have 
time to explain them.  Here are some quick notes:

0.  I've only attached the .xlsx version.  The .xls version is 40 
megabytes, so I can't send it by email.  I'll try to find another way to 
get it to you tomorrow.

1.  Estimates are based on input from modeling groups collected more 
than a year ago.

2.  I think only about 2/3 of the models are included in the estimate.

3.  The estimate assumes that every experiment a group designated as 
at least 66% likely to be performed will actually be run.  (This perhaps 
approximately offsets the fact that not all groups have provided input 
yet.)

4.  You can't rely on any single piece of information in the spreadsheet 
(it's all completely unofficial), but the estimate of archive size under 
the stated assumptions is probably correct.

5.  There are no estimates of the number of "atomic datasets" or the 
number of files per atomic dataset.

6.  I think that in at least one place gigabytes should have read bytes, 
but that should be obvious.

7.  There are estimates for size at the end of 2010 and at the end of 
2014, but I didn't ask groups for their timelines, so these estimates 
are identical.

8.  There are estimates for "requested output" volume and "replicated" 
output volume.  

9.  The tables of variables that are referred to in the spreadsheets can 
be found at: 
http://cmip-pcmdi.llnl.gov/cmip5/data_description.html?submenuheader=1

10.  Bottom line:  about 1 petabyte of data will be replicated of the 
almost 2 petabytes requested.
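
The assumption in note 3 can be sketched as a simple filtered sum. This is only an illustration: the group names, experiment names, likelihoods, and volumes below are hypothetical and are not taken from the spreadsheet, and the function name is invented for the example.

```python
# Illustrative only: these records are hypothetical, not spreadsheet values.
LIKELIHOOD_THRESHOLD = 0.66  # note 3: count experiments rated 66% likely or better


def estimate_archive_tb(experiments, threshold=LIKELIHOOD_THRESHOLD):
    """Sum the output volume (terabytes) of experiments at or above the threshold."""
    return sum(e["volume_tb"] for e in experiments if e["likelihood"] >= threshold)


experiments = [
    {"group": "A", "name": "exp1", "likelihood": 1.00, "volume_tb": 40.0},
    {"group": "A", "name": "exp2", "likelihood": 0.66, "volume_tb": 25.0},
    {"group": "B", "name": "exp1", "likelihood": 0.33, "volume_tb": 60.0},
]

total_tb = estimate_archive_tb(experiments)
print(f"Estimated archive size: {total_tb:.1f} TB")
```

Under this rule an experiment rated below the threshold contributes nothing, while one at or above it is counted at full volume, which is why the result is an estimate under stated assumptions rather than an expected value.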

Best regards,
Karl
-------------- next part --------------
A non-text attachment was scrubbed...
Name: CMIP5_archive_size.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 9508469 bytes
Desc: not available
Url : http://mailman.ucar.edu/pipermail/go-essp-tech/attachments/20091209/0588638d/attachment-0001.bin 

