|
1
|
- Dr. Alexander Sterin,
- RIHMI-WDC, Russia
|
|
2
|
- RIHMI-WDC is located in Obninsk, 100 km from Moscow
- RIHMI-WDC hosts 4 World Data Centers: Meteorology, Oceanography, Rockets
& Satellites, Earth Rotation
- RIHMI-WDC has one of the most wide collections of global meteorological,
aerological, hydrological, oceanographical data in the world
- The total volume of the digitized data in RIHMI-WDC is about 2 TBytes
- The total volume of hardcopy
undigitized data at RIHMI-WDC is questionable
|
|
3
|
- To obtain the estimates of such climatic trends in the global
temperature, one needs to process the total of about 200 GBytes of
primary observational data obtained for land surface, sea surface,
atmosphere
|
|
4
|
- Environmental observational Data collection and transfer (should not be
mentioned here)
- Data archiving and preservation
- Data quality assurance and control
- Methodology of Data processing
- Analysis of Data, getting and presenting new Information on Climate
Changes
|
|
5
|
- The equal absolutely identical copies of data archives must be located
on diverse media in diverse places (even in the diverse buildings)
- Periodical transfer to new media is required
- Old media and old media drives must not be lost
- Currently RIHMI-WDC collection is transferred from obsolete 9-track tape
media to new tape library media (SuperDLT tapes)
- The total amount of 9-track tapes is about 60,000
- Such transfer is not a simple one-moment action
|
|
6
|
- New solutions –technical, scientific, technological, financial,
organizational – are needed
- Special solutions on how to re-arrange Data portions are needed
- Catalogue of Data must be created in parallel with the copying process
- Metadata files must be created
- Data conditioning (integrity check) process must go in parallel
- All conditioning (integrity check) process must be put in protocol.
Protocol will be preserved
- A step back (or few steps back) must be possible at any point of the
process
|
|
7
|
- For QC of meteorological Data,
complex solutions that are based
on physical, meteorological, technological, linguistic considerations
are needed
- QC procedures must not change any values of Data – only QC flags must be
attributed to values, so that step back should be possible at any point
- QC algorithms are not magic – for meteorological data, they often
decline from QC solutions and flagging (time series with gaps, sparse
network of stations, etc.)
|
|
8
|
- Global climate generalizations (separate figures)
- Highly integrated climate data derivatives (miserable volume)
- Climate data derivatives based on inputs of next lower level derivatives
(small volumes)
- Climate data derivatives based on inputs of observational Data (modest
volumes)
- Next level: observational Data
after QC (huge volume)
- Lowest level of climate Data products: raw observational Data (huge
Volume)
|
|
9
|
- CARDS (Comprehensive Aerological Reference Data Set) – a joint project
of NCDC/NOAA, USA, and RIHMI-WDC, Russia
- MONADS –CARDS Derivative – MONthly Aerological Data Set – Monthly
statistics for climate research, calculated from CARDS observational
data
- Further aerological climate products are intended to be calculated from
MONADS, not from CARDS observational Data
|
|
10
|
- COADS – Comprehensive Ocean – Atmosphere Data Set
- COADS produced a lot of climate derivatives based on observational data
- REANALYSIS Projects: the main goal is to produce meteorological fields
for further climate studies and modelling
- REANALYSIS Project of NCAR/NCEP (USA) produced a lot of derivatives.
Multi-level structure of these derivatives is strongly evident
|
|
11
|
- Based on previous experience and on better understanding of climate
processes, several cycles are needed for data archiving and processing
within large climate-related data projects
- Technical progress requires to repeat cycles of climate data processing
& archiving
- All stages of archiving must be available: you need to reutilize data
collections, to update them and to do the recalculations of climate
products
- Normally, these cycles must be repeated once or twice per decade
|
|
12
|
- Lot of analysis methods must be applied to data for climate studies (elementary
statistics, data mining, regression, multidimensional analysis, cluster
analysis, time series analysis, spectral and wavelet analysis, etc.)
- Appropriate presentation instruments (graphs, maps, reports, etc) must
be applied
- Spatial and temporal inhomogeneities are typical for environmental data
- Special analyses enable to detect and to adjust these inhomegeneities
- Are adjusted Data better than unadjusted Data?
|
|
13
|
- The Software must:
- enable access to archived data in
their native formats
- support operations on all levels of the pyramid of climate products
- Have powerful analytical capacity
- Offer numeric graphical capabilities, including mapping
- Special original software ?
- SAS ® ?
|
|
14
|
- Getting new knowledge on climate changes needs processing of huge
volumes of observational Data from various platforms worldwide
- These Data are located in various
places, on various media and in various formats – you need to be ready
to overcome this diversity!
- These Data, both raw and QC’d, need to be archived, and transfer to new
archival media must go on, as soon as new technical and technological
solutions appear
|
|
15
|
- The multi-level structure of climate Data products enables to avoid repeated processing of huge volumes of
observational Data in most situations, but not always
- The multi-cycle processing of observational Data for climate studies is
natural in global climate projects
- To improve our knowledge of climate changes, you need to do numerous
inter-comparisons of Data from various sources
- However, our knowledge of climate changes remains incomplete and
uncertain!
|
|
16
|
|