Scientific Investigations; Support from Research Data Archives for Computing in Atmospheric Sciences 2001 29 October, 2001 Steven Worley National Center for Atmospheric Research Scientific Computing Division
Page 1

Scientific Investigations; Support from Research Data Archives

for

Computing in Atmospheric Sciences 2001

29 October, 2001

Steven Worley

National Center for Atmospheric Research

Scientific Computing Division

Page 2

Key Steps of Scientific Investigations

• Formulate the questions and review the state of understanding

• Search and discover data
• Access data
• Analyze data
• Community sharing and archive
• Document new understandings

Page 3

Search and Discover Data

• How? Web-based information server
• Salient features
  – 2.5K+ HTML pages (metadata)
  – All datasets are described (500+)
  – Location of all data files in MSS
  – Higher level information
    • Catalogs
    • Project specific descriptions

Always current dataset descriptions

Page 4

Features

• Organization Navigation

• Archive Navigation

• Pull down menus

• Search

• Project Links

Page 5

Dataset Page

• Title and Brief description

• Systematic Navigation

• Metadata highlights

• Period of Record

• Usage

• Variables

• Related Sites (NOAA)

• Contact Person

• Related Datasets

Page 6

Brief Archive History and Specifications

• Started in the mid-1960s (35 years)

• Managed by nine people

• 211K data files

• 17 TB in an MSS

• 530 datasets – all sizes

Page 7

Global Observations

Dataset | P.O.R. | # Yrs | Incep. Date | Comments
Rawinsondes | 1946-on | 55 | 1967 | Upper Air
Pibals | 1942-on | 59 | 1973 | Upper Air, wind
Aircraft | 1947-on | 52 | 1973 | USAF and Commer.
Sat. cloud wind drift | 1967-on | 34 | 1973 | GOES and GTS
Satellite Soundings | 1969-92 | 25 | 1973 | TOVS + irradiance
Surface Synoptic | 1948-on | 53 | 1975 | some much older
Ocean Surface | 1794-on | 203 | 1981 | COADS

Usages:

• Input for global atmospheric reanalysis

• Basic long term climate assessment and case studies

Page 8

Operational and Composite Analyses

Product | P.O.R. | Comments

U.S. Analyses for the N. H. (early operational outputs and composites)
Daily SLP Analysis | 1889-on | Composite of data sources, 2 x daily later period
Selected Early Analyses | 1946, 1950-on | 700mb, 500mb, 300mb
NMC Oper. Analysis | 1962-on | Z & T @ 10mb – sfc. (11 lev)

Global Operational Analyses
NCEP/NMC | 1976-on | Many levels and variables
ECMWF | 1980-on | Many levels and variables

Special Analyses
Australian | 1972-1992 | Discontinued
FNOC (U.S. Navy) | 1973-1993 | Discontinued

• Daily SLP is a small but very popular dataset, e.g. NAO evaluations

• Two main operational centers provide the best current analyses

Page 9

ECMWF Global Operational Analyses

Data Product | Period of Record | Temporal Res. | Spatial Res. (dg) | Update Cycle | # Levs. | # Vars. | Major Variables
Upper Air | 1985-06/2001 | 6 hr | ~1.125 | 6 mn | 21 | 8 | z,t,wind,rh
Surface | 1985-06/2001 | 6 hr | ~1.125 | 6 mn | 1 | 47 | p,t,wind,soil.t,soil.moist.
Supplemental | 1985-06/2001 | 6 hr | ~1.125 | 6 mn | | 16 | rad.,stress,heat.flux,clouds
Extension | 1991-06/2001 | 6 hr | ~1.125 | 6 mn | | 18 | precip,heat.flux
Sfc/Up.Air Low Resolution | 1985-06/2001 | 12 hr | 2.5 | 1 mn | 21+ | 14 | sfc.t,sfc.p,z,t,wind,rh
Sfc/Up.Air † Low Resolution | 1985-06/2001 | 1 mn | 2.5 | ~1 mn | 21+ | 14 | sfc.t,sfc.p,z,t,wind,rh

† Computed by the SCD/DSS

Key Aspects
• Medium size archive – 170 Gigabytes
• Multi-(product, temporal res., spatial res.) – complex

Concerns:
• Restricted distribution
• U.S. non-profits and UCAR members only
• Need online authentication and authorization for easy access
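The restriction above can be expressed as a simple authorization rule. The following is a minimal sketch only: the `User` record, its field names, and the function name are hypothetical, and the slide's wording is read here as "either a U.S. non-profit or a UCAR member". It does not describe any system actually deployed for the archive.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class User:
    """Hypothetical user record; the field names are illustrative only."""
    country: str        # country code of the user's institution, e.g. "US"
    non_profit: bool    # institution is a non-profit organization
    ucar_member: bool   # institution is a UCAR member


def may_access_restricted(user: User) -> bool:
    """Return True if the user is in one of the two groups named on the slide.

    Assumption: the two groups are alternatives (a union), not a joint
    requirement.
    """
    us_non_profit = user.country == "US" and user.non_profit
    return us_non_profit or user.ucar_member
```

Authentication (establishing who the user is) would have to happen before a check like this; the sketch covers only the authorization decision.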

Page 10

NCEP Operational Analyses

Data Product | Period of Record | Temporal Res. | Spatial Res. (dg) | Update Cycle | # Levs. | # Vars. | Major Variables
Final Analysis Global 2.5 | 1976-08/2001 | 6 hr | 2.5 | 1 mn | 11+ | 15 | z,t,wind,rh,sfc.t,sfc.p
Final Analysis Global 1.0 | 09/1999-today | 6 hr | 1.0 | Daily (FTP) | 26+ | 71 | z,t,wind,rh,vorticity,sfc.t,sfc.p
ETA-3D N. America | 05/1995-07/2001 | 6 hr | 40 (km) | 1 mn | 26+ | 5 | z,t,wnd,sh,precip(forecast)
ETA-Surface N. America | 05/1995-07/2001 | 6 hr | 40 (km) | 1 mn | | 12 | wind,sfc.p,sfc.t,soil.t,soil.p

LFM (1971-1995) and NGM (1984-cont), N. America, 190km and 6 hr resolution, are available but ETA is considered a superior replacement.

Highlights

• Frequent updates to FNL, 1°, daily via FTP

• High resolution N. America product, ETA at 40km

• No distribution restrictions or cost

Page 11

Reanalyses

Reanalysis | P.O.R. | # Yrs | Incep. Date
NCEP/NCAR Reanalysis I | 1948-06/2001 | 53 | 1994
ECMWF ERA-15 | 1979-1993 | 15 | 1994
NCEP Reanalysis II | 1979-06/2001 | 22 | 1998

Notes:

• ERA-15 is finished, ERA-40 is running now

• NCEP II, primarily experimental run

Page 12

NCEP/NCAR Global Atmospheric Reanalysis

Data Product | Period of Record | Temporal Res. | Spatial Res. (dg) | Update Cycle | # Levs. | # Vars. | Major Variables
Analysis on Pressure Sfc. | 1948-6/2001 | 6 hr | 2.5 | 1-2 mn | 17 | 7 | u,v,z,t,rh
Analysis on Sigma Sfc. | 1948-6/2001 | 6 hr | 192x94 Gaussian | 1-2 mn | 28 | 6 | u,v,t,sph,rel.vort
Analysis on Theta Sfc. | 1948-6/2001 | 6 hr | 2.5 | 1-2 mn | 11 | 10 | N**2,ab.vort,u,v,t,rh,pot.vort
Surface Flux Fields | 1948-6/2001 | 6 hr | 2.5 | 1-2 mn | | 12 | clouds,rad.flx,soil.moist,heat.flx,precip
Monthly Mean Anal. P. Sfc. | 1948-2000 | 1 mn | 2.5 | 1-2 mn | 17+ | 36 | u,v,z,t,rh
CD-ROMS | 1953-1999 | 12 hr, 1 day, 1 mn | 2.5 | | 3-6 | 12 | u,v,z,t,rh,heat.flx,rad.flx,precip

Model qc’ed observations are returned. Forecasts: once every 5 days, forecast fields, 6 hr, available out to 8 days.

Outstanding Features
• Three different coordinate surfaces
• Very long analysis, 2+ Terabytes size
• Unrestricted distribution
• CD-ROMS are very popular

Page 13

Countries Receiving Reanalysis CDROMs

Highlights
• Over 8900 CDROMs, 1997-09/2001
• Recipients: U.S. 46%, Japan 11%, (Canada, UK) 4%, (Germany, India) 3%, (Australia, S.Korea, Spain, Mexico, Norway, Russia, France) 2%

Page 14

Reanalysis Users for 2001 (4th qtr estimated)

209 From the MSS [157 Jan.-Sep.]
47 On CDROM [35]
48 Custom data orders on FTP or Tape [36]
540 From the online server [406]

844 Total Served

[Figure: NCEP/NCAR Reanalysis from the MSS. Unique users by year, 1995-2001; series: NCAR users, Univ. users, other users, and estimate.]

Page 15

Reanalysis Data Distributed for 2001 (4th qtr estimated)

• 9616 GB from the MSS [7230 GB Jan.-Sep.]
• 808 GB on CD-ROM [935 CD-ROMs at 650 MB/CD-ROM]
• 1383 GB custom orders, FTP and tape [1040 GB]
• 88 GB from the online server [66 GB]

11895 GB (11.9 TB) total
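The full-year figures above can be reproduced from the bracketed Jan.-Sep. values. The sketch below is a consistency check, not the author's stated method: it assumes a scale factor of 1.33 (12 months over 9 months, rounded), reads the bracketed 935 as a count of CD-ROMs, and takes 650 MB as 0.650 GB. None of these is spelled out on the slide.

```python
# Jan.-Sep. 2001 figures, taken from the brackets on the slide.
CDROM_COUNT = 935     # assumed to be the number of CD-ROMs shipped
CDROM_GB = 0.650      # 650 MB per CD-ROM, taken as 0.650 GB

jan_sep_gb = {
    "MSS": 7230,
    "CD-ROM": CDROM_COUNT * CDROM_GB,
    "Custom orders": 1040,
    "Online server": 66,
}

# Assumption: 12/9 months rounded to 1.33; the slides do not state the factor.
SCALE = 1.33

# Scale each Jan.-Sep. value to a full-year estimate, rounded to whole GB.
full_year_gb = {name: round(gb * SCALE) for name, gb in jan_sep_gb.items()}
total_gb = sum(full_year_gb.values())

for name, gb in full_year_gb.items():
    share = 100 * gb / total_gb
    print(f"{name:14s} {gb:6d} GB  ({share:4.1f}% of total)")
print(f"{'Total':14s} {total_gb:6d} GB  = {total_gb / 1000:.1f} TB")
```

Under these assumptions the four estimates come out to 9616, 808, 1383, and 88 GB, matching the slide, and the printed shares show that the MSS carries the bulk of the volume while the online server carries under 1%.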

[Figure: NCEP/NCAR Reanalysis from the MSS. Data amount (GB) by year, 1995-2001; series: NCAR, Univ., other, and estimate.]

Page 16

GCIP Model Data Center Collection

High resolution atmospheric models focused on energy and hydrology cycles.

GCIP: GEWEX Continental-Scale International Project / GEWEX : Global Energy and Water Cycle Exper.

• Critical data for N. American mesoscale studies
• Complete archive is about 1 Terabyte

Eta – NCEP: 3 hr, 40 km, 25 lvs, 5/1995 – 7/2001
MAPS – FSL NOAA: 3 hr, 40 km, 5 lvs, 8/1996 – 7/2001
GEM – Canadian: 6 hr, 41 km, 28 lvs, 4/1997 – 6/2001

Page 17

Ocean Model Data

MICOM: Miami Isopycnic Coordinate Ocean Model, 1/12th degree, 70N to 28S, 16-20 layers

COADS Clim. Forcing: 6 yrs, 305 Gigabytes
ECMWF Clim. Forcing: 2 yrs, 164 Gigabytes
ECMWF Daily Forcing: 5 yrs, 415 Gigabytes (1979-1983)

[Figure: 6-yr Mean T at 5 meters, University of Miami]

Page 18

Dataset Sizes and Scales

• Today
  – ~800 unique users
  – ~12 Terabytes data transferred
  – 2 Terabyte dataset size
  – Example: NCEP/NCAR Reanalysis
• Near Future (excludes TB-PB Level 0 and 1 satellite and the super scale experimental models)
  – Numbers of users, ~ same
  – Data transferred, 5x to 10x more?
  – Dataset size, 2-20 TB
  – Examples:
    • Ocean and Atmosphere models
    • ECMWF Reanalysis (ERA40)

Page 19

Access to Data

Methods
• NCAR computers
  – From the local MSS
• Web data server
• Custom data packages
  – by request (FTP, tape, CDROM)

Users
• World class programmer
• Research Scientist
• Graduate Students
• Undergraduate Students

Page 20

Data Access in the future

• Do we continue doing what we are doing?

“Absolutely”

Why? It works
– Over 1000 users annually
  • Very diverse skills
– The archive is a heterogeneous collection
  • Many formats (ASCII, Binary, GRIB, BUFR, netCDF, HDF)
  • Many sizes (1 MB to 2 TB)
– Capable of serving large and small projects

Maintain a variety of flexible methods
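One practical consequence of a heterogeneous archive is that tools often need to identify a file's format before reading it. Several of the formats listed identify themselves in their leading bytes. The sketch below is illustrative only: the function name and the fallback labels are hypothetical, and it checks only the very start of the data, whereas real GRIB or BUFR files may carry extra header bytes before the first message.

```python
# Leading "magic" bytes for the self-identifying binary formats in the list.
MAGIC_PREFIXES = [
    (b"GRIB", "GRIB"),
    (b"BUFR", "BUFR"),
    (b"CDF\x01", "netCDF (classic)"),
    (b"\x89HDF\r\n\x1a\n", "HDF5"),
    (b"\x0e\x03\x13\x01", "HDF4"),
]


def sniff_format(header: bytes) -> str:
    """Guess a data format from the first bytes of a file."""
    for prefix, name in MAGIC_PREFIXES:
        if header.startswith(prefix):
            return name
    # No signature matched: fall back to a crude text-versus-binary guess.
    try:
        header.decode("ascii")
        return "ASCII"
    except UnicodeDecodeError:
        return "Binary (unrecognized)"
```

In use, one would read the first few dozen bytes of a file opened in binary mode and pass them to `sniff_format`. Plain ASCII and site-specific binary files carry no signature, which is why dataset documentation remains necessary for them.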

Page 21

Data Access in the future

• Keys to handling future larger collections
  – Plan to create useful data products
    • Condensed datasets from high resolution output
    • Group most popular variables products together
  – Serve many, e.g. CDROMS and WWW
  – Continue to develop emerging online data systems
    • User driven subset selection with graphics and data download options
    • Server-side elementary analysis
      – Multi-dataset comparisons
      – Statistical summaries and basic meteorological calculations
  – Our development is the “Community Data Portal”
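To make "user driven subset selection" and "statistical summaries" concrete, here is a minimal sketch of both on a global 2.5 degree grid, the resolution of several products described earlier. It is not the Community Data Portal's implementation. The grid ordering (north to south, starting at 0E), the function names, and the synthetic test field are all assumptions made for illustration.

```python
import math
from statistics import mean

STEP = 2.5                                     # grid spacing in degrees
LATS = [90.0 - STEP * i for i in range(73)]    # 90N ... 90S (assumed order)
LONS = [STEP * j for j in range(144)]          # 0E ... 357.5E


def subset(field, lat_min, lat_max, lon_min, lon_max):
    """Return (lats, lons, values) for the grid points inside the box.

    `field` is a list of 73 rows with 144 values each. In this simplified
    sketch the box must not cross the 0E meridian.
    """
    rows = [i for i, lat in enumerate(LATS) if lat_min <= lat <= lat_max]
    cols = [j for j, lon in enumerate(LONS) if lon_min <= lon <= lon_max]
    values = [[field[i][j] for j in cols] for i in rows]
    return [LATS[i] for i in rows], [LONS[j] for j in cols], values


def area_weighted_mean(lats, values):
    """Mean over the subset, weighting each row by cos(latitude)."""
    weights = [math.cos(math.radians(lat)) for lat in lats]
    total = sum(w * sum(row) for w, row in zip(weights, values))
    return total / sum(w * len(row) for w, row in zip(weights, values))


def summarize(lats, values):
    """Basic statistical summary of a subset."""
    flat = [v for row in values for v in row]
    return {
        "n": len(flat),
        "min": min(flat),
        "max": max(flat),
        "mean": mean(flat),
        "area_weighted_mean": area_weighted_mean(lats, values),
    }


# Synthetic field for demonstration: each value equals its latitude.
field = [[lat] * len(LONS) for lat in LATS]
lats, lons, box = subset(field, 20.0, 50.0, 230.0, 300.0)
stats = summarize(lats, box)
print(stats)
```

Running the selection on the server and returning only the box and its summary is what keeps downloads small: the box in this example holds 377 of the grid's 10512 points. The cosine weighting reflects that grid cells on a regular latitude-longitude grid cover less area toward the poles.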

Page 22

Data Analysis

• Tools
  – NCAR Command Language (NCL) software
    • Features in brief
      – I/O for many ‘standard’ data formats
      – Easy adaptations to read any format
      – 100s of meteorological functions
      – “Publication quality” graphics
  – The CDP is capable of analysis
    • NCL is one of several middleware packages

Page 23

Community Sharing

• Support for the scientist
  – A place to distribute new data results
    • Possibly with authentication and authorization control
    • E.g. model outputs
  – Spin off benefit
    • New data resources for the archive
    • Many users can then use new product

Page 24

NCEP Operational Analyses blended with QSCAT Satellite data

Wind Stress Curl, 01/24/2000 1800 UTC

a) NCEP Operational ONLY

b) NCEP + QSCAT swaths

c) OI blend of NCEP + QSCAT

Blending by Colorado Research Associates

We archive all three products.


Page 25

Key Steps of Scientific Investigations

• Formulate the questions and review the state of understanding

• Search and discover data
• Access data
• Analyze data
• Community sharing and archive
• Document new understandings