Climate Data Formats Deniz Bozkurt

Slides:



Advertisements
Similar presentations
Data Formats: Using self-describing data formats Curt Tilmes NASA Version 1.0 Review Date.
Advertisements

The NCAR Command Language (NCL) and the NetCDF Data Format Research Tools Presentation Matthew Janiga 10/30/2012.
Reading NetCDF Files in Matlab and analyzing the data.
The Future of NetCDF Russ Rew UCAR Unidata Program Center Acknowledgments: John Caron, Ed Hartnett, NASA’s Earth Science Technology Office, National Science.
Introduction to NetCDF Ernesto Munoz. Outline Overview of NetCDF Overview of NetCDF NetCDF file information NetCDF file information CDL utilities: ncdump,
Fall 2006AE6382 Design Computing1 Matlab File & Directory Management Learning Objectives Define file input and output terminology Compare high and low.
CLIMATE SCIENTISTS’ BIG CHALLENGE: REPRODUCIBILITY USING BIG DATA Kyo Lee, Chris Mattmann, and RCMES team Jet Propulsion Laboratory (JPL), Caltech.
Активное распределенное хранилище для многомерных массивов Дмитрий Медведев ИКИ РАН.
Marine GIS Applications using ArcGIS Global Classroom training course Marine GIS Applications using ArcGIS Global Classroom training course By T.Hemasundar.
CODING SYSTEMS CODING SYSTEMS CODING SYSTEMS. CHARACTERS CHARACTERS digits: 0 – 9 (numeric characters) letters: alphabetic characters punctuation marks:
Status of netCDF-3, netCDF-4, and CF Conventions Russ Rew Community Standards for Unstructured Grids Workshop, Boulder
Chapter 2: C Fundamentals Dr. Ameer Ali. Overview C Character set Identifiers and Keywords Data Types Constants Variables and Arrays Declarations Expressions.
Data Formats: Using Self-describing Data Formats Curt Tilmes NASA Version 1.0 February 2013 Section: Local Data Management Copyright 2013 Curt Tilmes.
Bits & Bytes: How Computers Represent Data
Aloha Aloha What you see: What the computer sees: binary number columns binary number columns
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
Netcdf course Intro CF convention Netcdf excercises.
General Computer Science for Engineers CISC 106 Lecture 02 Dr. John Cavazos Computer and Information Sciences 09/03/2010.
Multidimensional Data and GIS Steve Kopp Nawajish Noman ESRI.
Unidata’s TDS Workshop TDS Overview – Part II October 2012.
Training on Meteorological Telecommunications Alanya, Turkey, September 2010 General Philosophy of Table Driven Code Forms Simon Elliott, EUMETSAT.
Scientific Computing Division A tutorial Introduction to Fortran Siddhartha Ghosh Consulting Services Group.
EARTH SCIENCE MARKUP LANGUAGE Why do you need it? How can it help you? INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
The netCDF-4 data model and format Russ Rew, UCAR Unidata NetCDF Workshop 25 October 2012.
Matlab Training Session 10: Loading Binary Data Course Website: Training Sessions.htm.
Integrated Grid workflow for mesoscale weather modeling and visualization Zhizhin, M., A. Polyakov, D. Medvedev, A. Poyda, S. Berezin Space Research Institute.
Deutscher Wetterdienst
Section 3.1: Number Representation Practice HW (not to hand in) From Barr Text p. 185 # 1-5.
Fundamentals of C and C++ Programming. EEL 3801 – Lotzi Bölöni Sub-Topics  Basic Program Structure  Variables - Types and Declarations  Basic Program.
Brief Overview: GRIB, HDF, WRF.nc
Marr CollegeHigher ComputingSlide 1 Higher Computing: COMPUTER SYSTEMS Part 1: Data Representation – 6 hours.
THREDDS Catalogs Ethan Davis UCAR/Unidata NASA ESDSWG Standards Process Group meeting, 17 July 2007.
GCSE ICT Storing data - Internal memory, backing storage, and measuring memory.
Eurasia Institute of Earth Sciences Istanbul Technical University NCL Introduction Deniz Bozkurt web.itu.edu.tr/bozkurtd ITU Eurasia Institute of Earth.
Ch Chapter 4 Basic Data Types and Variables 4.1 Basic Data Types In C TABLE 4.1 Introduction to Basic Data Types in C Type SizeDescription char 1.
The HDF Group Introduction to netCDF-4 Elena Pourmal The HDF Group 110/17/2015.
Introduction to Information and Computer Science
Data Storage System, Model Output and Analysis Tools PRECIS Workshop Tanzania Meteorological Agency 29 th June – 3 rd July 2015.
NetCDF-4: Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder
Table formats ArcGIS reads table in many formats: dBase format (shapefile) INFO format (workstation ArcInfo) Geodatabase table (MS Access in case of personal.
CMPS 1371 Introduction to Computing for Engineers CHARACTER STRINGS.
ASCII AND EBCDIC CODES By : madam aisha.
Information Coding Schemes Group Member : Yvonne Tiffany Jurifah bt Junaidi Clara Jane George.
WORLD METEOROLOGICAL ORGANIZATION RA-VI Regional Training on BUFR and Migration to Table Driven Code Forms Langen, Germany, April, 2007 General.
Reflections on the theme of classifying, documenting and exchanging meteorological data, and some additional comments on agro meteorological and biological.
Update on Unidata Technologies for Data Access Russ Rew
WMO GRIB Edition 3 Enrico Fucile Inter-Program Expert Team on Data Representation Maintenance and Monitoring IPET-DRMM Geneva, 30 May – 3 June 2016.
Unidata Infrastructure for Data Services Russ Rew GO-ESSP Workshop, LLNL
Chapter 3: Mastering Editors Chapter 3 Mastering Editors (Emacs)
1.4 Representation of data in computer systems Character.
Nat 4/5 Computing Science Data Representation Lesson 3: Storing Text
Nat 4/5 Computing Science Data Representation Lesson 3: Storing Text
Binary Representation in Text
Binary Representation in Text
(Network Common Data Form)
SRNWP Interoperability Workshop
Binary Universal Form Representation (BUFR) Paul Hamer November, 2009
Data Transfer ASCII FILES.
What is Binary? Binary is a two-digit (Base-2) numerical system, which computers use to process and store data. The reason computers use the binary system.
Other Kinds of Arrays Chapter 11
Data Encoding Characters.
Multidimensional Data and GIS
CEE 6440 GIS in Water Resources Fall 2004 Term Paper Presentation
NetCDF and Scientific Data Standard
Presenting information as bit patterns
Storing Information Each memory cell stores a set number of bits (usually 8 bits, or one byte) (byte addressable)
Learning Intention I will learn how computers store text.
Practical work on NetCDF - CFPOINT
NCL variable based on a netCDF variable model
C Language B. DHIVYA 17PCA140 II MCA.
Presentation transcript:

Climate Data Formats Deniz Bozkurt web.itu.edu.tr/bozkurtd ITU Eurasia Institute of Earth Sciences bozkurtd@itu.edu.tr The International Training Course on “Climate Analysis and Applications” Alanya 10-19 October 2011

Climate Datasets Datasets

Digital representation “11010001” Climate Datasets Raw data “abcd12345” Input system Digital representation “11010001”

Climate Datasets Modeled data: gridded multi-dimendisonal data Consisting of variables, dimensions and attributes Observations (station, reanalysis)

Climate Datasets Climate data can be stored as different formats in different platforms: Character format Packed binary format

Climate Data Formats - Character Format ASCII (American Standard Code for Information Interchange) It refers to a “text” file that is readable by the naked eye (it only contains the letters a-z, numbers, carriage returns, and punctuation marks. Each character is stored as digital = Binary 1001000 1100101 1101100 1101111 0101100 0100000 1110111 1100111 1110010 1100100 H e l o , w r d

Climate data formats – ASCII

Climate data formats – ASCII Factors restrict the use of ASCII: Weak data storage and huge data size: “9” 1 byte, “679.43” 6 byte, -0.123456E+05 13 byte No or not practical metadata

Climate data formats – Packed Binary A common data format for saving and storing of large size of the data “3.1” 24 bits (ASCII) can be packed as 5 bit Can be processed very quickly and effectively in data analysis

Climate data formats – Packed Binary

Some standard data formats NetCDF (network Common Data Form) Format Self-describing data format All the information about data is stored inside of the data No need for an additional information for the identification of the contents of the file Machine independent Can be readable by various applications NCL, IDL, GrADS, Matlab, Ferret, C, C++, Java, Fortran Komut satırı operatörler: NCO, CDO Most commonly used format in the oceanographic and atmospheric science for observational data and numerical modeling (IPCC datasets, NASA, NOAA, NCAR, UCAR…)

Climate data formats - NetCDF metadata dimensions variables Variable properties attributes

Climate data formats - NetCDF Data storage in NetCDF format 3D (x,y,t) or 4D (x,y,z,t)

Climate data formats - NetCDF Data storage in NetCDF format netcdf mynetcdf { dimesions: x=4; y=4; time=UNLIMITED; variables: float x(x); float y(y); int time(time); float temperature(time,x,y); data: x = 10, 20, 30, 40; y = 110, 120, 130, 140; time = 31, 59, 90; }

Climate data formats - NetCDF Data storage in NetCDF format netcdf mynetcdf { dimesions: x=4; y=4; time=UNLIMITED; variables: float x(x); float y(y); int time(time); float temperature(time,x,y); data: x = 10, 20, 30, 40; y = 110, 120, 130, 140; time = 31, 59, 90; Temperature = 111, 211, 311, 411; } time = 1 x = 1 to 4 y = 1

Climate data formats - NetCDF Data storage in NetCDF format netcdf mynetcdf { dimesions: x=4; y=4; time=UNLIMITED; variables: float x(x); float y(y); int time(time); float temperature(time,x,y); data: x = 10, 20, 30, 40; y = 110, 120, 130, 140; time = 31, 59, 90; Temperature = 111, 211, 311, 411, 121, 221, 321, 421, 131, 231, 331, 431, 141, 241, 341, 441; } time = 1 x = 1 to 4 y = 1 to 4

Climate data formats - NetCDF Data storage in NetCDF format netcdf mynetcdf { dimesions: x=4; y=4; time=UNLIMITED; variables: float x(x); float y(y); int time(time); float temperature(time,x,y); data: x = 10, 20, 30, 40; y = 110, 120, 130, 140; time = 31, 59, 90; Temperature = 111, 211, 311, 411, 121, 221, 321, 421, 131, 231, 331, 431, 141, 241, 341, 441, 112, 212, 312, 412, 122, 222, 322, 422, 132, 232, 332, 432, 142, 242, 342, 442, 113, 213, 313, 413, 123, 223, 323, 423, 133, 233, 433, 143, 243, 343, 443; } time = 1 to 3 x = 1 to 4 y = 1 to 4

Climate data formats - NetCDF 4D (x,y,z,t)

Climate data formats - NetCDF some NetCDF commands ncdump file_name | less, to see the entire contents of the file ncdump –h file_name | less, to see the front portion of the data ncdump –v var_name file_name | less, to see the specific data within the file ncl_filedump to see the contents of different data formats

Climate data formats - GRIB GRIB (GRIdded Binary) Format World Meteorological (WMO) standard, commonly used in meteorology Bit-oriented data format: very efficient for transmission/archival - netCDF (float) 2-4 times larger than corresponding GRIB two dimensional horizontal grids only - each grid has scale/offset for optimal precision requires external parameter table (s)

Climate data formats - GRIB Basically consists of: Product definition section(PDS) Creater of the data, unit of the data, time stamp, vertical system of the data Grid description section (GDS), optional Grid description, map projections Bit map section (BMS), optional Defines which grid points will be presented Binary data section (BDS) Contains the packed binary data End section

Climate data formats - GRIB Files are typically 1/2 to 1/3 of the size of normal binary files: Y: gridded data D: scale factor R: reference value E: binary scale factor

Climate data formats – ArcGIS Shapefile NetCDF data: Raster Table Feature Directly readable

Climate data formats – ArcGIS Shapefile

Climate data formats – ArcGIS Shapefile Raster Feature Table

Climate data formats – ArcGIS Shapefile

Climate data formats – ArcGIS Shapefile http://www.gisclimatechange.org/