Presentation is loading. Please wait.

Presentation is loading. Please wait.

Datawarehouse Workflow: ETLP Extract Transform LoadProvide Make user- friendly formats Dynamic database Charts & Maps Tools & websites Archive native formats.

Similar presentations


Presentation on theme: "Datawarehouse Workflow: ETLP Extract Transform LoadProvide Make user- friendly formats Dynamic database Charts & Maps Tools & websites Archive native formats."— Presentation transcript:

1 Datawarehouse Workflow: ETLP Extract Transform LoadProvide Make user- friendly formats Dynamic database Charts & Maps Tools & websites Archive native formats

2 Datawarehouse Workflow: ETLP Extract Transform LoadProvide tools models add meta information netCDF on web server transform to netCDF netCDF on OPeNDAP server data providers = data users data Make user- friendly formats Dynamic database Charts & Maps Tools & websites Archive native formats

3 DATA = RAW DATA + PROCESSING DATA = RAW DATA (volts) History will never change! one parameter at one place at one time PROCESSING Interpretation does change! e.g. instrument deterioriation, recalibration + NASA satellite data with open source SeaDas processing toollkit (in IDL) L0: dump of recorded voltages, only averaged over 16 pixels LAC: MLAC GAC L1: voltages + satellite track L2 ~ physical quantities L3 ~ binned in space (1 grid instead of zillions of warped photos) L4 ~ binned in time (climatology) Deltares Aukepc for flumes Stored raw data With calibration coefficients Allows for recalibration

4 Datawarehouse Workflow Extract Transform LoadProvide Subversion repository tools models add meta information netCDF on web server transform to netCDF netCDF on OPeNDAP server data data providers = data users Make user- friendly formats Dynamic database Charts & Maps Tools & websites Archive native formats

5 Programme today, and current session 1 3 D:\ tools models add meta information netCDF on web server transform to netCDF netCDF on OPeNDAP server data data providers = data users 2 Extract Transform LoadProvide Subversion repository

6 Repository username Get username and password. Why, OpenEarth is open, right? Yes, but closed community For best quality all actions are logged: Nothing can be lost, only temporarily disabled So anyone can be allowed to join Every file is logged … … and every line in every file is logged.

7 commit central database: repos.deltares.nl local copy D:\ E:\ F:\ REPOSITORY basics delete add copy update browse checkout

8 commit central database: repos.deltares.nl local copy D:\ E:\ F:\ REPOSITORY browse delete add copy update browse checkout

9 REPOSITORY browse

10 commit central database: repos.deltares.nl local copy D:\ E:\ F:\ REPOSITORY checkout delete add copy update browse checkout Not handy to get files one by one with browser Get them all at once with free program

11 REPOSITORY checkout Download and install Tortoise (http://tortoisesvn.net/) Make a checkout in e.g. F:\checkouts\ No need to back this up, it’s only a copy...

12 REPOSITORY checkout Copy url from browser (case sensitive!) Make sure that tree of local copy resembles server

13 commit central database: repos.deltares.nl local copy D:\ E:\ F:\ REPOSITORY commit delete add copy update browse checkout

14 REPOSITORY commit up to date modified

15 REPOSITORY commit

16 commit central database: repos.deltares.nl local copy D:\ E:\ F:\ REPOSITORY update delete add copy update browse checkout

17 REPOSITORY update

18

19 REPOSITORY statistics

20 commit central database: repos.deltares.nl local copy D:\ E:\ F:\ REPOSITORY add delete add copy update browse checkout

21 REPOSITORY add a raw dataset OpenEarthRawData is very big: don’t make a full checkout To add a thing, first make an empty checkout of the destination.

22 REPOSITORY add a raw dataset There are 2 copies of 1 file on your PC: Visible working copy, for editing Hidden shadow copy, to detect changes Before adding a file to the server, a shadow copy must be created. Allows for offline working

23 REPOSITORY add a raw dataset Now the addition must be simply be committed as any change

24 REPOSITORY add The repository is supposed to be working anytime Do not play with the actual repository All advanced users will by annoyed by this But then, how I can I learn how to work with it? Solution: use the sandbox Play around at the highest level as much as you like And clean up afterwards (delete) With your browser:

25 commit central database: repos.deltares.nl local copy D:\ E:\ F:\ REPOSITORY delete delete add copy update browse checkout

26 REPOSITORY add a raw dataset There are 2 copies of 1 file on your PC: Visible working copy, for editing Hidden shadow copy, to detect changes When deleting a file on the server, your shadow copy be informed Allows for working offline

27 REPOSITORY add a raw dataset Now the deletion must be simply be committed as any change

28 REPOSITORY delete Now delete the addition you made in And check the log file, to see what colleagues did.

29 commit central database: repos.deltares.nl local copy D:\ E:\ F:\ REPOSITORY copy delete add copy update browse checkout

30 REPOSITORY copy Again: first inform shadow copy locally, then commit to server … Drag with right-mouse button

31 OpenEarthRawData Raw data are stored under https://repos.deltares.nl/repos/OpenEarthRawData/trunk/ Data are stored with copyright holder as main directory. This allows copyright holders to maintain their own data copyright holders to shift easily from private to open source users to identify whom to acknowlegde Data should also contain dedicated processing scripts (if not in OpenEarthTools) url file to web source INSPIRE XML meta-data file

32 Summary: current session 1 3 D:\ tools models add meta information netCDF on web server transform to netCDF netCDF on OPeNDAP server data data providers = data users 2 Extract Transform LoadProvide Subversion repository

33 Next: use OpenEarthTools to make netCDF 1 3 D:\ tools models add meta information netCDF on web server transform to netCDF netCDF on OPeNDAP server data data providers = data users 2 Extract Transform LoadProvide Subversion repository


Download ppt "Datawarehouse Workflow: ETLP Extract Transform LoadProvide Make user- friendly formats Dynamic database Charts & Maps Tools & websites Archive native formats."

Similar presentations


Ads by Google