Presentation is loading. Please wait.

Presentation is loading. Please wait.

DIS Working Group Report LBA Science Steering Committee Meeting Campinas – SP - Brazil May 19-20, 2007 Luiz Horta (CPTEC/INPE)

Similar presentations


Presentation on theme: "DIS Working Group Report LBA Science Steering Committee Meeting Campinas – SP - Brazil May 19-20, 2007 Luiz Horta (CPTEC/INPE)"— Presentation transcript:

1 DIS Working Group Report LBA Science Steering Committee Meeting Campinas – SP - Brazil May 19-20, 2007 Luiz Horta (CPTEC/INPE)

2 Topics Data Registration / Archive Status Data Registration status for EU and BR teams System Updates Report on 21 th SSC Recommendations Miscellaneous BiblioPac Overview

3 LBA Data Registration Status

4 LBA Overall Status: Metadata Registered in Beija-flor N=801

5 LBA Overall Status: Data Volume Archived at LBA DIS

6 Metadata Registration Status by Component N=48 N=77 N=612 N=64 There are still data sets with unavailable or restricted data and several BR and BR-EU teams have only posters registered -- no data.

7 Data Progress toward Long-Term Archive November 2007 – May 2008 Data Maturity Preliminary Documented Final, QA’d Archive-ready Project Office responsibility Investigator responsibility (including documentation)

8 Data Set Documentation Documentation is part of the LBA metadata file for each data set Investigator uses the LBA Metadata Editor (LME) to add this information to existing metadata Documentation fields: Data Set Overview Data Characteristics Data Application and Derivation Quality Assessment Data Acquisition Materials and Methods References Metadata (w/ documentation) is exported as a Data Set User’s Guide and placed online for download along with data

9 Data Documentation Status - November 2007 - Number of documentation fields completed in metadata

10 Data Documentation Status - May 2008 - Number of documentation fields completed in metadata Should documentation be an LBA priority as well ? LBA-ECO data sets are moving from here (undocumented) …to here, (documented)

11 Why the increase in documented LBA- ECO data sets? Diane: “Don’t make me come up there!” (Sept. ’07) Hired another “data chaser” on Aug. 07 (Megan Mcgroddy) –Now a staff of two Additional data chaser has enabled us to provide even more one-on-one assistance to data providers in archive preparation –Documentation support –Data reformatting / reorganization

12 Data Registration & Archive Ongoing tasks …. Continue the effort to get data into the archive and corresponding metadata registered Many early LBA projects (BR and BR/EU teams) have still not delivered data to LBA DIS. Email announcement was sent out to LBA community requesting each to review their contributions. There may not be any way to identify all data that should be delivered to the archive: LBA-ECO has used publications as the guide and this method has proved somewhat successful. Continue to identify and repair broken data links in the metadata

13 Data Registration & Archive Ongoing tasks …. Continue the effort to prepare data for long-term archive LBA-ECO data sets are becoming final; fully quality- assured data is replacing preliminary data; and the data are being reformatted and documented for archive at the ORNL DAAC. These final data sets & documentation will be provided to LBA DIS as well. This process requires close coordination between LBA and LBA-ECO DIS staff to ensure that the two archives are in sync, and involves many “data housekeeping” chores to ensure that the most recent versions of data are made available. And of course, all of the data are being backed up regularly, per archive protocols.

14 Data Registration & Archive Ongoing tasks, cont. Increase in the number of data sets in the archive process is mainly due to increased investment in data ‘chasing’ and requires a dedicated level of support for this task. Peter’s comment about LBA-ECO involvement in data chasing activities. Peter, please stand up and talk if you have anything to say about this topic.

15 Number of files in LBA DIS Archive, by disk area TOTAL Nov 2005: 349.664 TOTAL May 2006: 361.766 TOTAL Apr 2007: 384.415 TOTAL Nov 2007: 498.283 TOTAL May 2008: 499,356 Updated May 13, 2008 Increase in number of data sets archived -- > Over 1.3 Million Files (Data + System files)

16 System Updates We have been backing up ALL LBA data on a regular basis (using an external disk drive array with 2 Tera bytes in Raid-5). Followed short range solution recommended from the 20 th SSC meeting: –SHORT TERM solution implemented: purchased (2 units) of 2 Terabyte of disk for data backup only. Pursuing THE ideal solution – “DAAC” Brazil: –LONG TERM solution pending: purchase another system to act as a server backup (data files + Beija-flor/LME backups) for INPA and CPTEC.

17 Short Term Solution implemented Nov 2007 at CPTEC (2 Tera bytes in RAID-5)

18 Short Term Solution implemented Nov 2007 at Central Office (2 Tera bytes in RAID-5)

19 System Updates (cont) The LBA server was moved to a dedicated internet link (155 Mbps) from Cachoeira Paulista to Rio de Janeiro. If anything goes wrong, the connection can be redirected to use the standard link (CP to SP). Net gain: more bandwidth (speed) and reliability. We have started creating a development system (using a laptop) with the new version of Beija-flor and Metadata System. Benefits: –Free software: no licenses to pay (ex: Lucene indexer instead of Blueangel  savings of $11K / year) –Platform independent (i.e., Windows, Linux, etc); –Enhanced search capabilities and new options; –Enhanced metadata reports page; –Map oriented to LBA study areas; etc. etc.

20 Miscellaneous (news FYI) APLBA initiated a process to import equipment free of custom taxes using a CNPq legislation related to import of equipment for scientific and technological research (CNPq legislation number 8.010 of March 29,1990)

21 Challenges to maintain the LBA Server up and running … DIS personnel called on Saturday, May 17 to solve problem related to unauthorized access to LBA server. Log from access: 11:48:13 202.181.206.50 - 150.163.158.28 80 POST /_vti_bin/_vti_aut/author.dll - 200 lba.cptec.inpe.br core-project/1.0

22 Hack attempt trace route Hong Kong (Greenpeace Org.) OS Unpatched vulnerability exploit.

23 Message to all LBA participants requesting data.

24 Positive response to message requesting data.

25 Report from 21 st SSC recommendations Recommendation # 3: The SSC recommends to the LBA Central Office the replacement of the computers used as servers by the LBA DIS to ensure data safety and system redundancy. Report from DIS: work in progress, waiting for funds to purchase the equipment.

26 Towards DAAC in Brazil based on Industry strength Data Server * HP Server DL380G5 * 2 Intel QUAD CORE E5335; * 4 GB of Memory PC2-5300; * 2 146GB 10K SAS 2.5 Hot Plug Hard; * 1 Slim 8X/24X DVD-ROM; * Redundant Fan and Power Supply; * Storage MSA20 * StorageWorks MSA20 * 12 HP 750GB 7.2k HP SATA 1 ====  9 TERA BYTES SCALABLE SYSTEM * Smart Array 6404/256 Controller 21th SSC recommentation ! (i.e., server replacement)

27 Bibliopac Overview (Publications Database at INPA)

28 BiblioPac : Publications Database at INPA How to access it: http://mapara1:inpa.gov.br/bibliopac.htm, select option “Todas as bases de dados/All Databases” or select any other option available. In this case, we selected “Teses e dissertações”http://mapara1:inpa.gov.br/bibliopac.htm

29 The system will display a screen with several search options. Type the appropriate field and hit the search button. In this example, we search a specific author. If you don’t know for sure the name of the author, just type part of his/her name.

30 The system allows also to retrieve all LBA publications with data in this database. To get these results, choose in INDICE option ‘setorial’ and use keyword “LBA” and then hit the PESQUISAR button.

31 Now the system will show the result with all bibliographic information. In this example the results of the search were a dissertation which can be downloaded. to start the download process, you just click on “Clique aqui e acesse o texto completo”

32 Publications Database at INPA Data Inventory

33

34

35

36 Description of work done.

37

38 Data available for download.

39 Bibliopac Final Considerations We are in the fourth year of this Project and considering that the difficulties found are quite relevant, the results show that the volume of the bibliographical production of LBA made available is in constant growth. In this phase we are insisting much more so that LBA students explore information made available by this Project. To integrate LBA students in the LBA program is a great challenge but we continue working to achieve this goal.

40 The End !


Download ppt "DIS Working Group Report LBA Science Steering Committee Meeting Campinas – SP - Brazil May 19-20, 2007 Luiz Horta (CPTEC/INPE)"

Similar presentations


Ads by Google