Presentation is loading. Please wait.

Presentation is loading. Please wait.

Hackathon Challenge: (Semi-) Automating DNA Collection Sara Farmer Noah Hofmann-Smith Jonathan Undy.

Similar presentations


Presentation on theme: "Hackathon Challenge: (Semi-) Automating DNA Collection Sara Farmer Noah Hofmann-Smith Jonathan Undy."— Presentation transcript:

1 Hackathon Challenge: (Semi-) Automating DNA Collection Sara Farmer Noah Hofmann-Smith Jonathan Undy

2 Outline Need to assess country preparedness on onset of disaster QUICKLY. Lots of sources, but is not machine accessible.

3 Motivation Websites: Html, xls, csv, apis etc Template Creator Partially-filled indicators spreadsheet Researchers Completed indicators spreadsheet DNA Analyst

4 Outline 2 Process for automation: Scrape data from webpages Transform scraped data into CSV files Automatically load data from CSV files into standard Excel report Sara and team (partially completed already) Noah and Jonathan

5 Scraping data and CSV files (Sara)

6 Scrapers

7 CSV Data Files

8 Loading from CSV files to Excel (Noah & Jonathan) Challenges: Key indicators referred to differently by different sources Several years’ worth of data Countries not included in all datasets

9 Challenges going forward Improving data quality. (E.g. unpacking compound data items from the same field.) Continue to develop the standard list of indicators. “Close the loop”. Eliminate manual cleaning of the scraped data.


Download ppt "Hackathon Challenge: (Semi-) Automating DNA Collection Sara Farmer Noah Hofmann-Smith Jonathan Undy."

Similar presentations


Ads by Google