Data Science for MyFamilySearch.org and FamilyTree DNA Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

Presentation transcript:

Data Science for MyFamilySearch.org and FamilyTree DNA Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community February 16,

Introduction
Welcome:
– Federal Big Data Working Group Meetup
– Virginia Big Data Meetup
– Lotico Northern Virginia Semantic Web
– Other?
Data Science for the National Big Data R and D Initiative, February 2, 2015:
– NITRD Big Data Chronology (2012-present) and NITRD-GU Big Data Workshop: Dr. Moore agrees with IBM Watson that human curation is generally underappreciated and is the secret sauce in Big Data successes.
– Wendy Wigen’s Slides: Summary of RFIs and Dr. Sudarsan Rachuri, NIST, Smart Manufacturing Systems Design and Analysis.
– Calvin Andrus, CIA (Data Science: An Introduction) would “like to see more science in data science.”
– Jim Burke: Conference call-in and online slides were very useful. I appreciate the extra mile efforts, and great, informative conversations.

Federal Big Data Working Group Meetup
– Federal: Supports the Federal Big Data Initiative, but not endorsed by the Federal Government or its Agencies;
– Big Data: Supports the Federal Digital Government Strategy, which is "treating all content as data", so big data = all your content;
– Working Group: Data Science Teams composed of Federal Government and Non-Federal Government experts producing big data products (see Possible Team Presentations below); and
– Meetup: The world's largest network of local groups to revitalize local community and help people around the world self-organize, like the MOOCs (Massive Open Online Courses) being considered by the White House.

The Profit and Data Enterprises
Marcus Lemonis (born November 16, 1973) is a Lebanese-born American businessman, investor, television personality and philanthropist. He is currently the chairman and CEO of Camping World and Good Sam Enterprises, and the star of The Profit, a CNBC reality show about saving small businesses through People, Process, and Products.
– us_Lemonis
The Federal Big Data Working Group Meetup is also about helping government agencies develop:
– People: Data Scientists
– Process: Data Infrastructure
– Products: Data Publications
Some examples:
– EPA
– FDA
– NOAA
– HHS
– Eastern Foundry
And provide MOOCs (Massive Open Online Courses) for training and networking.

Calendar
NITRD FASTER Bigdata at NSF, February 17, 2015:
– Dr. McHenry will discuss Brown Dog: A search engine for the other 99 percent (of data). Brown Dog seeks to develop a service that will make un-curated data accessible to scientists.
Mission Source Consulting Launch Party, February 28:
– Steven M. Hanmer, 12:00 PM to 4:00 PM, Eastern Foundry, 2011 Crystal Drive, Suite 400,
Data Science for Big Data Application and Analytics MOOC, March 2,
th Annual Government Big Data Forum, March 12, 2015
USDA CIO and ACDO on Open Data Plan and Roundtable, March 16, 2015
Government Technology & Innovation Incubator for Big Data Analytics II, TBA. Week of March 23, Need Sponsor
Data Science for HealthData.gov Developers & Family Caregivers, April 6, 2015
The Wharton DC Alumni Innovation Summit, April 28-29, 2015
Data Science for Natural Medicines and Epigenetics (in planning), May 4,

Agenda
6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring). Story and Slides for RootsTech 2015 Developer Challenge: Big Data from Everywhere for Families and Community Service, February 12–14, 2015 in Salt Lake City, Utah
:10 p.m. Brief Member Introductions
7:15 p.m. Data Science for MyFamilySearch.org: Story, Slides, and Tutorial
7:45 p.m. National Geographic Genographic Project and Big Data, Syed Ali, Data Scientist, Analytics Led Intelligence. Slides. See FamilyTree DNA and National Geographic Genographic DNA test for deep ancestry
8:30 p.m. Open Discussion
8:45 p.m. Networking
9:00 p.m. Depart

Overview
January 13, 2015: FamilySearch Launches New App Gallery (more than 50 apps)
February 12–14, 2015: RootsTech 2015 Developer Challenge in Salt Lake City, Utah
My Entry: Big Data from Everywhere for Families and Community Service
My Partner Work: Data Science for MyFamilySearch.org
Syed Ali’s App: National Geographic Genographic Project and Big Data
You could be a partner and develop apps (e.g., A Billion Person Family Tree with MongoDB by Randall Wilson, Family Tree of Data: Provenance and Neo4, etc.)

FamilySearch.org
“FamilySearch is a great resource, but FamilySearch alone can’t do everything. That is why we work with partners to provide complementary tools and resources and why the FamilySearch App Gallery is so important,” said Dennis Brimhall, FamilySearch CEO. “We’ve had partners for many years, and now we want to make it easier for our patrons to know about them and to find the apps they need.”

MyTableBox of MyFamily Tree

Person Template for Brand Lee Niemann

Mini-Tutorial: Sony Camcorder and Camtasia Video to YouTube Video
How is the data collected?
– Sony Camcorder and PowerPoint Slides.
Where is the data stored?
– Hard drive and DVD in MP4 format.
What are the results?
– MP4 files converted and uploaded to YouTube.
Why should we believe the results?
– Because I and others have done it successfully many times.

Data Science for Natural Medicines
YouTube