Author(s): Justin Joque License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike.

Slides:



Advertisements
Similar presentations
Author(s): Don M. Blumenthal, 2010 License: Unless otherwise noted, this material is made available under the terms of the Attribution – Non-commercial.
Advertisements

Author(s): Paul Conway, License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Michael Hortsch, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): John Doe, MD; Jane Doe, PhD, 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): John Doe, MD; Jane Doe, PhD, 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Templates for editing U-M OER Materials
Author(s): Paul Conway, License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Module: Leadership Training Workshop for Health Professionals Organization: East Africa HEALTH Alliance Author(s): Dr. Roy William Mayega, Resource.
Module: Leadership Training Workshop for Health Professionals Organization: East Africa HEALTH Alliance Author(s): Prof. John T. Kakitahi, Resource.
Project: Ghana Emergency Medicine Collaborative Document Title: Open Educational Resources Author(s): University of Michigan Department of Emergency Medicine.
Author(s): Bob Riddle, Kathleen Ludewig Omollo License: Unless otherwise noted, this material is made available under the terms of the Creative Commons.
Module: Public Health Disaster Planning for Districts Organization: East Africa HEALTH Alliance, Author(s): Dr. Roy William Mayega (Makerere.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author: Michael Jibson, M.D., Ph.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Share.
We have reviewed this material in accordance with U.S. Copyright Law and have tried to maximize your ability to use, share, and adapt it. The citation.
Author(s): MELO 3D Project Team, 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Steve Jackson, 2009 License: Unless otherwise noted, this material is made available under the terms of the Attribution - Noncommercial - Share.
Author(s): MELO 3D Project Team, 2011 License: This work is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. To view a.
Author(s): Justin Joque License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author(s): Joan Durrance, 2009 License: Unless otherwise noted, this material is made available under the terms of the Attribution - Non-commercial 3.0.
Author(s): Paul Conway, PhD, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author: Michael Jibson, M.D., Ph.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Share.
Author(s): Kate Saylor, 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author: John Williams, M.D., Ph.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author(s): Paul Conway, PhD, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Project: Ghana Emergency Medicine Collaborative Document Title: My Bougie and Me Author(s): Vijay Kairam (University of Utah), MD 2012 License: Unless.
Author(s): Gerald Abrams, M.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author: Michael Jibson, M.D., Ph.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Share.
Author(s): Michael Hortsch, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Michael Hortsch, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Paul Conway, PhD, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author(s): Brenda Gunderson, Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author: Michael Jibson, M.D., Ph.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Share.
Author: Michael Jibson, M.D., Ph.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Share.
Author(s): Michael Hortsch, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Don M. Blumenthal, 2010 License: Unless otherwise noted, this material is made available under the terms of the Attribution – Non-commercial.
Author(s): MELO 3D Project Team, 2011 License: This work is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. To view a.
Author(s): Don M. Blumenthal, 2010 License: Unless otherwise noted, this material is made available under the terms of the Attribution – Non-commercial.
Author(s): Steve Jackson, 2009 License: Unless otherwise noted, this material is made available under the terms of the Attribution - Noncommercial - Share.
Author(s): Vic Divecha, 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution-Non-commercial-Share.
Author(s): Lisa McLaughlin, 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution-ShareAlike.
Author(s): MELO 3D Project Team, 2011 License: This work is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. To view a.
Author(s): Gabriel Krieshok, Alex Pompe, 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons.
Author(s): MELO 3D Project Team, 2011 License: This work is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. To view a.
Author(s): MELO 3D Project Team, 2011 License: This work is licensed under the Creative Commons Attribution-ShareAlike 3.0 Unported License. To view a.
Author(s): Gerald Abrams, M.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Non-commercial–Share.
Author(s): Paul Conway, PhD, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Paul Conway, PhD, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
Author(s): Paul Conway Ph.D., 2011 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Noncommercial–Share.
Author(s): Paul Conway, PhD, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
1 Author(s): Andrew Rosenberg
Author: Michael Jibson, M.D., Ph.D., 2009
Author(s): Rajesh Mangrulkar, MD, 2009
Author(s): Paul Conway, PhD, 2010
Author: Michael Jibson, M.D., Ph.D., 2009
Attribution: University of Michigan Medical School, Department of Internal Medicine License: Unless otherwise noted, this material is made available under.
Author(s): Paul Conway, PhD, 2010
1 Author(s): Rebecca W. Van Dyke, M.D., 2012
Author(s): Joan Durrance, 2009
1 Author(s): Rebecca W. Van Dyke, M.D., 2012
Attribution: University of Michigan Medical School, Department of Microbiology and Immunology License: Unless otherwise noted, this material is made available.
Attribution: Department of Neurology, 2009
Author: Michael Jibson, M.D., Ph.D., 2009
Presentation transcript:

Author(s): Justin Joque License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License: We have reviewed this material in accordance with U.S. Copyright Law and have tried to maximize your ability to use, share, and adapt it. The citation key on the following slide provides information about how you may share and adapt this material. Copyright holders of content included in this material should contact with any questions, corrections, or clarification regarding the use of content. For more information about how to cite these materials visit Any medical information in this material is intended to inform and educate and is not a tool for self-diagnosis or a replacement for medical evaluation, advice, diagnosis or treatment by a healthcare professional. Please speak to your physician if you have questions about your medical condition. Viewer discretion is advised: Some medical content is graphic and may not be suitable for all viewers.

Attribution Key for more information see: Use + Share + Adapt Make Your Own Assessment Creative Commons – Attribution License Creative Commons – Attribution Share Alike License Creative Commons – Attribution Noncommercial License Creative Commons – Attribution Noncommercial Share Alike License GNU – Free Documentation License Creative Commons – Zero Waiver Public Domain – Ineligible: Works that are ineligible for copyright protection in the U.S. (17 USC § 102(b)) *laws in your jurisdiction may differ Public Domain – Expired: Works that are no longer protected due to an expired copyright term. Public Domain – Government: Works that are produced by the U.S. Government. (17 USC § 105) Public Domain – Self Dedicated: Works that a copyright holder has dedicated to the public domain. Fair Use: Use of works that is determined to be Fair consistent with the U.S. Copyright Act. (17 USC § 107) *laws in your jurisdiction may differ Our determination DOES NOT mean that all uses of this 3rd-party content are Fair Uses and we DO NOT guarantee that your use of the content is Fair. To use this content you should do your own independent analysis to determine whether or not your use will be Fair. { Content the copyright holder, author, or law permits you to use, share and adapt. } { Content Open.Michigan believes can be used, shared, and adapted because it is ineligible for copyright. } { Content Open.Michigan has used under a Fair Use determination. }

Data Analysis and Design Data Jam Justin Joque Fall 2012 This presentation lis licensed under a Creative Commons Attribution NonCommercial Share Alike 3.0 license. Copyight Justin Joque. Creative Commons Attribution NonCommercial Share Alike 3.0

Agenda Introduction Analysis Tools Data Design Jam/Discussion about Huron River Watershed Council Data

Needs Evaluation 3 Critical Components: 1.Collection Processes o Is the current process working? o Do additional components need to be added? 2.Data Organization o Does the organization facilitate moving data from collection to analysis? o This includes location, design and control o This is an area where you can have a big impact o The best solutions are often times the least complex 3.Data Analysis o What questions can you help answer? o What methods and tools can you recommend?

Analysis Tools Data Storage o SQL (MySQL, Access) o Google Docs o Excel Data Manipulation o R/SPSS/Stata o Excel o Text Editor o Perl/Python o Google Refine o Fusion Tables Data Analysis and Visualization o R/SPSS/Stata o Excel o ArcGIS o Perl/Python Remember: Focus on processes whenever possible

Data Design Controlling who and at what cost someone can update data (or the schema) can improve quality Document your design Try to make it as simple and useable as possible Design to move data from collection to analysis

Data Design - The Difficulty of Excel NamePet JohnCat AliceDog BobCat NamePetPet Name JohnCatWhiskers AliceDogSpot BobCat,DogMittens,Sparky

Data Design - The Difficulty of Excel NamePet JohnCat AliceDog BobCat NamePet1Pet Name1Pet2Pet Name2 JohnCatWhiskers AliceDogSpot BobCatMittensDogSparky

Data Design - The Difficulty of Excel NamePet JohnCat AliceDog BobCat NamePetPet Name JohnCatWhiskers AliceDogSpot BobDogSparky BobCatMittens

Data Design - Relational Databases NameID John1 Alice2 Bob3 Person_IDPetPet Name 1CatWhiskers 2DogSpot 3DogSparky 3CatMittens

Data Design - Flatness Relational Databases are Flat o This makes processing and analysing data significantly easier o Even if you are not using a relational database try to 'approximate flatness' o Stacking vertically can help o If you must stack within cells try to use as unlikely of a character as possible to separate data (i.e. use | instead of a commas since commas can show up in people's names).

Data Design - Uniqueness Relational Databases Rely on Unique Identifiers o These must be unique or they will not work o Breaking uniqueness often means manual cleanup or bad results o This can be emulated in Excel, but beware of formulas and sorting

Data Enterers Likelihood of Error or Willful Abandonment of the Planned Data Entry Method

Data Design - Authorities Authorities lists or controlled entry options can vastly improve data entry o This can be done through validation or forms for entry (google docs is pretty good at this. o Make sure to allow for updating an authorities list. In terms of time/effort, updating an authorities list should be costly but not prohibitively so o Using numbered scales or even just prior agreement can be beneficial (e.g. we will enter man/woman and not male/female). o Document these decisions in a place that is accessible and ideally co-located with the data/data entry point

Case Study - HRWC Site Survey Data Description: HRWC divided the county up into numbered bioreserves which were then combined (geographically) with parcels. They then use this information to try to contact property owners, get permission to survey the property, and then record the results. Problem: They have two different database, one for the addresses and contact information and one for the survey information. The two databases do not interact very well.

Case Study - HRWC Step 1: Try to figure out what is going on with the sample data. What do the IDs represent? Are the tables flat? Has uniqueness been maintained where necessary? Is the data clean? What questions would you ask the organization to further elucidate the current structure?

Case Study - HRWC Step 2: Try to think through how the tables could be rearranged and data entry could be better controlled. How would you transform the data to the new structure? Does the new structure lend itself to analysis? Does it maintain flatness and uniquness?

Case Study - HRWC Step 3: Try to think about what analyses could easily be mined from your new data structure. Does the new structure make it easier? Could you manage to achieve the same results with the old structure? What additional analyses might be helpful?