Review of Coursera Data Analysis Course Jim Thompson


1 Review of Coursera Data Analysis Course Jim Thompson JamesThompsonC@gmail.com

2 To make sense of my comments… Who is the reviewer? What is a MOOC? Overview of the course (through this reviewer's eyes)

3 The Reviewer (Who am I?) Not a professional data analyst: Chemist by training. Develop and commercialize new materials and applications by profession. Not a data analysis layman: Data analysis as a hobby, on and off for 25 years. Downloaded R in Jan 2009, used ever since. “Data Analysts Captivated by R’s Power”, The New York Times, January 2009 http://www.nytimes.com/2009/01/07/technology/business-computing/07program.html?pagewanted=all

4 How I taught myself R: Whatever caught my fancy at the moment. No mentor or colleague. Books (> 10 on R), Internet articles, R vignettes. Learning by doing, mainly on work data, for fun rather than for work. Because it was a hobby, I lacked discipline in: clean code, reporting, reproducible research, appropriate use of statistical techniques.

6 I tried Open University: Excellent teachers. One-hour-long lectures. Some class homework provided. No grading. Complete at your own pace. Intro to Programming, Stanford

8 “The Year of the MOOC”, The New York Times [1] A massive open online course (MOOC) is … aimed at large-scale interactive participation and open access via the web. [2] www.Udacity.com www.edX.org www.Coursera.org [1] http://www.nytimes.com/2012/11/04/education/edlife/massive-open-online-courses-are-multiplying-at-a-rapid-pace.html?pagewanted=all&_r=0 [2] http://en.wikipedia.org/wiki/Massive_open_online_course

9 Data Analysis by Jeffrey Leek An applied statistics course focusing on data analysis, not mathematical details. How to: organize and perform analysis, interpret results, diagnose potential problems, write up data analyses. Statistical methods:

11 How does this work? Time-bound (i.e., 6 weeks). Plan on 3-10 hrs/week. Watch three to five videos a week, 10-15 min long. Weekly quizzes. Submit two papers/reports. Slides, video, R code available for download. A certificate.

13 Structure the analysis: Tips on finding, organizing, and cleaning the data and the code. Very useful. Week 1 Week 2

14 Exploratory & Inferential: Clustering for exploratory analysis Week 3 Week 4
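For readers unfamiliar with clustering as an exploratory step, here is a minimal k-means sketch. It is written in Python rather than the course's R, it is not taken from the course materials, and the six points and the seeding rule are made up purely for illustration.

```python
import math

def kmeans(points, k, iterations=20):
    """Plain Lloyd's k-means on a list of coordinate tuples.

    Returns (centers, labels). Assumption for this sketch: the initial
    centers are k points taken at even intervals from the input list.
    """
    step = len(points) // k
    centers = [points[i * step] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest center.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centers[c]))
            for p in points
        ]
        # Update step: each center moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(
                    sum(coord) / len(members) for coord in zip(*members)
                )
    return centers, labels

# Made-up toy data: two visibly separate groups of points.
points = [(1.0, 1.1), (0.9, 0.8), (1.2, 1.0),
          (5.0, 5.2), (5.1, 4.9), (4.8, 5.0)]
centers, labels = kmeans(points, k=2)
print(labels)
```

In an exploratory setting the cluster labels are a prompt for further questions about the data, not a conclusion in themselves.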

15 Inferential & Predictive Analysis: Learned new techniques and best practices. Week 5 Week 6

16 Advanced Techniques: Good stuff, but I was running out of gas. Week 5

17 Submit Two Reports 1. Inference analysis of mortgage data: “This analysis considers whether any other variables have an important association with interest rate after taking into account the applicant's FICO score” 2. Predictive modeling using sensors on cell phones: “Given the output Samsung phone, can we predict whether the owner is sitting, laying, standing, walking, walking up stairs, or walking down stairs.”
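The question in the first report ("any other variables ... after taking into account the applicant's FICO score") can be sketched as a partial regression. The Python below is only an illustration under stated assumptions, not the course's R analysis: the data are synthetic, and the names `fico`, `amount`, and `rate` and all coefficients are hypothetical. It removes the linear effect of FICO from both the outcome and the candidate variable, then regresses residual on residual, which gives the same slope as a multiple regression with both predictors.

```python
import random

def simple_ols(x, y):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

def residuals(x, y):
    """Residuals of y after a simple linear fit on x."""
    intercept, slope = simple_ols(x, y)
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]

def adjusted_slope(control, predictor, outcome):
    """Slope of outcome on predictor after removing the linear effect
    of control from both (partial regression)."""
    return simple_ols(residuals(control, predictor),
                      residuals(control, outcome))[1]

# Synthetic data, invented for this sketch only.
rng = random.Random(0)
fico = [rng.uniform(640, 800) for _ in range(500)]
amount = [rng.uniform(1, 35) for _ in range(500)]  # hypothetical, in $1000s
rate = [30 - 0.025 * f + 0.1 * a + rng.gauss(0, 0.5)
        for f, a in zip(fico, amount)]

fico_slope = simple_ols(fico, rate)[1]
extra = adjusted_slope(fico, amount, rate)
print(f"rate vs FICO slope: {fico_slope:.4f}")
print(f"rate vs amount slope, adjusted for FICO: {extra:.4f}")
```

A real report would also give a measure of uncertainty for each estimate, as the course rubric asks.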

19 Data analysis rubric Main text
- Does the analysis have an introduction, methods, analysis, and conclusions?
- Are figures labeled and referred to by number in the text?
- Is the analysis written in grammatically correct English?
- Are the names of variables reported in plain language, rather than in coded names?
- Does the analysis report the number of samples?
- Does the analysis report any missing data or other unusual features?
- Does the analysis include a discussion of potential confounders?
- Are the statistical models appropriately applied?
- Are estimates reported with appropriate units and measures of uncertainty?
- Are estimators/predictions appropriately interpreted?
- Does the analysis make concrete conclusions?
- Does the analysis specify potential problems with the conclusions?

20 Data analysis rubric
Figure
- Is the figure caption descriptive enough to stand alone?
- Does the figure focus on a key issue in the processing/modeling of the data?
- Are axes labeled and are the labels large enough to read?
References
- Does the analysis include references for the statistical methods used?
R script
- Can the analysis be reproduced with the code provided?

21 Final comments On MOOCs: Thumbs up! On Data Analysis by Jeffrey Leek: Thumbs up! Target audience: I might be the sweet spot. Excellent reference (links attached). On submitting reports: Learned most by writing the reports and grading others'. NOTE: Intro to R course scheduled for September 2013

22 Data Analysis by Jeffrey Leek The Class: https://www.coursera.org/course/dataanalysis https://github.com/jtleek/dataanalysis The Prof: http://www.biostat.jhsph.edu/~jleek/ http://simplystatistics.org/

23 MOOC
