Presentation is loading. Please wait.

Presentation is loading. Please wait.

DATA SCIENCE MIS0855 | Spring 2016 Data Cleansing David Schuff

Similar presentations


Presentation on theme: "DATA SCIENCE MIS0855 | Spring 2016 Data Cleansing David Schuff"— Presentation transcript:

1 DATA SCIENCE MIS0855 | Spring 2016 Data Cleansing David Schuff David.Schuff@temple.edu http://community.mis.temple.edu/dschuff

2 Discuss (5 minutes) Have you fallen victim to any of Taber’s “stupid data corruption tricks?” From the readings, what are the best tips for cleaning data?

3 Cleaning Data Consider this Excel spreadsheet of sales in Pennsylvania, New Jersey, and Delaware for the years 2009 through 2013. Identify two problems with this data set.

4 And the problems show up during analysis… How do you find the “errors” and fix them?

5 The problem of outliers Do you correct this by… Removing the data point? Using the average of the other data points? Guessing at the right value? And is this an error or just an anomaly?


Download ppt "DATA SCIENCE MIS0855 | Spring 2016 Data Cleansing David Schuff"

Similar presentations


Ads by Google