Presentation is loading. Please wait.

Presentation is loading. Please wait.

ACIS 1504 - Introduction to Data Analytics & Business Intelligence Text Mining Data Cleaning.

Similar presentations


Presentation on theme: "ACIS 1504 - Introduction to Data Analytics & Business Intelligence Text Mining Data Cleaning."— Presentation transcript:

1 ACIS 1504 - Introduction to Data Analytics & Business Intelligence Text Mining Data Cleaning

2 Concept Map Text Mining Implementation Mixed Cell References Design: Accuracy Random Search, Left, Right, Mid, Len, & Paste Values

3 Objectives Define Text Mining Demonstrate Excel features that support text mining.

4 Segment A: Text Mining

5 Text Analytics / Text Mining Software that searches vast amounts of textual data (unstructured) identifying patterns.

6 Nestle Nestle processes Social Media http://uk.reuters.com/article/video/idUKBRE89P07Q20121 026?videoId=238680321

7 Segment B: Text Functions

8 Text Mining Search Parse Concatenate SEARCH LEFT, MID, RIGHT, LEN &

9 Name Example Open Grades Textfile.xlsx. Divide Last Name, First Name into two separate columns. 1.Locate the comma (SEARCH) 2.Extract all characters to left of comma (LEFT) 3.Locate end of full name (LEN) 4.Extract almost all characters between comma and end of name (RIGHT)

10 SEARCH Function

11 LEFT Function

12 LEN or Length Function

13 RIGHT Function

14 MID Function Extract the first initial of first name.

15 Concatenate Combine First Name, space and Last Name. & is the concatenate symbol Quotes are required around constant strings of text

16 Student ID Example Extract each student’s PID from their email address. Create a new student identifier by combining the first three letters of the last name with the last four digits of the student ID number.

17 Segment C: Data Cleaning & Generation

18 Data Cleaning Delete Unnecessary Columns & Rows Resize Columns Format Numeric Values Separate Distinct Values Shorten Lengthy Values Data Validation for Future Entries Generate Values

19 Favorite Pie Example

20 1.Ensure pie flavor data is consistent. 2.Replace confidential clicker ID # with randomly generated 6 digit number. 3.Ensure new ID number is static and unique.

21 Favorite Pie Example OriginalSortedConsistent

22 Random Number Functions =RAND() =RANDBETWEEN(low#, high#)

23 Paste Special - Values MAC: Edit Menu, Paste Special

24 Exam Feedback Example Open Exam Feedback.xlsx


Download ppt "ACIS 1504 - Introduction to Data Analytics & Business Intelligence Text Mining Data Cleaning."

Similar presentations


Ads by Google