Presentation is loading. Please wait.

Presentation is loading. Please wait.

Finding Information A337/A523. What are some of the possible problems with finding information?

Similar presentations


Presentation on theme: "Finding Information A337/A523. What are some of the possible problems with finding information?"— Presentation transcript:

1 Finding Information A337/A523

2 What are some of the possible problems with finding information?

3  Information is often lacks STRUCTURE  ASSOCIATION between the identifying information (i.e., labels and the actual information is not always obvious) and the data  CONSISTENCY is not always present. E.g.,  317-274-0185  (317)274-0185  3172740185  May later need to MANIPULATE data (filter, sorting, etc.)

4 Typical “Office” Applications  Word Processing  Spreadsheet  Database Management System (DBMS)

5 Spreadsheets and DBMSes  Columns (labels)  Rows (“instance” or record)  Intersection (value)  Information often lacks STRUCTURE  ASSOCIATION between the identifying information (i.e., labels and the actual information) is not always obvious  CONSISTENCY is not always present. E.g.,  317-274-0185  (317)274-0185  3172740185  May later need to MANIPULATE data (deeper search, sorting, etc.)

6 Spreadsheets Tables in MS Excel  Information often lacks STRUCTURE  ASSOCIATION between the identifying information (i.e., labels and the actual information) is not always obvious  CONSISTENCY is not always present. E.g.,  317-274-0185  (317)274-0185  3172740185  May later need to MANIPULATE data (deeper search, sorting, etc.)

7 DBMSes Tables in MS Access  Table is one of many objects in a database  Easier to associate tables than in a spreadsheet (i.e., vlookup)  Tables have several unique properties we’ll discuss later  Information often lacks STRUCTURE  ASSOCIATION between the identifying information (i.e., labels and the actual information) is not always obvious  CONSISTENCY is not always present. E.g.,  317-274-0185  (317)274-0185  3172740185  May later need to MANIPULATE data (deeper search, sorting, etc.)

8 ERP Systems Centralized database eliminates the need to associated data located on separate systems  Information often lacks STRUCTURE  ASSOCIATION between the identifying information (i.e., labels and the actual information) is not always obvious  CONSISTENCY is not always present. E.g.,  317-274-0185  (317)274-0185  3172740185  May later need to MANIPULATE data (deeper search, sorting, etc.)

9 Data Quality: What is Dirty Data?  It happens when the UPC code on a package doesn't match the item.  Causes? Vendor-Unique product code and cost Retailer-Unique product code and price

10 Data Quality: What is Dirty Data? Potential Problems?  Inventory Reorder  Profit per unit  Net profit  Customer Satisfaction  Repeat Business  Angry Bloggers Solution: Same code for vendor and retailer Data Integrity: Wal-Mart's Dirty Secret

11 Extract, Transform, Load (ETL) From Computerworld QuickStudy

12


Download ppt "Finding Information A337/A523. What are some of the possible problems with finding information?"

Similar presentations


Ads by Google