Presentation is loading. Please wait.

Presentation is loading. Please wait.

Business Intelligence and Analytics: Overview and Examples Dr. Hsinchun Chen Director, Artificial Intelligence Lab, University of Arizona

Similar presentations


Presentation on theme: "Business Intelligence and Analytics: Overview and Examples Dr. Hsinchun Chen Director, Artificial Intelligence Lab, University of Arizona"— Presentation transcript:

1 Business Intelligence and Analytics: Overview and Examples Dr. Hsinchun Chen Director, Artificial Intelligence Lab, University of Arizona hchen@eller.arizona.edu http://ai.arizona.edu;

2 The Data Deluge (The Economists, March 2010); internet traffic 667 Exabytes by 2013, Cisco; Total amount of information in 2010, 1.2 Zettabyte (KB-MB-GB-TB-PB-EB-ZB- YB) BIG DATA  BIG COMPUTATION  BIG ANALYTICS  BIG (SOCIETAL) IMPACT $3B BI revenue in 2009 (Gartner, 2006); $9.4B BI software M&A spending in 2010 and $14.1B by 2014 (Forrester) IBM spent $14B in BI in five years; $9B BI revenue in 2010 (USA Today, November 2010); 24 acquisitions, 10,000 BI software developers, 8,000 BI consultants, 200 BI mathematicians  IBM acquired I2/COPLINK in 2011 BI & Analytics: The Field

3 BI & Analytics: Definition and Components BI and Analytics refers to: (1) the technologies, systems, practices and applications that (2) analyze critical business data to (3) help an enterprise better understand its business and market.” Core technologies: data warehousing, Extraction, Transformation, and Load (ETL); Business Performance Management (BPM), visual dashboards; enterprise text and multimedia search; data and text mining, social network analysis BI 2.0 research: web analytics, web 2.0, social media analytics, opinion mining; in-memory and real-time BI; cloud computing, data/web services; Hadoop, MapReduce; stream and mobile data mining

4 BI Industry and Capabilities (Garter Report, 2011) Magic Quadrant for BI Platforms (13 Capabilities) Integration (e.g., Microsoft, Oracle, SAP) BI (shared) infrastructure Metadata management Development tools, collaboration Information Delivery (e.g., SAP, Microsoft, IBM/Cognos) Reporting, dashboards Ad hoc query Microsoft Office integration Search-based BI (structured and unstructured) Analysis (e.g., IBM/SPSS, SAS) OLAP Interactive visualization Predictive modeling and data mining Scorecards 4

5 Magic Quadrant for Business Intelligence Platforms

6 Hype Cycle for Business Intelligence, 2011

7 BI Hype Cycle (Garter Report, 2011) On the Rise Collaborative decision making Information semantic services Search-based data discovery tools Natural language question answering At the Peak Enterprise metadata repositories BI SaaS Visualization-based data discovery tools Mobile BI In-memory DMBS Sliding into the Trough Real-time decisoning Analytics, content analytics, in-memory analytics, text analytics Open-source BI tools Interactive visualization 7

8 BI Hype Cycle (Cont’d) Climbing the Slope BI consulting and system integration Business activity monitoring Column-based DBMS Dashboards, data quality tools Predictive analytics Excel as a BI front end Entering the Plateau BI platforms Data-mining workbenchs 8

9 Sample BI Applications (AI Lab) Security informatics Securing cyber space, cyber security, predicting Arab Spring Information and system security, enterprise risk management Market intelligence Data/text/web mining, web 2.0, social media analytics Big data (volume/variety/velocity/mobility), Hadoop, Cloud apps Healthcare informatics Healthcare IT integration and solutions, decision support EHR data/text mining, patient empowerment and social media 9

10 10 (1) BI for Security: COPLINK

11 COPLINK Identity Resolution and Criminal Network Analysis 11

12 (2) BI for Market Intelligence (AZ BizIntel) Mass media, social media contents Text & social media analytics techniques Finance/accounting/marketing models (Tetlock/Columbia, Antweiler/UBC, Das/Santa Clara)  NYU (Dhar), Arizona (Dhaliwal, Kelly, Jiang, Lusch, Yong), National Taiwan U (Li, Hong, Lu) Bag of words, named entities, proper nouns, topics (1, 2-, 3- grams) Sentiment/valence, lexicons, machine learning, stakeholder analysis, EFLS analysis Time series models, spike detection, decaying function, trading windows, targeted sentiment Econometrics/regression models (R-sqr, p-value), 10-fold validation (F, accuracy), simulated trading (cost, frequency, exit)

13 13 Predefined Data Sources Data Sources for US Public Companies SEC/EdgarNYSE.com NASDAQ.com Finance.Yahoo.com Company Information Database Ticker CUSIP CIK PERMNO Company Keywords Company Name Dynamic Data Sources BlogsNews Search Engines WSJTwitter BasicInformation Yahoo Finance Forums Company Websites Stock Exchange 10K Report Data Collection DataProcessing Transformation/Integration Topics & Sentiments Time Series / Burst Risk ModelSNA Analysis Analytic Approaches Finance/Econ models and metrics Cross Media Analysis Single Media Analysis Predicting Markets AZ BIZ INTEL System Design Visualization Static Figures/Dashboards Interactive Applications Simulated Trading

14 (3) BI for Healthcare: AZ Smart Health 14

15 AZ Smart Health Research Healthcare Decision Support  Symptom-Disease-Treatment Extraction for Medical Knowledge Re-use  Scenario-based Association Rule Mining and Result Validation for Effective Healthcare  Outcome Assessment and Medication Compliance to Signify Quality of Care  Temporal Episodes and Disease Progression Modeling for Better Patient Condition Assessment  Patients-Like-You-and-Me EHR Search Interface to Accelerate Clinical Decision Making Patient-centered Smart Health  Personalized Healthcare for Chronic and Family Diseases Management  Long Term Medication Effects to Improve New Drug Development  Public Health Modeling and Monitoring for Government Agencies  Patient Social Media to Empower Patients and Improve Self Care at Home Healthcare Business Analytics  Cost Modeling and Containment  Improving Rate Calculation for the National Health Insurance  Competency and Performance Benchmarking  Quality-based Insurance Reimbursement  Workflow Planning and Coordination for Inter- and Intra- Hospital Process

16 ARM in Medicine: Symptoms, Diseases, and Treatments

17 Patient Statistics: Breast Cancer

18 Consistency of Top Treatment Orders Department 03: General Surgery; Department BD: Gastrointestinal surgery Age group 4: 15 to 24; Age group 5: 25 to 44; Age group 6: 45 to 64; Age group 7: > 65 Cooccurred Diagnosis 196.3: Secondary and unspecified malignant neoplasm of lymph nodes; Cooccurred Diagnosis 198.5: Secondary malignant neoplasm of bone and bone marrow Top 20 treatments from aggregated population PhysicianDepartmentAge GroupCooccurred Diagnosis M1130M1529M1540M158503BD4567196.3198.5 1 Exemestane (Aromasin) ( 諾曼癌素 ) VV VV V 2 Her-2/neu 螢光原位雜交法 (Her-2/neu FISH) VV V VV 3 Trastuzumab (Herceptin) ( 賀癌平 ) VVVVVV VVVVV 4 Anastrozole (Anazo) ( 安納柔 ) V V V V 5 Zoledronic acid (Zometa) ( 卓古祂 ) VVVVVV VVV V 6 Pegylated liposomal doxorubicin (Caelyx) ( 康利斯微脂利 ) VV VVV VVV V 7 Radical mastectomy-unilateral ( 乳癌根除術- 單側 ) VVVVVV VVV 8 Tamoxifen citrate ( 得適 ) VVV VVV 9 Docetaxel (Taxotere) ( 剋癌易 ) VVVVVV VVV V 10 Cyclophosphamide (Endoxan-Asta) ( 癌得星 ) VVVVVVVVVV V 11 Vinorelbine (Navelbine) ( 溫諾平 ) VV VVV VV V 12 Docetaxel (Taxotere) ( 剋癌易 ) VVVVVV VVV V 13 Epirubicin HCl (Pharmorubicin RD) ( 泛艾黴素 ) VVVVVV VVV V 14 Epirubicin (Pharmorubicin) ( " 速溶 " 泛艾黴素 ) VVVVVV VVV 15 CA-153 tumor marker (CA-153 腫瘤標記 ) VVV VV VVV V 16 Epirubicin (Pharmorubicin) ( " 速溶 " 泛艾黴素 ) V V V VVVVV 17 Methotrexate sodium inj (Amethopterin) ( 滅殺除癌 ) V V V 18 Dissection of axillary lymphatics ( 腋窩淋巴腺清除術 ) V V VVV 19 Breast tumor biopsy ( 乳房腫瘤組織檢查切片術 ) V V 20 Intravenous chemotherapy 4-8 hours ( 靜脈化學藥物注射 4-8 小時 ) VVVV VVV V

19 Treatment Comparison Among Different Physicians DOCTOR_NO=M1130DOCTOR_NO=M1529DOCTOR_NO=M1540 1 Caelyx 20mg/10ml/vial ( 康利斯微脂利 )Methotrexate Inj 50mg/2ml ( 滅殺除癌 ) Zometa Powder For Solution For Infusion 4mg/vial ( 卓古 祂 ) 2 Aromasin S.C. Tablets 25mg ( 諾曼癌素 )Gemzar 200mg/vial ( 健擇 )Anazo F.C. Tablets ( 安納柔 ) 3 Navelbine 10mg/1ml/vial ( 溫諾平 ) Zometa Powder For Solution For Infusion 4mg/vial ( 卓古 祂 )Taxotere 20mg/0.5ml/vial ( 剋癌易 ) 4 Intravenous chemotherapy <1 hours ( 靜脈化學藥物 注射 ) FORMOXOL 30mg/5ml/vial ( 伏摩素 )Taxotere 80mg/2ml/vial ( 剋癌易 ) 5 Herceptin 440mg/20ml/vial ( 賀癌平 )Aromasin S.C. Tablets 25mg ( 諾曼癌素 )Herceptin 440mg/20ml/vial ( 賀癌平 ) 6 Zometa Powder For Solution For Infusion 4mg/vial ( 卓古 祂 )Herceptin 440mg/20ml/vial ( 賀癌平 ) Radical mastectomy-unilateral ( 乳癌根除術- 單側 ) 7 FORMOXOL 30mg/5ml/vial ( 伏摩素 )Navelbine 10mg/1ml/vial ( 溫諾平 ) Granocyte 100ug/vial ( 顆球諾得 ) 8 CA-153 tumor marker (CA-153 腫瘤標記 ) Endoxan-Asta Injection 200mg/vial( 癌得星 ) Sentinel lymphadenectomy ( 腋窩淋巴腺清除術 ) 9 Abitrexate 50mg/2ml/vial ( 必除癌 )Caelyx 20mg/10ml/vial ( 康利斯微脂利 ) CA-153 tumor marker (CA-153 腫瘤標記 ) 10 Taxotere 80mg/2ml/vial ( 剋癌易 )Taxotere 20mg/0.5ml/vial ( 剋癌易 )Endoxan-Asta Injection 200mg/vial( 癌得星 ) 11 Taxotere 20mg/0.5ml/vial ( 剋癌易 )Taxotere 80mg/2ml/vial ( 剋癌易 )Pharmorubicin Rapid Dissolation 10mg ( " 速溶 " 泛艾黴素 ) 12 Endoxan-Asta Injection 200mg/vial( 癌得星 ) CA-153 tumor marker (CA-153 腫瘤標記 ) Whole body bone scan ( 全身骨骼掃描 ) 13 Pharmorubicin Rapid Dissolation 10mg ( " 速溶 " 泛艾黴素 ) Radical mastectomy-unilateral ( 乳癌根除術- 單側 ) Pharmorubicin 10mg/vial ( " 速溶 " 泛艾黴素 ) 14 Pharmorubicin RD 50mg/vial ( 泛艾黴素 ) Intravenous chemotherapy 1-4 hours ( 靜脈化學藥 物注射 ) Simulation procedure ( 模擬定位攝影 ) 15 Radical mastectomy-unilateral ( 乳癌根除術- 單側 ) Intravenous chemotherapy 4-8 hours ( 靜脈化學藥 物注射 ) Pharmorubicin RD 50mg/vial ( 泛艾黴素 ) 16 Pharmorubicin 10mg/vial ( " 速溶 " 泛艾黴素 )Rasitol Tablets 40mg (Furosemide) ( 來喜妥 ) Breast tumor biopsy examination ( 乳房腫瘤組織檢查切 片術 ) 17 Gemzar 200mg/vial ( 健擇 )Emetrol Tablets 10mg (Domperidone) ( 愈吐寧 ) Intravenous chemotherapy 4-8 hours ( 靜脈化學藥物 注射 ) 18 Intravenous chemotherapy 4-8 hours ( 靜脈化學藥 物注射 ) Pharmorubicin RD 50mg/vial ( 泛艾黴素 ) Intravenous chemotherapy 1-4 hours ( 靜脈化學藥物 注射 ) 19 Intravenous chemotherapy 1-4 hours ( 靜脈化學藥 物注射 ) Pharmorubicin 10mg/vial ( " 速溶 " 泛艾黴素 ) Vascular exploration ( 血管探查 ) 20 Neurotin Tablets 600mg ( 鎮頑癲 )Sodium chloride injection ( 氯化鈉注射液 ) Fixed mold-large ( 固定模具之設計及製作 - 大 )

20 Treatment Comparison Among Different Patient Age Groups Anazo F.C. Tablets ( 安納柔 ) is a treatment for advanced breast cancer in postmenopausal women (advanced age). Abitrexate ( 必除癌 ) is a drug in the FDA pregnancy risk categories, which has proven to cause fetal risks and abnormalities. Therefore, it is less likely to be prescribed for patients in young age group=5 (i.e., age 25 to 44) Age group=5Age group=6Age group=7 1 Caelyx 20mg/10ml/vial ( 康利斯微脂利 ) Her-2/neu 螢光原位雜交法 (Her-2/neu FISH) Anazo F.C. Tablets ( 安納柔 ) 2 Herceptin 440mg/20ml/vial ( 賀癌平 )Aromasin S.C. Tablets 25mg ( 諾曼癌素 )Abitrexate 50mg/2ml/vial ( 必除癌 ) 3 Zometa Powder For Solution For Infusion 4mg/vial ( 卓古祂 )Herceptin 440mg/20ml/vial ( 賀癌平 ) Radical mastectomy-unilateral ( 乳癌根除術- 單側 ) 4 Pharmorubicin 10mg/vial ( " 速溶 " 泛艾黴素 )Zometa Powder For Solution For Infusion 4mg/vial ( 卓古祂 )Herceptin 440mg/20ml/vial ( 賀癌平 ) 5 Taxotere 80mg/2ml/vial ( 剋癌易 )Navelbine 10mg/1ml/vial ( 溫諾平 ) Sentinel lymphadenectomy ( 腋窩淋巴腺清除術 ) 6 Taxotere 20mg/0.5ml/vial ( 剋癌易 )Caelyx 20mg/10ml/vial ( 康利斯微脂利 )Tadex 10mg/tab ( 得適 ) 7 Navelbine 10mg/1ml/vial ( 溫諾平 ) Radical mastectomy-unilateral ( 乳癌根除術- 單側 ) Zometa Powder For Solution For Infusion 4mg/vial ( 卓古祂 ) 8 Pharmorubicin RD 50mg/vial ( 泛艾黴素 )Tadex 10mg/tab ( 得適 )Caelyx 20mg/10ml/vial ( 康利斯微脂利 ) 9 Endoxan-Asta Injection 200mg/vial ( 癌得星 )Taxotere 80mg/2ml/vial ( 剋癌易 )Endoxan-Asta Injection 200mg/vial( 癌得星 ) 10 Tadex 10mg/tab ( 得適 )Endoxan-Asta Injection 200mg/vial( 癌得星 ) CA-153 tumor marker (CA-153 腫瘤標記 ) 11 Xeloda Tablets 500mg ( 結瘤達 ) Breast tumor biopsy ( 乳房腫瘤組織檢查切片術 ) Pharmorubicin Rapid Dissolation 10mg ( " 速溶 " 泛艾黴素 ) 12 Radical mastectomy-unilateral ( 乳癌根除術- 單側 ) Taxotere 20mg/0.5ml/vial ( 剋癌易 ) Partial mastectomy-unilateral ( 部份乳癌根除術- 單側 ) 13 Pharmorubicin Rapid Dissolation 10mg ( " 速溶 " 泛艾黴素 ) CA-153 tumor marker (CA-153 腫瘤標記 ) Granocyte 100ug/vial ( 顆球諾得 ) 14 CA-153 tumor marker (CA-153 腫瘤標記 ) Pharmorubicin RD 50mg/vial ( 泛艾黴素 )Taxotere 20mg/0.5ml/vial ( 剋癌易 ) 15 Intravenous chemotherapy 4-8 hours ( 靜脈化學藥物注射 ) Pharmorubicin 10mg/vial ( " 速溶 " 泛艾黴素 )Pharmorubicin RD 50mg/vial ( 泛艾黴素 ) 16 Granocyte 100ug/vial ( 顆球諾得 )Abitrexate 50mg/2ml/vial ( 必除癌 )Taxotere 80mg/2ml/vial ( 剋癌易 ) 17 Methotrexate Inj 50mg/2ml ( 滅殺除癌 ) Sentinel lymphadenectomy ( 腋窩淋巴腺清除術 ) Pharmorubicin 10mg/vial ( " 速溶 " 泛艾黴素 ) 18 Intravenous chemotherapy 1-4 hours ( 靜脈化學藥物注射 ) Pharmorubicin Rapid Dissolation 10mg ( " 速溶 " 泛艾黴素 ) 19 Gemzar 200mg/vial ( 健擇 ) Intravenous chemotherapy 1-4 hours ( 靜脈化學藥物注射 ) 20 FORMOXOL 30mg/5ml/vial ( 伏摩素 ) Compensator design and production ( 補償器之設計及製 作 )

21 Cancer Community Mapping: Text Mining & Visualization for Documents and Patient Forums 21 A Brain Neoplasms article about toddlers Meningeal Neoplasms and Brain Diaseases subtopics Breast cancer patient forum messages Red Blood Cell and Lymph Nodes subtopics 4(c): A Chinese SOM Map about Colon Cancer 4(d): A Tag Cloud for PLM Breast Cancer Patients

22 BI & Analytics Research Opportunities and Challenges Opportunities: BIG DATA  BIG COMPUTATION  BIG ANALYTICS  BIG (SOCIETAL) IMPCTS (NAE Grand Challenges: security, healthcare) Challenges: data deluge (TB/PB)  data variety (numbers, text, multilingual, multimedia)  data velocity (mobile, streaming)  data organization & access (DBMS, Hadoop, IR, image, mobile)  data analytics (statistical analysis, data/text/web mining) 22

23 Training the New “Data Scientists”: Core Knowledge B-School (Management Information Systems): economics/finance/accounting/marketing, statistical analysis/modeling, organizational/behavioral  business knowledge; statistics C-School (Computer Science): programming language, data structure & algorithm, database management system, artificial intelligence, networking, data mining, web computing & mining  computational techniques I-School (Information/Library Science): information organization, information retrieval, information visualization, NLP, text mining, HCI  information processing 23


Download ppt "Business Intelligence and Analytics: Overview and Examples Dr. Hsinchun Chen Director, Artificial Intelligence Lab, University of Arizona"

Similar presentations


Ads by Google