Why Data Mining Research Does Not Contribute to Business? Mykola Pechenizkiy, Seppo Puuronen Department of Computer Science University of Jyväskylä Finland.

Slides:



Advertisements
Similar presentations
State Staff Development and Training Team January 2012.
Advertisements

Information Systems in Business
Using concept mapping to develop a career studies curriculum My objectives in this presentation are to: 1/ Share and explain a career studies concept map.
Introduction to Research Methodology
 delivers evidence that a solution developed achieves the purpose for which it was designed.  The purpose of evaluation is to demonstrate the utility,
Competitive advantage from Data Mining: some lessons learnt in the Information Systems field Mykola Pechenizkiy, Seppo Puuronen Department of Computer.
DECO3008 Design Computing Preparatory Honours Research KCDCC Mike Rosenman Rm 279
Search Engines and Information Retrieval
Knowledge Management Challenges in Knowledge Discovery Systems Mykola Pechenizkiy, Seppo Puuronen Department of Computer Science University of Jyväskylä.
ACM SAC’06, DM Track Dijon, France “The Impact of Sample Reduction on PCA-based Feature Extraction for Supervised Learning” by M. Pechenizkiy,
Research Methods for Business Students
Origins and Implications
IIBA Denver | may 20, 2015 | Kym Byron , MBA, CBAP, PMP, CSM, CSPO
Research problem, Purpose, question
University of Jyväskylä – Department of Mathematical Information Technology Computer Science Teacher Education ICNEE 2004 Topic Case Driven Approach for.
Certified Business Process Professional (CBPP®) Exam Overview
Program Improvement Committee Report Larry Caretto College Faculty Meeting December 3, 2004.
Computational Thinking Related Efforts. CS Principles – Big Ideas  Computing is a creative human activity that engenders innovation and promotes exploration.
Capstone Design Project (CDP) Civil Engineering Department First Semester 1431/1432 H 10/14/20091 King Saud University, Civil Engineering Department.
FLCC knows a lot about assessment – J will send examples
Chapter 6 View Alignment Techniques and Method Customization (Part I) Object-Oriented Technology From Diagram to Code with Visual Paradigm for UML Curtis.
 A set of objectives or student learning outcomes for a course or a set of courses.  Specifies the set of concepts and skills that the student must.
Day 1 Session 2/ Programme Objectives
Assistive Technology Clinical Outcomes Research Management System (AT-CORMS) Tool Utilizing the International Classification of Functioning (ICF) Cognitive.
Assessment of Higher Education Learning Outcomes (AHELO): Update Deborah Roseveare Head, Skills beyond School Division Directorate for Education OECD 31.
Developing an IS/IT Strategy
Project Management: Still More Art Than Science Presented By Donald W. Larson AC Bronze, CL June 6, 2007.
Ciarán O’Leary Wednesday, 23 rd September Ciarán O’Leary School of Computing, Dublin Institute of Technology, Kevin St Research Interests Distributed.
Asynchronous Discussions and Assessment in Online Learning Vonderwell, S., Liang, X., & Alderman, K. (2007). Asynchronous Discussions and Assessment in.
Search Engines and Information Retrieval Chapter 1.
#17 - Involve Users in the Development Model of Multinational Corporations - Is it worth it? Experience Report IRCSE '08: IDT Workshop Friday 31 October.
Applying creativity in CS high school education - criteria, teaching example and evaluation Romeike, R. (2007). Applying creativity in CS high school education.
Towards an activity-oriented and context-aware collaborative working environments Presented by: Ince T Wangsa Supervised by:
Design Science Method By Temtim Assefa.
Employability skills workshop This work has been produced on behalf of the National Quality Council with funding provided through the Australian Government.
Quality Management.  Quality management is becoming increasingly important to the leadership and management of all organisations. I  t is necessary.
EOPOWER impact assessment overview For assessment of earth observation products and services and promotion and dissemination activities in EOPOWER regions.
Systems Development AIMS 2710 R. Nakatsu. Overview Why do IT projects succeed and fail? Two philosophies of systems development –Systems Development Life.
Object-Oriented Software Engineering Practical Software Development using UML and Java Chapter 1: Software and Software Engineering.
NATURE OF OB Total System Approach Nature of Organisational behaviour
Copyright © 2003 Sherif Kamel Issues in Knowledge Management Dr Sherif Kamel The American University in Cairo.
The roots of innovation Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on:
1 Requirements Management - General concepts - Noureddine Abbadeni King Saud University College of Computer and Information Sciences Based on “Software.
Eloise Forster, Ed.D. Foundation for Educational Administration (FEA)
Slide 1.1 Saunders, Lewis and Thornhill, Research Methods for Business Students, 5 th Edition, © Mark Saunders, Philip Lewis and Adrian Thornhill 2009.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
The Evolution of ICT-Based Learning Environments: Which Perspectives for School of the Future? Reporter: Lee Chun-Yi Advisor: Chen Ming-Puu Bottino, R.
Chapter 4 Developing and Sustaining a Knowledge Culture
Chapter 4 Developing and Sustaining a Knowledge Culture
Most of contents are provided by the website Introduction TJTSD66: Advanced Topics in Social Media Dr.
27/3/2008 1/16 A FRAMEWORK FOR REQUIREMENTS ENGINEERING PROCESS DEVELOPMENT (FRERE) Dr. Li Jiang School of Computer Science The.
Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models.
MODEL-BASED SOFTWARE ARCHITECTURES.  Models of software are used in an increasing number of projects to handle the complexity of application domains.
Pertemuan 16 Materi : Buku Wajib & Sumber Materi :
Chapter 14: Affective Assessment
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
Knowledge is our most important engine of production – Alfred Marshal Knowledge is the key resource of the 21 st century Problem today is not how to find.
Increasing Rigor in the Classroom Natalie Redman.
Fifth Edition Mark Saunders, Philip Lewis and Adrian Thornhill 2009 Research Methods for Business Students.
Stages of Research and Development
Day 1 Session 2/ Programme Objectives
Research Methods for Business Students
Agency in educational robotics settings: a new design approach
Object-Oriented Software Engineering Using UML, Patterns, and Java,
Preface to the special issue on context-aware recommender systems
Research Methods in Computer Science
School of Information Management Nanjing University China
Meeting LIS Competences to Serve Inclusive Community through Curriculum: Case Study in LIS Study Program UIN Sunan Kalijaga Yogyakarta Indonesia Marwiyah.
Information Technology (IT)
PBL at Aalborg University
Presentation transcript:

Why Data Mining Research Does Not Contribute to Business? Mykola Pechenizkiy, Seppo Puuronen Department of Computer Science University of Jyväskylä Finland Alexey Tsymbal Department of Computer Science Trinity College Dublin Ireland DMBiz’05 Porto, Portugal October 3, 2005

2 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Outline Introduction and What is our message? Where we are? – rigor vs. relevance in DM Towards the new framework for DM research –DM System as adaptive Information System (IS) –DM research as IS Development: DM system as artefact –DM success model: success factors Further plans and Discussion

3 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Our Message DM is still a technology having great expectations to enable organizations to take more benefit of their huge databases. There exist some success stories where organizations have managed to have competitive advantage of DM. Still the strong focus of most DM-researchers in technology-oriented topics does not support expanding the scope in less rigorous but practically very relevant sub-areas. Research in the IS discipline has strong traditions to take into account human and organizational aspects of systems beside the technical ones.

4 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Our Message Currently the maturation of DM-supporting processes which would take into account human and organizational aspects is still living its childhood. DM community might benefit, at least from the practical point of view, looking at some other older sub-areas of IT having traditions to consider solution-driven concepts with a focus also on human and organizational aspects. The DM community by becoming more amenable to research results of the IS community might be able to increase its collective understanding of –how DM artifacts are developed – conceived, constructed, and implemented, –how DM artifacts are used, supported and evolved, –how DM artifacts impact and are impacted by the contexts in which they are embedded.

5 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Existing Frameworks for DM Theory-oriented –Databases; –Statistics; –Machine learning; –Data compression Process-oriented –Fayyad’s –CRISP-DM –Reinartz’s –Reductionist approach of viewing DM as statistics has advantages of the strong background, and easy- formulated problems. –The DM tasks concerning processes like clustering, regression and classification fit easily into these approaches. –More recent (process- oriented) frameworks address the issues related to a view of DM as a process, and its iterative and interactive nature

6 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Rigor and Relevance in DM Research Lin in Wu et al. notices that a new successful industry (as DM) can follow consecutive phases: 1.discovering a new idea, 2.ensuring its applicability, 3.producing small-scale systems to test the market, 4.better understanding of new technology and 5.producing a fully scaled system. At the present moment there are several dozens of DM systems, none of which can be compared to the scale of a DBMS system. –This fact indicates that we are still in the 3rd phase in the DM area!

7 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Rigor vs Relevance in DM Research

8 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Where is the focus? Still! … speeding-up, scaling-up, and increasing the accuracies of DM techniques. Piatetsky-Shapiro : “we see many papers proposing incremental refinements in association rules algorithms, but very few papers describing how the discovered association rules are used” Lin claims that the R&D goals of DM are quite different: –since research is knowledge-oriented while development is profit-oriented. –Thus, DM research is concentrated on the development of new algorithms or their enhancements, –but the DM developers in domain areas are aware of cost considerations: investment in research, product development, marketing, and product support. However, we believe that the study of the DM development and DM use processes is equally important as the technological aspects and therefore such research activities are likely to emerge within the DM field. Towards the new framework for DM research …

9 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal DMS in the Kernel of an Organization DM Task(s) DMS (Artifact) Organization Environment DM is fundamentally application-oriented area motivated by business and scientific needs to make sense of mountains of data. A DMS is generally used to support or do some task(s) by human beings in an organizational environment both having their desires related to DMS. Further, the organization has its own environment that has its own interest related to DMS, e.g. that privacy of people is not violated.

10 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal The ISs-based paradigm for DM Ives B., Hamilton S., Davis G. (1980). “A Framework for Research in Computer-based MIS” Management Science, 26(9), “Information systems are powerful instruments for organizational problem solving through formal information processing” Lyytinen, K., 1987, “Different perspectives on ISs: problems and solutions.” ACM Computing Surveys, 19(1), 5-46.

11 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal DM Artifact Development DM Artifact Development Experimentation Theory Building Observation Adapted from: Nunamaker, W., Chen, M., and Purdin, T , Systems development in information systems research, Journal of Management Information Systems, 7(3), A multimethodological approach to the construction of an artefact for DM

12 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal The Action Research and Design Science Approach to Artifact Creation Design Knowledge Awareness of business problem Action planning Action taking Conclusion Business Knowledge Artifact Development Artifact Evaluation Contextual Knowledge

13 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal DM Artifact Use: Success Model 1 of 3 System Quality Information Quality Use User Satisfaction Individual Impact Organizational Impact Service Quality Adapted from D&M IS Success Models

14 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal DM Artifact Use: Success Model 2 of 3 What are the key factors of successful use and impact of DMS both at the individual and organizational levels. 1.how the system is used, and also supported and evolved, and 2.how the system impacts and is impacted by the contexts in which it is embedded. Coppock: the failure factors of DM-related projects. have nothing to do with the skill of the modeler or the quality of data. But those do include: 1.persons in charge of the project did not formulate actionable insights, 2.the sponsors of the work did not communicate the insights derived to key constituents, 3.the results don't agree with institutional truths the leadership, communication skills and understanding of the culture of the organization are not less important than the traditionally emphasized technological job of turning data into insights

15 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal DM Artifact Use: Success Model 3 of 3 Hermiz communicated his beliefs that there are the four critical success factors for DM projects: (1) having a clearly articulated business problem that needs to be solved and for which DM is a proper tool; (2) insuring that the problem being pursued is supported by the right type of data of sufficient quality and in sufficient quantity for DM; (3) recognizing that DM is a process with many components and dependencies – the entire project cannot be "managed" in the traditional sense of the business word; (4) planning to learn from the DM process regardless of the outcome, and clearly understanding, that there is no guarantee that any given DM project will be successful.

16 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal New Research Framework for DM Research

17 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal New Research Framework for DM Research

18 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Further Work Definition of Relevance concept in DM research The revision of the book chapter Further work on the new framework for DM research Organization of Workshop/Working conf. or ST on –more social directions in DM research likely with one of the focuses on IS as a sister discipline. –SIAM DM 2006 Interests include Human Factors and Social Issues:  Ethics of Data  Mining Intellectual Ownership  Privacy Models  Privacy Preservation Techniques  Risk Analysis  User Interfaces  Data and Result Visualization

19 DMBiz’05 Porto, Portugal October 3, 2005 Why Data Mining Research Does Not Contribute to Business? by M. Pechenizkiy, S. Puuronen, A. Tsymbal Thank You! Book chapter draft is available on request from Mykola Pechenizkiy Department of Computer Science and Information Systems, University of Jyväskylä, FINLAND Tel.: Fax: Feedback is very welcome: Questions Suggestions Collaboration