Clustering and Research Works Dr. Bernard Chen Ph.D. University of Central Arkansas.

Slides:



Advertisements
Similar presentations
Interdisciplinary Research: Opportunities and Challenges
Advertisements

Dr Linda Allin Division of Sport Sciences The value of real life evaluation research for student learning and employability in Sports Development.
Assessment of Undergraduate Programs Neeraj Mittal Department of Computer Science The University of Texas at Dallas.
MoHealthWINs MoHealthWINs Open Learning Initiative Co-Development Project October 31, 2013.
School of Electrical Engineering & Computer Science
MULTIMEDIA Development Team.
Research Topics Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
What is Software Engineering? And why is it so hard?
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
1 Writing Undergraduate Programme Outcomes Dr Ciara O’Farrell.
Building Knowledge-Driven DSS and Mining Data
Science and Engineering Practices
IT Job Roles Task 20. Software Engineer Job Description Software engineers are responsible for creating and maintaining software of various different.
Lean Six Sigma Black Belt Blended Learning Program Course Description Blended Learning FLEXIBLE: Class sessions can be 100% online or augmented with live.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Database Systems: Design, Implementation, and Management Ninth Edition
Information Technology
MoHealthWINs MoHealthWINs Open Learning Initiative Co-Development Project October 31, 2013.
Wine Informatics Dr. Bernard Chen Ph.D. University of Central Arkansas.
Taylor Trayner. Definition  Set of business processes developed in an organization to create, store, transfer, and apply knowledge  Knowledge is a firm.
Database Design - Lecture 1
Nurjana Technologies Company Presentation. Nurjana Technologies (NT) is a small business enterprise founded in 2012 and operating in Aerospace and Defence.
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design.
SOL Changes and Preparation A parent presentation.
UNIVERSITY OF SOUTH CAROLINA Department of Computer Science and Engineering CSCE 190 Careers in Computer Science, Computer Engineering, and Computer Information.
Automata, Computability, and Complexity Lecture 1 Section 0.1 Wed, Aug 22, 2007.
Classifying Attributes with Game- theoretic Rough Sets Nouman Azam and JingTao Yao Department of Computer Science University of Regina CANADA S4S 0A2
Protein Local 3D Structure Prediction by Super Granule Support Vector Machines (Super GSVM) Dr. Bernard Chen Assistant Professor Department of Computer.
Patterns of Event Causality Suggest More Effective Corrective Actions Abstract: The Occurrence Reporting and Processing System (ORPS) has used a consistent.
My Research Work and Clustering Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2010.
IT job research By Megan McGonigle Sources: - responsibilites-explainedhttp://targetcourses.co.uk/study-areas/computer-science-and-it/it-job-roles-and-
The CRISP Data Mining Process. August 28, 2004Data Mining2 The Data Mining Process Business understanding Data evaluation Data preparation Modeling Evaluation.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Chapter 2 Database System Concepts and Architecture Dr. Bernard Chen Ph.D. University of Central Arkansas.
Adding Quantitative Reasoning to Your Course Some Ideas and Places to Begin.
CSE 102 Introduction to Computer Engineering What is Computer Engineering?
CS 127 Introduction to Computer Science. What is a computer?  “A machine that stores and manipulates information under the control of a changeable program”
GEM: The GAAIN Entity Mapper Naveen Ashish, Peehoo Dewan, Jose-Luis Ambite and Arthur W. Toga USC Stevens Neuroimaging and Informatics Institute Keck School.
Software Engineering Chapter: Computer Aided Software Engineering 1 Chapter : Computer Aided Software Engineering.
Unit Summary  During this Unit of study the Music Theory students will study the historical facts and compositional techniques associated with Baroque.
COMPUTER SCIENCE Computer science (CS) is The systematic study of algorithmic.
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Networks of Public Accounts Committees: Approaches to Capacity Building Mitchell O’Brien Governance Specialist Team Lead – Parliamentary Strengthening.
OMIS 694, Big Data Analytics
You will provide oversight, leadership and direction to a group of IT professionals responsible for architecting, implementing and supporting a broad range.
Common Core State Standards in English/Language Arts What science teachers need to know.
Essay Questions. Two Main Purposes for essay questions 1. to assess students' understanding of and ability to think with subject matter content. 2. to.
Writing a Science or Engineering Paper: It is just a story Frank Shipman Department of Computer Science Texas A&M University.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Of An Expert System.  Introduction  What is AI?  Intelligent in Human & Machine? What is Expert System? How are Expert System used? Elements of ES.
Impact of the New ASA Undergraduate Curriculum Guidelines on the Hiring of Future Undergraduates Robert Vierkant Mayo Clinic, Rochester, MN.
Teaching slides Chapter 1. Chapter 1: Introduction Introduction Components of a computer Building the software products What is software engineering?
S1-1 ADM730, Section 1, September 2005 Copyright  2005 MSC.Software Corporation SECTION 1 INTRODUCTION Open Retracted - Bad Retracted - Good.
System A system is a set of elements and relationships which are different from relationships of the set or its elements to other elements or sets.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 3 Basic Data Mining Techniques Jason C. H. Chen, Ph.D. Professor of MIS School of Business.
Waqas Haider Khan Bangyal. Organization of the Lecture Research and Methodology: Research defined and described Some classifications of research Define.
1 Prepared by: Laila al-Hasan. 1. Definition of research 2. Characteristics of research 3. Types of research 4. Objectives 5. Inquiry mode 2 Prepared.
Pengenalan Ilmu Komputasi. Computational Science??
1. ABSTRACT Information access through Internet provides intruders various ways of attacking a computer system. Establishment of a safe and strong network.
Instructional Computer Instructional Computer TECH2111 Dr. Alaa Sadik Instructional & Learning Technologies Department
Clouds , Grids and Clusters
Chapter 2 Database System Concepts and Architecture
Software Engineering Development of procedures and systematic applications that are used on electronic machines. Software engineering incorporates various.
The new Professional Leadership Body: supporting advanced and specialist practice Dr Catherine Duggan.
MBI 630: Systems Analysis and Design
Gavin Brown Pro-Vice-Chancellor for Education 20th January 2017
An Introduction to Software Engineering
OMIS 665, Big Data Analytics
Measurement What is it and why do it? 2/23/2019
NextGen STEM Teacher Preparation in WA State
Presentation transcript:

Clustering and Research Works Dr. Bernard Chen Ph.D. University of Central Arkansas

Outline Clustering Data Science Future Works

Clustering Algorithms There are two clustering algorithms we used in our approach: K-means Clustering Fuzzy C-means Clustering

K-means Clustering

Fuzzy C-means Clustering

Real World example

Outline Clustering Data Science Future Works

Data Science wikipedia Data science is the study of the generalizable extraction of knowledge from data.knowledgedata It incorporates varying elements and builds on techniques and theories from many fields wikipedia

Outline Clustering Data Science Future Works

Data Science wikipedia A practitioner of data science is called a data scientist. Data scientists solve complex data problems through employing deep expertise in some scientific discipline. It is generally expected that data scientists are able to work with various elements

Data Science wikipedia Good data scientists are able to apply their skills to achieve a broad spectrum of end results. the ability to find and interpret rich data sources, manage large amounts of data despite hardware, software and bandwidth constraints, merge data sources together, ensure consistency of data-sets, create visualizations to aid in understanding data, build mathematical models using the data, present and communicate the data insights/findings to specialists and scientists in their team and if required to a naive audience.

Outline Clustering Data Science Future Works

Data Science in WINE Once viewed as a luxury good, nowadays wine is increasingly enjoyed by a wider range of consumers. Wine certification is generally assessed by physicochemical and sensory tests

sensory tests Example: Chateau Latour Latour-2010/wine/110508/detail.aspx

sensory tests Among those expert reviews, we use “Wine Spectator’s” version "Unbelievably pure, with distilled cassis and plum fruit that cuts a very precise path, while embers of anise, violet and black cherry configure form a gorgeous backdrop. A bedrock of graphite structure should help this outlive other 2010s. Powerful, sleek and incredibly long. Not perfect, but very close. Best from 2020 through 2050." 99 Points Wine Spectator

sensory tests Wine Spectator has the following advantages: Words are precise Well-known Famous for it’s Top 100 wine of the year selection Well maintained database

Research Topic 1 Clustering on past 10 years Top 100 wine (1000 wines) Challenges: Extract attributes from 1000 wine Clustering algorithm Analysis of the results

Research Topic 2 Multi-label (4 classes) Classification on 1000 wines, which composed of 250 wines for 4 category (95+, 90~94, 89~85, 85-) Challenges: Classification algorithm 4 classes How to improve accuracy

Research Topic 3 Association Rules on region-specific dataset (such as Napa) for attribute correlation and quality prediction. Challenges: Association Rules algorithm Analysis of the results How to improve accuracy

Research Topic 4 Region Prediction (such as France vs Italy), open for association rules or classification algorithms. Challenges: More free-style (more suitable for experienced researchers) Not only focus on accuracy, but also try to tell the difference between the regions

Research Topic 5 Clustering + Classification for higher accuracy prediction. Challenges: TWO type of algorithms More complex in understanding and coding

Research Topic 6 Multi-label research: since we have multiple reviews available, how to use those information for data science research? Challenges: Very flexible!!!