Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.

Slides:



Advertisements
Similar presentations
Computer Science Department FMIPA IPB 2003 Neural Computing Yeni Herdiyeni Computer Science Dept. FMIPA IPB.
Advertisements

Kostas Kontogiannis E&CE
Machine Learning: Connectionist McCulloch-Pitts Neuron Perceptrons Multilayer Networks Support Vector Machines Feedback Networks Hopfield Networks.
Machine Learning Neural Networks
Decision Support Systems
Supervised learning 1.Early learning algorithms 2.First order gradient methods 3.Second order gradient methods.
1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.
Neural Networks. R & G Chapter Feed-Forward Neural Networks otherwise known as The Multi-layer Perceptron or The Back-Propagation Neural Network.
Neural Networks Chapter Feed-Forward Neural Networks.
Artificial Neural Networks (ANNs)
Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.
Introduction to Directed Data Mining: Neural Networks
CHAPTER 12 ADVANCED INTELLIGENT SYSTEMS © 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang.
Soft Computing Colloquium 2 Selection of neural network, Hybrid neural networks.
MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way
Presentation on Neural Networks.. Basics Of Neural Networks Neural networks refers to a connectionist model that simulates the biophysical information.
Business Intelligence
Artificial Neural Networks (ANN). Output Y is 1 if at least two of the three inputs are equal to 1.
Artificial Neural Networks
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
IE 585 Introduction to Neural Networks. 2 Modeling Continuum Unarticulated Wisdom Articulated Qualitative Models Theoretic (First Principles) Models Empirical.
Chapter 9 Neural Network.
Chapter 3 Neural Network Xiu-jun GONG (Ph. D) School of Computer Science and Technology, Tianjin University
11 CSE 4705 Artificial Intelligence Jinbo Bi Department of Computer Science & Engineering
1 Chapter 6: Artificial Neural Networks Part 2 of 3 (Sections 6.4 – 6.6) Asst. Prof. Dr. Sukanya Pongsuparb Dr. Srisupa Palakvangsa Na Ayudhya Dr. Benjarath.
Outline What Neural Networks are and why they are desirable Historical background Applications Strengths neural networks and advantages Status N.N and.
NEURAL NETWORKS FOR DATA MINING
Chapter 7 Neural Networks in Data Mining Automatic Model Building (Machine Learning) Artificial Intelligence.
Artificial Intelligence Techniques Multilayer Perceptrons.
Artificial Neural Networks. The Brain How do brains work? How do human brains differ from that of other animals? Can we base models of artificial intelligence.
1 Chapter 11 Neural Networks. 2 Chapter 11 Contents (1) l Biological Neurons l Artificial Neurons l Perceptrons l Multilayer Neural Networks l Backpropagation.
Chapter 6: Techniques for Predictive Modeling
Artificial Neural Networks An Introduction. What is a Neural Network? A human Brain A porpoise brain The brain in a living creature A computer program.
1 Introduction to Neural Networks And Their Applications.
Neural Networks II By Jinhwa Kim. 2 Neural Computing is a problem solving methodology that attempts to mimic how human brain function Artificial Neural.
METU Informatics Institute Min720 Pattern Classification with Bio-Medical Applications Part 8: Neural Networks.
Chapter 6: Artificial Neural Networks for Data Mining
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 12-1 Chapter 12 Advanced Intelligent Systems.
NEURAL NETWORKS FOR DATA MINING
Neural Network Basics Anns are analytical systems that address problems whose solutions have not been explicitly formulated Structure in which multiple.
CHAPTER 15 Neural Computing: The Basics. n Artificial Neural Networks (ANN) n Mimics How Our Brain Works n Machine Learning.
1 Lecture 6 Neural Network Training. 2 Neural Network Training Network training is basic to establishing the functional relationship between the inputs.
Neural Networks Demystified by Louise Francis Francis Analytics and Actuarial Data Mining, Inc.
Neural Networks Presented by M. Abbasi Course lecturer: Dr.Tohidkhah.
Neural Networks Teacher: Elena Marchiori R4.47 Assistant: Kees Jong S2.22
Chapter 8: Adaptive Networks
Back-propagation network (BPN) Student : Dah-Sheng Lee Professor: Hahn-Ming Lee Date:20 September 2003.
Artificial Neural Networks (ANN). Artificial Neural Networks First proposed in 1940s as an attempt to simulate the human brain’s cognitive learning processes.
Chapter 6 Neural Network.
Artificial Neural Networks for Data Mining. Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-2 Learning Objectives Understand the.
Artificial Neural Networks for Data Mining. Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-2 Learning Objectives Understand the.
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.
Supervised Learning – Network is presented with the input and the desired output. – Uses a set of inputs for which the desired outputs results / classes.
Data Mining: Concepts and Techniques1 Prediction Prediction vs. classification Classification predicts categorical class label Prediction predicts continuous-valued.
1 Neural Networks MUMT 611 Philippe Zaborowski April 2005.
INTRODUCTION TO NEURAL NETWORKS 2 A new sort of computer What are (everyday) computer systems good at... and not so good at? Good at..Not so good at..
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining.
Introduction to Artificial Neural Network Session 1
Artificial neural networks
Learning in Neural Networks
Neural Computing: The Basics
Neural Nets for Data Mining
Introduction to Neural Networks And Their Applications
CSE P573 Applications of Artificial Intelligence Neural Networks
Prof. Carolina Ruiz Department of Computer Science
Chapter 12 Advanced Intelligent Systems
Artificial Intelligence Methods
CSE 573 Introduction to Artificial Intelligence Neural Networks
Lecture Notes for Chapter 4 Artificial Neural Networks
Prof. Carolina Ruiz Department of Computer Science
Presentation transcript:

Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 6: Artificial Neural Networks for Data Mining

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-2 Learning Objectives Understand the concept and definitions of artificial neural networks (ANN) Know the similarities and differences between biological and artificial neural networks Learn the different types of neural network architectures Learn the advantages and limitations of ANN Understand how backpropagation learning works in feedforward neural networks

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-3 Learning Objectives Understand the step-by-step process of how to use neural networks Appreciate the wide variety of applications of neural networks; solving problem types of Classification Regression Clustering Association Optimization

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-4 Neural Network Concepts Neural networks (NN): a brain metaphor for information processing Neural computing Artificial neural network (ANN) Many uses for ANN for pattern recognition, forecasting, prediction, and classification Many application areas finance, marketing, manufacturing, operations, information systems, and so on

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-5 Biological Neural Networks Two interconnected brain cells (neurons)

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-6 Processing Information in ANN A single neuron (processing element – PE) with inputs and outputs

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-7 Biology Analogy

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-8 Elements of ANN Processing element (PE) Network architecture Hidden layers Parallel processing Network information processing Inputs Outputs Connection weights Summation function

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-9 Elements of ANN Neural Network with One Hidden Layer

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-10 Elements of ANN Summation Function for a Single Neuron (a) and Several Neurons (b)

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-11 Elements of ANN Transformation (Transfer) Function Linear function Sigmoid (logical activation) function [0 1] Tangent Hyperbolic function [-1 1]  Threshold value?

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-12 Neural Network Architectures Several ANN architectures exist Feedforward Recurrent Associative memory Probabilistic Self-organizing feature maps Hopfield networks … many more …

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-13 Neural Network Architectures Architecture of a neural network is driven by the task it is intended to address Classification, regression, clustering, general optimization, association, …. Most popular architecture: Feedforward, multi-layered perceptron with backpropagation learning algorithm Used for both classification and regression type problems

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-14 Learning in ANN A process by which a neural network learns the underlying relationship between input and outputs, or just among the inputs Supervised learning For prediction type problems E.g., backpropagation Unsupervised learning For clustering type problems Self-organizing E.g., adaptive resonance theory

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-15 A Taxonomy of ANN Learning Algorithms

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-16 A Supervised Learning Process Three-step process: 1. Allocate random weights 2. Compute temporary outputs 3.Compare outputs with desired targets 4.Adjust the weights and repeat the process

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-17 How a Network Learns Example: single neuron that learns the inclusive OR operation (Page 257) *See your book for step-by-step progression of the learning process Learning parameters: Learning rate Momentum

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-18 How a Network Learns X 1 =0 Processing element(PE) X 2 =1 W 1 = 0. 1 W 2 =0.3.. Y j =0

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-19 Learning Model Architecture of ANN, Single neuron, 2 inputs Allocate random weights, 0.1, 0.3 Compute the Output Yj Error (Delta) = Zj – Yj Update the weights Wi(Final)=Wi(Initial) + alpha X delta X Xi

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-20 Backpropagation Learning Backpropagation of Error for a Single Neuron

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-21 Backpropagation Learning The learning algorithm procedure: 1. Initialize weights with random values and set other network parameters 2. Read in the inputs and the desired outputs 3. Compute the actual output (by working forward through the layers) 4. Compute the error (difference between the actual and desired output) 5. Change the weights by working backward through the hidden layers 6. Repeat steps 2-5 until weights stabilize

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-22 Development Process of an ANN

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-23 Sensitivity Analysis on ANN Models A common criticism for ANN: The lack of expandability The black-box syndrome! Answer: sensitivity analysis Conducted on a trained ANN The inputs are perturbed while the relative change on the output is measured/recorded Results illustrates the relative importance of input variables

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-24 Sensitivity Analysis on ANN Models

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-25 Other Popular ANN Paradigms Self Organizing Maps (SOM)  First introduced by the Finnish Professor Teuvo Kohonen  Applies to clustering type problems

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-26 Other Popular ANN Paradigms Self Organizing Maps (SOM) SOM Algorithm – 1. Initialize each node's weights 2. Present a randomly selected input vector to the lattice 3. Determine most resembling (winning) node 4. Determine the neighboring nodes 5. Adjusted the winning and neighboring nodes (make them more like the input vector) 6. Repeat steps 2-5 for until a stopping criteria is reached

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-27 Other Popular ANN Paradigms Self Organizing Maps (SOM) Applications of SOM Customer segmentation Bibliographic classification Image-browsing systems Medical diagnosis Interpretation of seismic activity Speech recognition Data compression Environmental modeling, many more …

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-28 Applications Types of ANN Classification Feedforward networks (MLP), radial basis function, and probabilistic NN Regression Feedforward networks (MLP), radial basis function Clustering Adaptive Resonance Theory (ART) and SOM Association Hopfield networks

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-29 Advantages of ANN Able to deal with (identify/model) highly nonlinear relationships Not prone to restricting normality and/or independence assumptions Can handle variety of problem types Usually provides better results (prediction and/or clustering) compared to its statistical counterparts Handles both numerical and categorical variables (transformation needed!)

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-30 Disadvantages of ANN They are deemed to be black-box solutions, lacking expandability It is hard to find optimal values for large number of network parameters Optimal design is still an art: requires expertise and extensive experimentation It is hard to handle large number of variables (especially the rich nominal attributes) Training may take a long time for large datasets; which may require case sampling

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-31 ANN Software Standalone ANN software tool NeuroSolutions BrainMaker NeuralWare NeuroShell, … for more (see pcai.com) … Part of a data mining software suit PASW (formerly SPSS Clementine) SAS Enterprise Miner Statistica Data Miner, … many more …

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-32 End of the Chapter Questions / comments…

Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall 6-33 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Printed in the United States of America. Copyright © 2011 Pearson Education, Inc. Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall