Convolutional neural networks Abin - Roozgard.

Slides:



Advertisements
Similar presentations
Patient information extraction in digitized X-ray imagery Hsien-Huang P. Wu Department of Electrical Engineering, National Yunlin University of Science.
Advertisements

EA C461 - Artificial Intelligence
Face Recognition: A Convolutional Neural Network Approach
Introduction to Neural Networks Computing
Convolutional Neural Networks
Thesis title: “Studies in Pattern Classification – Biological Modeling, Uncertainty Reasoning, and Statistical Learning” 3 parts: (1)Handwritten Digit.
Machine Learning Neural Networks
Lecture 14 – Neural Networks
Aula 5 Alguns Exemplos PMR5406 Redes Neurais e Lógica Fuzzy.
Chapter 5 NEURAL NETWORKS
Multi Layer Perceptrons (MLP) Course website: The back-propagation algorithm Following Hertz chapter 6.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
NEURAL NETWORKS Introduction
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics By, Sruthi Moola.
Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain
Soft Computing Colloquium 2 Selection of neural network, Hybrid neural networks.
MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
Hurieh Khalajzadeh Mohammad Mansouri Mohammad Teshnehlab
 The most intelligent device - “Human Brain”.  The machine that revolutionized the whole world – “computer”.  Inefficiencies of the computer has lead.
Explorations in Neural Networks Tianhui Cai Period 3.
LINEAR CLASSIFICATION. Biological inspirations  Some numbers…  The human brain contains about 10 billion nerve cells ( neurons )  Each neuron is connected.
Yang, Luyu.  Postal service for sorting mails by the postal code written on the envelop  Bank system for processing checks by reading the amount of.
EE459 Neural Networks Examples of using Neural Networks Kasin Prakobwaitayakit Department of Electrical Engineering Chiangmai University.
Analysis of Classification Algorithms In Handwritten Digit Recognition Logan Helms Jon Daniele.
CSC321 Lecture 5 Applying backpropagation to shape recognition Geoffrey Hinton.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 6: Applying backpropagation to shape recognition Geoffrey Hinton.
Mentor Prof. Amitabha Mukerjee Deepak Pathak Kaustubh Tapi 10346
Each neuron has a threshold value Each neuron has weighted inputs from other neurons The input signals form a weighted sum If the activation level exceeds.
1 Convolutional neural networks Abin - Roozgard. 2  Introduction  Drawbacks of previous neural networks  Convolutional neural networks  LeNet 5 
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Optical Character Recognition
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
CSE343/543 Machine Learning Mayank Vatsa Lecture slides are prepared using several teaching resources and no authorship is claimed for any slides.
Big data classification using neural network
Convolutional Neural Network
Deep Learning Amin Sobhani.
an introduction to: Deep Learning
Gradient-based Learning Applied to Document Recognition
Data Mining, Neural Network and Genetic Programming
Data Mining, Neural Network and Genetic Programming
Intelligent Information System Lab
Classification of Hand-Written Digits Using Scattering Convolutional Network Dongmian Zou Advisor: Professor Radu Balan.
A VERY Brief Introduction to Convolutional Neural Network using TensorFlow 李 弘
Lecture 5 Smaller Network: CNN
Neural Networks 2 CS446 Machine Learning.
Deep Learning Qing LU, Siyuan CAO.
Non-linear classifiers Neural networks
Introduction to Neural Networks
Outline Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, “Gradient-based learning applied to document recognition,” Proceedings of the IEEE, vol. 86, no.
A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,
Deep learning Introduction Classes of Deep Learning Networks
network of simple neuron-like computing elements
Neural Network - 2 Mayank Vatsa
CSC 578 Neural Networks and Deep Learning
Creating Data Representations
Matteo Fischetti, University of Padova
On Convolutional Neural Network
Spatial Transformer Networks
Zip Codes and Neural Networks: Machine Learning for
Analysis of Trained CNN (Receptive Field & Weights of Network)
Introduction to Radial Basis Function Networks
Convolutional Neural Networks
Face Recognition: A Convolutional Neural Network Approach
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
An introduction to: Deep Learning aka or related to Deep Neural Networks Deep Structural Learning Deep Belief Networks etc,
Image recognition.
CSC 578 Neural Networks and Deep Learning
Outline Announcement Neural networks Perceptrons - continued
Presentation transcript:

Convolutional neural networks Abin - Roozgard

Presentation layout Introduction Drawbacks of previous neural networks Convolutional neural networks LeNet 5 Comparison Disadvantage Application

introduction

Introduction Convolutional neural networks Signal processing, Image processing improvement over the multilayer perceptron performance, accuracy and some degree of invariance to distortions in the input images

Drawbacks

Behavior of multilayer neural networks Structure Types of Decision Regions Exclusive-OR Problem Classes with Meshed regions Most General Region Shapes Single-Layer Two-Layer Three-Layer Half Plane Bounded By Hyper plane Convex Open Or Closed Regions Arbitrary (Complexity Limited by No. of Nodes) A B

Multi-layer perceptron and image processing One or more hidden layers Sigmoid activations functions

Drawbacks of previous neural networks the number of trainable parameters becomes extremely large

Drawbacks of previous neural networks Little or no invariance to shifting, scaling, and other forms of distortion

Drawbacks of previous neural networks Little or no invariance to shifting, scaling, and other forms of distortion Shift left

Drawbacks of previous neural networks 154 input change from 2 shift left 77 : black to white 77 : white to black

Drawbacks of previous neural networks scaling, and other forms of distortion

Drawbacks of previous neural networks the topology of the input data is completely ignored work with raw data. Feature 1 Feature 2

Drawbacks of previous neural networks Black and white patterns: Gray scale patterns : 32 * 32 input image

Drawbacks of previous neural networks

Improvement Fully connected network of sufficient size can produce outputs that are invariant with respect to such variations. Training time Network size Free parameters

Convolutional neural network (CNN)

History In 1995, Yann LeCun and Yoshua Bengio introduced the concept of convolutional neural networks. Yann LeCun, Professor of Computer Science The Courant Institute of Mathematical Sciences New York University Room 1220, 715 Broadway, New York, NY 10003, USA. (212)998-3283     yann@cs.nyu.edu

About CNN’s CNN’s Were neurobiologically motivated by the findings of locally sensitive and orientation-selective nerve cells in the visual cortex. They designed a network structure that implicitly extracts relevant features. Convolutional Neural Networks are a special kind of multi-layer neural networks.

About CNN’s CNN is a feed-forward network that can extract topological properties from an image. Like almost every other neural networks they are trained with a version of the back-propagation algorithm. Convolutional Neural Networks are designed to recognize visual patterns directly from pixel images with minimal preprocessing. They can recognize patterns with extreme variability (such as handwritten characters).

Shift and distortion invariance Classification Classification Convolutional neural network f1 f 2 Classification Pre-processing for feature extraction Input f1 … fn output Feature extraction Input Shift and distortion invariance classification output

Feature extraction layer Shift and distortion invariance or CNN’s Topology Feature maps C Feature extraction layer Convolution layer Shift and distortion invariance or Subsampling layer S

Feature extraction layer or Convolution layer detect the same feature at different positions in the input image. features

Feature extraction Convolve with Threshold 1 w13 w23 w22 w21 w33 w32 -1 Convolve with Threshold w13 w12 w11 w23 w22 w21 w33 w32 w31 w13 w12 w11 w23 w22 w21 w33 w32 w31

Feature extraction Shared weights: all neurons in a feature share the same weights (but not the biases). In this way all neurons detect the same feature at different positions in the input image. Reduce the number of free parameters. Inputs C S

Feature extraction If a neuron in the feature map fires, this corresponds to a match with the template.

Subsampling layer the subsampling layers reduce the spatial resolution of each feature map By reducing the spatial resolution of the feature map, a certain degree of shift and distortion invariance is achieved.

Subsampling layer the subsampling layers reduce the spatial resolution of each feature map

Subsampling layer

Subsampling layer The weight sharing is also applied in subsampling layers.

Subsampling layer the weight sharing is also applied in subsampling layers reduce the effect of noises and shift or distortion Feature map

Up to now …

LeNet 5

LeNet5 Introduced by LeCun. raw image of 32 × 32 pixels as input.

LeNet5 C1,C3,C5 : Convolutional layer. 5 × 5 Convolution matrix. S2 , S4 : Subsampling layer. Subsampling by factor 2. F6 : Fully connected layer.

LeNet5 All the units of the layers up to F6 have a sigmoidal activation function of the type:

LeNet5 +1 +1 F1 W1 F2 W2 Y0 +1 W84 F84

LeNet5 About 187,000 connection. About 14,000 trainable weight.

LeNet5

LeNet5

Comparison

Comparison Database : MNIST (60,000 handwritten digits ) Affine distortion : translation, rotation. elastic deformations : corresponding to uncontrolled oscillations of the hand muscles. MLP (this paper) : has 800 hidden unit.

Comparison “This paper” refer to reference[ 3] on references slide.

Disadvantages

Disadvantages From a memory and capacity standpoint the CNN is not much bigger than a regular two layer network. At runtime the convolution operations are computationally expensive and take up about 67% of the time. CNN’s are about 3X slower than their fully connected equivalents (size-wise).

Disadvantages Convolution operation 4 nested loops ( 2 loops on input image & 2 loops on kernel) Small kernel size make the inner loops very inefficient as they frequently JMP. Cash unfriendly memory access Back-propagation require both row-wise and column-wise access to the input and kernel image. 2-D Images represented in a row-wise-serialized order. Column-wise access to data can result in a high rate of cash misses in memory subsystem.

Application

Application A Convolutional neural network achieves 99.26% accuracy on a modified NIST database of hand-written digits. MNIST database : Consist of 60,000 hand written digits uniformly distributed over 0-9. Mike O'Neill

Application

Application

Application

Application

References [1].Y. LeCun and Y. Bengio.“Convolutional networks for images, speech, and time-series.” In M. A. Arbib, editor, The Handbook of Brain Theory and Neural Networks. MIT Press, 1995. [2].Fabien Lauer, ChingY. Suen, Gérard Bloch,”A trainable feature extractor for handwritten digit recognition“,Elsevier, october 2006. [3].Patrice Y. Simard, Dave Steinkraus, John Platt, "Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis," International Conference on Document Analysis and Recognition (ICDAR), IEEE Computer Society, Los Alamitos, pp. 958-962, 2003.

Questions 8 4 6