Ch7: Hopfield Neural Model

Ch7: Hopfield Neural Model
The Hopfield neural model (HNM) is a kind of BAM. There are two versions of Hopfield memory: 1. discrete, (a) sequential and (b) parallel; 2. continuous.
7.1. Discrete Hopfield Memory
Recall the auto-BAM. Mathematically, the memory is specified by its training set and its weight matrix (constructed below).

HNM Architecture: The HNM has no self-loop connection for each unit, and each unit has an external input signal.

Weight matrix: 1. Form W as the sum of outer products of the stored exemplars. 2. Force the diagonal elements of W to be zero (no self-loops). The input and output of each unit are defined as before, except that the threshold, to be defined, differs from the BAM's (which is 0).
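As an illustration, the two steps can be coded directly. A minimal NumPy sketch; the function and variable names are illustrative and not from the slides:

import numpy as np

def hopfield_weights(patterns):
    # patterns: an (M, N) array of stored exemplars (rows are patterns)
    P = np.asarray(patterns, dtype=float)
    W = P.T @ P                # step 1: sum of outer products of the exemplars
    np.fill_diagonal(W, 0.0)   # step 2: force the diagonal to zero (no self-loops)
    return W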

Energy function: the external input is treated as an input with weight 1, and the threshold is viewed as a negative (inhibitory) input, also with weight 1.
7.1.1 Sequential (Asynchronous) Hopfield Model
Given a set of M binary patterns of N components each, form the weight matrix and the threshold vector as above.

□ Energy function (with a minus sign): if the stored exemplars are orthogonal, every exemplar corresponds to a local minimum of E (feature space vs. energy space). A particular exemplar is retrieved by looking for its corresponding local minimum in the energy space using a descent approach.
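A small helper for this energy, assuming the common form in which the quadratic term carries a factor 1/2, the external input enters with weight +1, and the threshold enters as a negative input (the exact scaling on the slides is not recoverable):

import numpy as np

def hopfield_energy(v, W, I=None, theta=None):
    # E(v) = -1/2 v^T W v - I^T v + theta^T v
    v = np.asarray(v, dtype=float)
    I = np.zeros_like(v) if I is None else np.asarray(I, dtype=float)
    theta = np.zeros_like(v) if theta is None else np.asarray(theta, dtype=float)
    return -0.5 * v @ W @ v - I @ v + theta @ v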

Let net_i = Σ_j w_ij v_j + I_i − θ_i, i.e., the net input to unit i. Consider a one-bit change, say Δv_i. The resulting energy change is ΔE = −net_i Δv_i, so to decrease the energy, the sign of Δv_i should be consistent with the sign of net_i.

□ Algorithm (Sequential Hopfield Model). Input a pattern as the initial state. In a sequential fashion, for each unit i: i. compute the net input net_i; ii. update the state of unit i by thresholding net_i; iii. repeat until none of the elements changes state.
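A runnable sketch of this loop, assuming binary {0, 1} states as in the examples that follow (the rule of keeping the current state when the net input is exactly zero is an assumption):

import numpy as np

def hopfield_sequential(v0, W, I=None, theta=None, order=None, max_sweeps=100):
    # Asynchronous recall: update one unit at a time until no unit changes state.
    v = np.asarray(v0, dtype=float).copy()
    N = len(v)
    I = np.zeros(N) if I is None else np.asarray(I, dtype=float)
    theta = np.zeros(N) if theta is None else np.asarray(theta, dtype=float)
    order = list(range(N)) if order is None else order        # cyclic update ordering
    for _ in range(max_sweeps):
        changed = False
        for i in order:
            net = W[i] @ v + I[i] - theta[i]                  # i.  net input to unit i
            new = 1.0 if net > 0 else (0.0 if net < 0 else v[i])  # ii. threshold; keep state if net == 0
            if new != v[i]:
                v[i] = new
                changed = True
        if not changed:                                       # iii. no element changed state
            return v
    return v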

□ Convergence proof: from (A), the energy never increases on any bit change. □ Local minima and attractors. Local minimum: a point whose energy level is ≤ that of every nearest neighbor. Attractor: an equilibrium state. ※ A local minimum must be an attractor, but the reverse is not necessarily true.

○ Example 1: Two training pattern vectors Weight matrix:

where the subtracted term nullifies the diagonal elements. Threshold vector: as defined above. Suppose an input vector is given. By cyclic update ordering: i. First iteration (k = 0), with the initial vector equal to the input vector.

a. 1st bit (i = 1): compute the net input and update the state; the 1st bit is updated.

b. 2nd bit (i = 2): compute the net input and update the state; the 2nd bit is unchanged. c. 3rd bit (i = 3): unchanged. d. 4th bit (i = 4): unchanged.

The above can be performed more simply in two steps: 1. compute the net-input vector; 2. update the state vector.

ii. Second iteration: 1. compute the net-input vector; 2. update the state vector.

iii. Terminate. ※ A different update ordering may retrieve a different output. ○ Example 2: The convergent state depends on the order of update. Two patterns and the resulting weight matrix:

Threshold vector: ※ The output can be obtained by following the energy-descending directions in a hypercube.

i. Energy level for [0 1 1 0]

ii. Energy level for [0 0 1 1]. There can be more than one direction in which the energy level descends; the path taken is determined by the order in which the bits are updated.

。 Start with (0 0 1 0), with energy −1. Two paths lead to lower energy: (0 0 1 0) → (0 0 1 1) with energy −2, and (0 0 1 0) → (1 0 1 0) with energy −2, depending on whether the left-most or the right-most bit is updated first.

7.1.2 Parallel (Synchronous) Hopfield Model
□ Weights: ※ the diagonal weights are not set to zero (i.e., each unit keeps its self-loop). Thresholds: as in the sequential model.
。 Algorithm: during the kth iteration, i. compute the net inputs in parallel for i = 1, …, N;

ii. Update the states in parallel. Repeat until none of the elements changes. □ Convergence: consider the energy function at the kth parallel iteration and the energy-level change due to one iteration.
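The synchronous version differs only in that all net inputs are computed from the previous state vector; a sketch under the same assumptions as the sequential version:

import numpy as np

def hopfield_parallel(v0, W, I=None, theta=None, max_iters=100):
    # Synchronous recall: every unit is updated from the previous state vector.
    v = np.asarray(v0, dtype=float).copy()
    N = len(v)
    I = np.zeros(N) if I is None else np.asarray(I, dtype=float)
    theta = np.zeros(N) if theta is None else np.asarray(theta, dtype=float)
    for _ in range(max_iters):
        net = W @ v + I - theta                                  # i.  net inputs, in parallel
        new = np.where(net > 0, 1.0, np.where(net < 0, 0.0, v))  # ii. parallel update
        if np.array_equal(new, v):                               # stop when no element changes
            return v
        v = new
    return v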

∵ W is a nonnegative-definite matrix (W is formed from outer products, hence symmetric and nonnegative definite). Combining (1) and (2), the energy change per iteration is ≤ 0.

□ Remarks. A local/global minimum must be an attractor, but an attractor is not necessarily a local/global minimum. There are many more spurious attractors in the parallel model than in the sequential model. The parallel model does not get trapped in local minima as easily as the sequential model (even if a state is one bit away from a local minimum, it may not be trapped by that attractor, because several bits are changed in one iteration).

The parallel model appears to outperform the sequential model in terms of the percentage of correct retrievals.
7.1.3. Capacities of Hopfield and Hamming Networks
Capacity: the number of distinct patterns that can be stored in the network. □ If a neural network contains N neurons, the capacity C of the network is at most of the order N / (4 ln N), as derived below.

Proof: Given p patterns. Idea: (i) for a pattern, if the probability that any of its bits changes is sufficiently low, then the pattern is considered a good attractor; (ii) if all p patterns are good attractors, then the network is said to have capacity p; otherwise, the capacity is lower than p.

。Work with the bipolar representation. Consider an input exemplar that is one of the stored patterns; expand its net input into a signal term plus a crosstalk term from the other stored patterns.

Multiplying each net input by the corresponding stored bit separates a unit signal term from the crosstalk term: the bit changes from −1 to 1, or from 1 to −1, when and only when the crosstalk term exceeds the signal term with the opposite sign. 。 Define the bit-error rate as the probability of this event.

Suppose the pattern bits are independent and equiprobable. If Np is large, then by the central limit theorem the crosstalk term is approximately Gaussian.

Suppose the total error probability must be < ε (a criterion of stability and discernibility). The error probability is accumulated over each pattern and each neuron (bit). This leads to a bound on p. Taking the logarithm and letting N be large (the N term dominates) gives the capacity bound.
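The argument can be checked empirically. The following sketch estimates the probability that a stored bit flips on the first update; the parameter values are arbitrary, and thresholds are omitted as in the derivation:

import numpy as np

def bit_error_rate(N=200, p=20, trials=200, seed=0):
    # Fraction of stored bits that flip on the first update when p random
    # bipolar patterns are stored in an N-unit network.
    rng = np.random.default_rng(seed)
    total = 0.0
    for _ in range(trials):
        X = rng.choice([-1.0, 1.0], size=(p, N))   # p random bipolar patterns
        W = X.T @ X
        np.fill_diagonal(W, 0.0)
        v = X[0]                                   # present one stored pattern
        net = W @ v
        total += np.mean(np.where(net >= 0, 1.0, -1.0) != v)  # bits whose sign disagrees
    return total / trials

Increasing p for a fixed N makes the estimate climb rapidly once p approaches the N / (4 ln N) regime.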

□ Central Limit Theorem: for i.i.d. random variables with finite mean and variance, the normalized sum converges in distribution to a standard normal. □ Change-of-variable formula: for Y = g(X), where g is a differentiable, strictly increasing or strictly decreasing function and X is a continuous random variable, the density of Y follows from that of X.
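In their standard forms (the slides' own symbols are not recoverable from the transcript):

% Central limit theorem: X_1, ..., X_n i.i.d. with mean \mu and variance \sigma^2
\frac{\sum_{k=1}^{n} X_k - n\mu}{\sigma \sqrt{n}} \;\xrightarrow{d}\; \mathcal{N}(0, 1)
\qquad \text{as } n \to \infty .

% Change of variable: Y = g(X), g differentiable and strictly monotone, X continuous
f_Y(y) = f_X\bigl(g^{-1}(y)\bigr) \left| \frac{d}{dy}\, g^{-1}(y) \right| .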

7.2. Continuous Hopfield Memory
□ Resembles an actual neuron in having a continuous, graded output. □ An electronic circuit using amplifiers, resistors, and capacitors can be built in VLSI. A PE consists of an amplifier and an inverting amplifier used to produce the inhibitory signal. A resistor determined by the weight w_ij is placed at the intersection connecting units i and j.

Total input current = external current + linking current − leakage current, where the linking current flows in through the connection resistors and the leakage current flows out through the unit's own resistance.

Treat the circuit as a transient RC circuit, in which the capacitor is charged by the net-input current, i.e., C du_i/dt equals the net input to unit i (C: capacitance). □ Energy function: from (A), and using (C) and (B), show that E is a Lyapunov function.

Let the output functions be monotonically increasing functions (e.g., sigmoids); then dE/dt ≤ 0 along the dynamics.
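A sketch of these dynamics by plain Euler integration, assuming a sigmoid output function and the RC form of the circuit equation given above (parameter names and values are illustrative):

import numpy as np

def continuous_hopfield(W, I, u0, dt=0.01, steps=2000, C=1.0, R=1.0, gain=1.0):
    # Integrate  C du/dt = W g(u) + I - u/R,  with g(u) = sigmoid(gain * u)
    W = np.asarray(W, dtype=float)
    I = np.asarray(I, dtype=float)
    u = np.asarray(u0, dtype=float).copy()
    for _ in range(steps):
        v = 1.0 / (1.0 + np.exp(-gain * u))    # monotonically increasing output function
        u += dt * (W @ v + I - u / R) / C      # charge the capacitor with the net-input current
    return 1.0 / (1.0 + np.exp(-gain * u))     # final outputs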

7.3.3 The Traveling-Salesperson Problem
Constraints: 1. visit each city; 2. visit each city only once; 3. minimize the total distance. Brute force: (n − 1)!/2 distinct tours, since the 2 travel directions are equivalent and the starting city can be fixed.

□ Hopfield solution. i. A set of n PEs represents the n possible positions of a city in the tour (matrix representation). Example tour solution BAECD: A: 01000, B: 10000, C: 00010, D: 00001, E: 00100. ii. The entries of the matrix are indexed by x (city) and i (position).

iii. Energy function. Criteria: (a) each city is visited only once; (b) each position appears on the tour only once; (c) all cities are included; (d) the total distance is shortest. The energy is the sum of four corresponding terms (1)–(4), where d_xy denotes the distance between cities x and y.
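Written out in the standard Hopfield–Tank form, with d_xy the distance between cities x and y and the position index taken modulo n (the constants A, B, C, D and the normalization here follow the conventional formulation, not necessarily the slides' exact choice):

E = \frac{A}{2}\sum_{x}\sum_{i}\sum_{j \neq i} v_{xi} v_{xj}
  + \frac{B}{2}\sum_{i}\sum_{x}\sum_{y \neq x} v_{xi} v_{yi}
  + \frac{C}{2}\Bigl(\sum_{x}\sum_{i} v_{xi} - n\Bigr)^{2}
  + \frac{D}{2}\sum_{x}\sum_{y \neq x}\sum_{i} d_{xy}\, v_{xi}\,(v_{y,i+1} + v_{y,i-1})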

When the network is stabilized, ideally: Term 1: each row of the matrix contains a single 1. Term 2: each column of the matrix contains a single 1. Term 3: the matrix contains exactly n ones, so that each row and each column contain exactly one 1.

Term 4: contributes d_xy when x and y are in sequence (adjacent) on the tour and 0 when they are not, so it sums the lengths of the legs actually traveled and is minimized by the shortest tour.

iv. Weight matrix: defined in terms of inhibitions between PEs. (A) Inhibition term from criterion (a). (B) Inhibition term from criterion (b).

(C) Inhibition term from criterion (c): −C, where C is a constant (global inhibition). (D) Inhibition term from criterion (d): if j = i − 1 or i + 1, then x and y are adjacent cities on the tour; two cities far apart should receive a large inhibition.

Pattern of inhibitory connections Unit a illustrates the inhibition between units on a single row. Unit b illustrates the inhibition between units on a single column. Unit c shows the inhibition of units in adjacent columns. The global inhibition is not shown.

。 Evolution of the network: discretize the continuous dynamics in time; the update can be written in 1-D (single-index) or 2-D (city, position) form.

Substitute the weight definitions into the discretized update rule.

Update and output rules as above. 。 Example: n = 10 cities. Select A = B = 500, C = 200, D = 500. Initialize the activations as in step 1 below.

1. Initialize the activations. 4. Update the activations. 6. Repeat steps 1–5 until the stopping criteria are satisfied.
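Putting the pieces together, a compact sketch of the whole procedure using the constants from the example; the step size, gain, and initialization scale are guesses, and the gradient terms follow the standard Hopfield–Tank energy written out earlier:

import numpy as np

def tsp_hopfield(dist, A=500.0, B=500.0, C=200.0, D=500.0,
                 dt=1e-5, steps=5000, gain=50.0, seed=0):
    # dist: (n, n) symmetric distance matrix with zero diagonal
    # v[x, i] close to 1 means city x occupies tour position i
    dist = np.asarray(dist, dtype=float)
    n = len(dist)
    rng = np.random.default_rng(seed)
    u = 0.02 * (rng.random((n, n)) - 0.5)             # small random initial activations
    for _ in range(steps):
        v = 1.0 / (1.0 + np.exp(-gain * u))           # PE outputs
        row = v.sum(axis=1, keepdims=True) - v        # other entries in the same row    -> criterion (a)
        col = v.sum(axis=0, keepdims=True) - v        # other entries in the same column -> criterion (b)
        glob = v.sum() - n                            # deviation from n active units    -> criterion (c)
        adj = np.roll(v, -1, axis=1) + np.roll(v, 1, axis=1)        # outputs at positions i+1 and i-1
        du = -A * row - B * col - C * glob - D * (dist @ adj) - u   # inhibitions plus leakage
        u += dt * du
    return 1.0 / (1.0 + np.exp(-gain * u))

The tour is read off by taking, for each position (column), the city (row) with the largest output; if the result is not a clean permutation matrix, rerun with a different seed or retune the constants.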