Presentation is loading. Please wait.

Presentation is loading. Please wait.

ACM CIKM October 31, 2013 Jeff Hawkins On-line Learning From Streaming Data.

Similar presentations


Presentation on theme: "ACM CIKM October 31, 2013 Jeff Hawkins On-line Learning From Streaming Data."— Presentation transcript:

1 ACM CIKM October 31, 2013 Jeff Hawkins jhawkins@GrokSolutions.com On-line Learning From Streaming Data

2 1) Discover operating principles of neocortex 2)Build systems based on these principles Anatomy, Physiology Theoretical principles Software Industrial Research Track Anomaly detection in high velocity data Cortical algorithms

3 The neocortex is a memory system. data stream retina cochlea somatic The neocortex learns a model from sensory data - predictions - anomalies - actions The neocortex learns a sensory-motor model of the world

4 Principles of Neocortical Function retina cochlea somatic 1) On-line learning from streaming data data stream

5 Principles of Neocortical Function 2) Hierarchy of memory regions retina cochlea somatic 1) On-line learning from streaming data data stream

6 Principles of Neocortical Function 2) Hierarchy of memory regions retina cochlea somatic 3) Sequence memory - inference - motor data stream 1) On-line learning from streaming data

7 Principles of Neocortical Function 4) Sparse Distributed Representations 2) Hierarchy of memory regions retina cochlea somatic 3) Sequence memory data stream 1) On-line learning from streaming data

8 Principles of Neocortical Function retina cochlea somatic data stream 2) Hierarchy of memory regions 3) Sequence memory 5) All regions are sensory and motor 4) Sparse Distributed Representations Motor 1) On-line learning from streaming data

9 Principles of Neocortical Function retina cochlea somatic data stream x x x x x x xxxxxx x 2) Hierarchy of memory regions 3) Sequence memory 5) All regions are sensory and motor 6) Attention 4) Sparse Distributed Representations 1) On-line learning from streaming data

10 Principles of Neocortical Function 4) Sparse Distributed Representations 2) Hierarchy of memory regions retina cochlea somatic 3) Sequence memory 5) All regions are sensory and motor 6) Attention data stream 1) On-line learning from streaming data These six principles are necessary and sufficient for biological and machine intelligence. - All mammals from mouse to human have them

11 Sparse Distributed Representations (SDRs) Many bits (thousands) Few 1’s mostly 0’s Example: 2,000 bits, 2% active Each bit has semantic meaning Meaning of each bit is learned, not assigned 01000000000000000001000000000000000000000000000000000010000…………01000 Dense Representations Few bits (8 to 128) All combinations of 1’s and 0’s Example: 8 bit ASCII Individual bits have no inherent meaning Representation is assigned by programmer 01101101 = m

12 A Few SDR Properties 1) Similarity: shared bits = semantic similarity subsampling is OK Indices 1 2 | 10 2) Store and Compare: store indices of active bits Indices 1 2 3 4 5 | 40

13 Coincidence detectors How does a layer of neurons learn sequences? Sequence Memory (for inference and motor)

14 Each cell is one bit in our Sparse Distributed Representation SDRs are formed via a local competition between cells.

15 SDR (time =1)

16 SDR (time =2)

17 Cell forms connections to subsample of previously active cells. Predicts its own future activity.

18 Multiple Predictions Can Occur at Once With one cell per column, 1 st order memory We need a high order memory

19 High Order Sequence Memory Enabled by Columns of Cells Cortical Learning Algorithm (CLA) Distributed sequence memory High order High capacity Multiple simultaneous predictions Semantic generalization

20 1)NuPIC Open Source Project Three Current Directions

21 www.Numenta.org Single source tree (used by GROK) GPLv 3 Steady community growth – 67 contributors (+26 since July) – 245 mailing list subscribers – 1621 total messages eBook from community member OS community joining Kaggle Competitions Fall Hackathon: 70 attendees NuPIC Open Source Project

22 1)NuPIC Open Source Project 2)Custom CLA Hardware -Needed for scaling research and commercial applications -DARPA “Cortical Processor” -IBM, Seagate, Sandia Labs 3)Commercialization Three Current Directions

23 2. Look at data3. Build models Problem: - Doesn’t scale with velocity and #models Solution: - Automated model creation - Continuous learning - Temporal inference 1. Store data Stream data Automated model creation Continuous learning Temporal inference Predictions Anomalies Actions Past Future Data: Past and Future

24 Anomaly Detection Using Predictive Cortical Models Cortical Memory Encoder SDR Prediction Point anomaly score Time average Distribution of averages Metric anomaly score Metric 1 Cortical Memory Encoder SDR Prediction Point anomaly score Time average Distribution of averages Metric anomaly score SDR Metric N...... System Anomaly Score

25 Largely predictable Largely unpredictable Metric value Anomaly score Metric value Anomaly score

26  Smartphone-centric  Ranks anomalous instances  Rapid drill down  Continuously updated  User-controlled notifications Breakthrough Science for Anomaly Detection Reinventing UX for IT Monitoring Grok for IT Monitoring  Detects problems thresholds miss  Continuous learning  Automated model building  State-of-the art neocortical model In private beta for Amazon AWS cloud users grokbeta@GrokSolutions.com

27  Custom metrics for any application/server  Web interface and mobile client source code available under no-cost license  Engine API to be published  NuPIC open source community Extensible Architecture

28


Download ppt "ACM CIKM October 31, 2013 Jeff Hawkins On-line Learning From Streaming Data."

Similar presentations


Ads by Google