L’età della parola Giuseppe Attardi Dipartimento di Informatica Università di Pisa ESA SoBigDataPisa, 24 febbraio 2015.

Slides:



Advertisements
Similar presentations
Advanced Topics Giuseppe Attardi Dipartimento di Informatica Università di Pisa.
Advertisements

Nathan Wiebe, Ashish Kapoor and Krysta Svore Microsoft Research ASCR Workshop Washington DC Quantum Deep Learning.
Deep Learning in NLP Word representation and how to use it for Parsing
The Unreasonable Effectiveness of Data Alon Halevy, Peter Norvig, and Fernando Pereira Kristine Monteith May 1, 2009 CS 652.
Shallow Processing: Summary Shallow Processing Techniques for NLP Ling570 December 7, 2011.
Center for Computational Learning Systems Independent research center within the Engineering School NLP people at CCLS: Mona Diab, Nizar Habash, Martin.
1 Empirical Learning Methods in Natural Language Processing Ido Dagan Bar Ilan University, Israel.
Center for Computational Learning Systems Independent research center within the Engineering School NLP people at CCLS: Mona Diab, Nizar Habash, Martin.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
تمرين شماره 1 درس NLP سيلابس درس NLP در دانشگاه هاي ديگر ___________________________ راحله مکي استاد درس: دکتر عبدالله زاده پاييز 85.
Machine Learning in Natural Language Processing Noriko Tomuro November 16, 2006.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
ELN – Natural Language Processing Giuseppe Attardi
CAREERS IN LINGUISTICS OUTSIDE OF ACADEMIA CAREERS IN INDUSTRY.
Answer Validation Exercise - AVE QA subtrack at Cross-Language Evaluation Forum 2007 UNED (coord.) Anselmo Peñas Álvaro Rodrigo Valentín Sama Felisa Verdejo.
Ronan Collobert Jason Weston Leon Bottou Michael Karlen Koray Kavukcouglu Pavel Kuksa.
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.
Eric H. Huang, Richard Socher, Christopher D. Manning, Andrew Y. Ng Computer Science Department, Stanford University, Stanford, CA 94305, USA ImprovingWord.
This work is supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number.
Lecture 10: 8/6/1435 Machine Learning Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
Page 1 SenDiS Sectoral Operational Programme "Increase of Economic Competitiveness" "Investments for your future" Project co-financed by the European Regional.
CS 4705 Lecture 19 Word Sense Disambiguation. Overview Selectional restriction based approaches Robust techniques –Machine Learning Supervised Unsupervised.
Markov Logic and Deep Networks Pedro Domingos Dept. of Computer Science & Eng. University of Washington.
Constructing Knowledge Graph from Unstructured Text Image Source: Kundan Kumar Siddhant Manocha.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
Seminar Topics and Projects Giuseppe Attardi Dipartimento di Informatica Università di Pisa.
Xinhao Wang, Jiazhong Nie, Dingsheng Luo, and Xihong Wu Speech and Hearing Research Center, Department of Machine Intelligence, Peking University September.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Concept-Based Analysis of Scientific Literature Chen-Tse Tsai, Gourab Kundu, Dan Roth UIUC.
A Unified Architecture for Natural Language Processing: Deep Neural Networks with Multitask Learning Ronan Collobert Jason Weston Presented by Jie Peng.
Big data classification using neural network
Korean version of GloVe Applying GloVe & word2vec model to Korean corpus speaker : 양희정 date :
CS 388: Natural Language Processing: LSTM Recurrent Neural Networks
Taking a Tour of Text Analytics
Deep Learning Amin Sobhani.
Machine Learning overview Chapter 18, 21
Natural Language Processing with Qt
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Deep learning and applications to Natural language processing
AV Autonomous Vehicles.
Information Retrieval and Web Search
Unsupervised Learning and Autoencoders
Text Analytics Giuseppe Attardi Università di Pisa
Machine Learning Week 1.
Machine Learning in Natural Language Processing
LING 388: Computers and Language
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Introduction to Neural Networks And Their Applications - Basics
Seminar Topics and Projects
How to publish in a format that enhances literature-based discovery?
Computational Linguistics: New Vistas
PolyAnalyst Web Report Training
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Word embeddings (continued)
III. Introduction to Neural Networks And Their Applications - Basics
Lecture 21: Machine Learning Overview AP Computer Science Principles
Information Retrieval and Web Search
Introduction to Artificial Intelligence
Introducing Apache Mahout
Sequence-to-Sequence Models
Bidirectional LSTM-CRF Models for Sequence Tagging
Huawei CBG AI Challenges
Lecture 9: Machine Learning Overview AP Computer Science Principles
Presentation transcript:

L’età della parola Giuseppe Attardi Dipartimento di Informatica Università di Pisa ESA SoBigDataPisa, 24 febbraio 2015

Natural Language Learning Children learn to speak naturally, by talking with others Children learn to speak naturally, by talking with others Teach computers to learn language in a similarly natural way Teach computers to learn language in a similarly natural way

Statistical Machine Learning Training on large document collections Training on large document collections Requires ability to process Big Data Requires ability to process Big Data  If we used same algorithms 10 years ago they would still be running The Unreasonable Effectiveness of Big Data The Unreasonable Effectiveness of Big Data

Example: Machine Translation Arabic to English, five-gram language models, of varying size

Deep Learning Breakthrough: 2006 … … … … Output layer Prediction of target Hidden layers Learn more abstract representations Input layer Raw input

Lots of Unlabeled Data Language Model Language Model  Corpus: 2 B words  Dictionary: 130,000 most frequent words  4 weeks of training Parallel + CUDA algorithm Parallel + CUDA algorithm  2 hours

Word Embeddings Word Embeddings neighboring words are semantically related

A Unified Deep Learning Architecture for NLP NER (Named Entity Recognition) NER (Named Entity Recognition) POS tagging POS tagging Chunking Chunking Parsing Parsing SRL (Semantic Role Labeling) SRL (Semantic Role Labeling) Sentiment Analysis Sentiment Analysis

Deep Text Analysis Parsing Parsing Word Sense Disambiguation Word Sense Disambiguation Anafora Resolution Anafora Resolution Information Extraction Information Extraction Sentiment Analysis Sentiment Analysis Text Entailment Text Entailment Question Answering Question Answering Biomedical Text Analysis Biomedical Text Analysis

QA on Alzheimer Disease the γ-secretase inhibitor Semacestat failed to slow cognitive decline disorder SnowMed: C disorder SnowMed: C protein drug substance QA on Alzheimer Competition SUBJ OBJ APPO OBJ ROOT

Correlation Simptoms-Diseases

Big data, Big Brain Google DistrBelief Google DistrBelief  Cluster capable of simulating 100 billion connections  Used to learn unsupervised image classification  Used to produce tiny ASR model Similar basic capability for processing image, audio and language Similar basic capability for processing image, audio and language European FET Brain project European FET Brain project