Yansong Feng and Mirella Lapata

Slides:



Advertisements
Similar presentations
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Advertisements

Automatic summarization Dragomir R. Radev University of Michigan
Reading a Scientific Journal Article Dr. C.’s AP Chemistry Lake Dallas High School Fall 2014.
 Andisheh Keikha Ryerson University Ebrahim Bagheri Ryerson University May 7 th
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
1 Question Answering in Biomedicine Student: Andreea Tutos Id: Supervisor: Diego Molla.
Applications Chapter 9, Cimiano Ontology Learning Textbook Presented by Aaron Stewart.
Predicting Text Quality for Scientific Articles Annie Louis University of Pennsylvania Advisor: Ani Nenkova.
Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Automating Discovery from Biomedical Texts Marti Hearst & Barbara Rosario UC Berkeley Agyinc Visit August 16, 2000.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
MANISHA VERMA, VASUDEVA VARMA PATENT SEARCH USING IPC CLASSIFICATION VECTORS.
Approaches to automatic summarization Lecture 5. Types of summaries Extracts – Sentences from the original document are displayed together to form a summary.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
1 Today  Tools (Yves)  Efficient Web Browsing on Hand Held Devices (Shrenik)  Web Page Summarization using Click- through Data (Kathy)  On the Summarization.
Finding Advertising Keywords on Web Pages Scott Wen-tau YihJoshua Goodman Microsoft Research Vitor R. Carvalho Carnegie Mellon University.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Information Retrieval in Practice
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
What is Readability?  A characteristic of text documents..  “the sum total of all those elements within a given piece of printed material that affect.
Search Engines and Information Retrieval Chapter 1.
1 The BT Digital Library A case study in intelligent content management Paul Warren
Text- and Content-based Approaches to Image Retrieval for the ImageCLEF 2009 Medical Retrieval Track Matthew Simpson, Md Mahmudur Rahman, Dina Demner-Fushman,
©2008 Srikanth Kallurkar, Quantum Leap Innovations, Inc. All rights reserved. Apollo – Automated Content Management System Srikanth Kallurkar Quantum Leap.
PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.
COMPUTER-ASSISTED PLAGIARISM DETECTION PRESENTER: CSCI 6530 STUDENT.
What are they and what should you know about them?
Modern Information Retrieval Computer engineering department Fall 2005.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Mining the Web to Create Minority Language Corpora Rayid Ghani Accenture Technology Labs - Research Rosie Jones Carnegie Mellon University Dunja Mladenic.
Similar Document Search and Recommendation Vidhya Govindaraju, Krishnan Ramanathan HP Labs, Bangalore, India JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE.
1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)
A Machine Learning Approach to Sentence Ordering for Multidocument Summarization and Its Evaluation D. Bollegala, N. Okazaki and M. Ishizuka The University.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
1 Automatic Classification of Bookmarked Web Pages Chris Staff Second Talk February 2007.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
Artificial Intelligence Research Center Pereslavl-Zalessky, Russia Program Systems Institute, RAS.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
IAT Text ______________________________________________________________________________________ SCHOOL OF INTERACTIVE ARTS + TECHNOLOGY [SIAT]
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
Research © 2008 Yahoo! Generating Succinct Titles for Web URLs Kunal Punera joint work with Deepayan Chakrabarti and Ravi Kumar Yahoo! Research.
Information Retrieval
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Presented by Brandon Smith Computer Vision.
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Principals of Research Writing. What is Research Writing? Process of communicating your research  Before the fact  Research proposal  After the fact.
ASSOCIATIVE BROWSING Evaluating 1 Jinyoung Kim / W. Bruce Croft / David Smith for Personal Information.
The Cross Language Image Retrieval Track: ImageCLEF Breakout session discussion.
Collaborative Query Previews in Digital Libraries Lin Fu, Dion Goh, Schubert Foo Division of Information Studies School of Communication and Information.
Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.
The Effect of Database Size Distribution on Resource Selection Algorithms Luo Si and Jamie Callan School of Computer Science Carnegie Mellon University.
Internet Searching the World Wide Web. The Internet and the World Wide Web The Internet is a worldwide collection of networks that allows people to communicate.
A Survey on Automatic Text Summarization Dipanjan Das André F. T. Martins Tolga Çekiç
Visual Information Retrieval
Clustering of Web pages
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Presented by: Hassan Sayyadi
Applying Key Phrase Extraction to aid Invalidity Search
Data Warehousing and Data Mining
a chicken and egg problem…
Show, not tell.
Unsupervised learning of visual sense models for Polysemous words
Jiwon Kim Steve Seitz Maneesh Agrawala
Presentation transcript:

Yansong Feng and Mirella Lapata How Many Words is a Picture Worth? Automatic Caption Generation for News Images Yansong Feng and Mirella Lapata Ashish Bagate

What this paper is about Explore the feasibility of automatic caption generation for images in news domain Why particularly news domain – training data is available easily and abundantly

Why Lots of digital images available on the Web Improved searching Analysis of the image Keywords only searches are ambiguous Targeted queries using longer search strings Web accessibility

General Approach Two step process Analyze the image and build a representation for the same Run the text generation engine on the image representation, and come up with a natural language representation

Related Work Hede et al. – not practical because of controlled data set and also manual database creation Yao et al. – based on just the image Elzer et al. – what the graphic depicts, little emphasis on graphics generation These methods use some background information /terminologies

Problem Formulation For the given image I and the document D, generate a caption C Training data contains document – image – caption tuples Caption generation is a difficult task even for humans A good caption must be succinct, informative, clearly identify the subject of the picture, draw reader to the article

Overview of the method Similar to Headline generation task Get the training data (it would be noisy) Follows two stage approach Get the keywords from the image (image annotation model) Generate the caption from the given image words Use of image features for faithful and meaningful description for the images

Image Annotation Probabilistic model – well suited for noisy data Calculate SIFT descriptors of images Visual words by K means clustering Get the keywords by LDA dmix - bag of words representing image – document – caption

Extractive Caption Generation Not much linguistic analysis is needed Caption would be a sentence from the document which is maximally similar to description keywords

Types of Similarities Word Overlap Cosine Similarity Probabilistic Similarity KL divergence – similarity between an image and a sentence is measured by the extent to which they share the same topic distributions

Issues with Extractive Caption Generation No single sentence can represent the image Selected caption sentences might be longer than the average length of the sentence May not be catchy

Abstractive Caption Generation Word based model Adapted from headline generation Caption = the sequence of words that maximizes P

Abstractive Caption Generation Phrase based model Caption = the sequence of words that maximizes P

Evaluation…

Evaluation…

Evaluation

Thanks!