Workshop on Social Events in Web Multimedia, ICMR 2014 Social Event Detection at MediaEval: a three-year retrospect of tasks and results Georgios Petkos,

Presentation transcript:

Workshop on Social Events in Web Multimedia, ICMR 2014 Social Event Detection at MediaEval: a three-year retrospect of tasks and results Georgios Petkos, Symeon Papadopoulos, Vasileios Mezaris, Raphael Troncy, Philipp Cimiano, Timo Reuter, Yiannis Kompatsiaris

ICMR 2014, SEWM Workshop Vasileios Mezaris

Overview
– The problem of social event detection.
– The social event detection task.
– Evolution of the task and datasets.
– Overview of approaches pursued by participants and results.
– Outlook.

Social events?
Attended by people and represented by multimedia content shared online.
– Entertainment: concert / play / sports
– Personal: wedding / birthday / drinks
– News: demonstration / riot / speech

[Image captions: Pope Francis; Pope Benedict. 2007: iPhone release; 2008: Android release; 2010: iPad release.]

Social event detection
Social event detection involves the discovery and retrieval of social events in collections of multimedia.
[Diagram: collection -> social event detection -> event set E1, E2, ..., EN]

The social event detection task
As part of the well-known MediaEval benchmarking activity, a task on social event detection has been running for 3 years (2011-2013).
Interest in the task has grown:
– 2011: 7 participants
– 2012: 5 participants
– 2013: 11 participants

SED 2011: The task
Two challenges were defined in 2011. In both, participants were provided with a set of images collected from Flickr and were asked to surface events of a particular type at particular locations:
1. Soccer matches in Barcelona and Rome.
2. Concerts in Paradiso and Parc Del Forum.
Two differences between the two challenges:
1. In the first, both a topical and a location criterion are defined, whereas in the second only a location criterion is defined.
2. The specificity of the location of interest is different.

SED 2011: The dataset, ground truth and evaluation
– 73,645 photos collected from Flickr.
– All photos were originally geo-tagged and were taken in 5 different cities in May 2009 (geo-tags were removed for 80% of the pictures in the provided dataset).
– The ground truth was generated by utilizing machine tags provided by event directories as well as an automatic cluster-based framework.
Evaluation measures:
1. F-score
2. NMI
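The slides name F-score and NMI as the evaluation measures but do not define them. The following is a minimal sketch under two assumptions that may differ from the official task evaluation: the F-score is computed over pairs of items, and NMI is normalised by the mean of the two label entropies. The function names `nmi` and `pairwise_f_score` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def nmi(truth, pred):
    """Normalised mutual information between two labelings.

    Assumption: normalised by the arithmetic mean of the two entropies.
    """
    n = len(truth)
    count_t = Counter(truth)
    count_p = Counter(pred)
    joint = Counter(zip(truth, pred))
    mi = sum(
        (c / n) * math.log((c * n) / (count_t[t] * count_p[p]))
        for (t, p), c in joint.items()
    )
    h_t = -sum((c / n) * math.log(c / n) for c in count_t.values())
    h_p = -sum((c / n) * math.log(c / n) for c in count_p.values())
    if h_t == 0 and h_p == 0:
        return 1.0  # both labelings place every item in a single cluster
    return 2 * mi / (h_t + h_p)


def pairwise_f_score(truth, pred):
    """F-score over item pairs: a pair is positive if both items share a cluster."""
    tp = fp = fn = 0
    for i, j in combinations(range(len(truth)), 2):
        same_truth = truth[i] == truth[j]
        same_pred = pred[i] == pred[j]
        if same_truth and same_pred:
            tp += 1
        elif same_pred:
            fp += 1
        elif same_truth:
            fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy labelings: four photos, two ground-truth events.
truth = [0, 0, 1, 1]
perfect = [1, 1, 0, 0]   # same grouping, different label names
merged = [0, 0, 0, 0]    # everything in one cluster
```

Both measures depend only on the grouping, not on the label names, so `perfect` scores as an exact match while `merged` is penalised for its low precision.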

SED 2012: The task
Three challenges, similar to those of the first year, were defined in 2012. Again, participants were provided with a set of images collected from Flickr and were asked to surface events of a particular type at particular locations:
1. Technical events (e.g. exhibitions and fairs) that took place in Germany.
2. Soccer events in Hamburg and Madrid.
3. Demonstration and protest events of the Indignados movement in Madrid.
Characteristics of the challenges:
1. Theme and location of the queries are quite different.
2. The notion of "technical events" is somewhat fuzzy.
3. Indignados events are spontaneously organized.

SED 2012: The dataset, ground truth and evaluation
– 167,332 photos collected from Flickr.
– Again, all photos were originally geo-tagged but geo-tags were removed for 80% of the pictures in the provided dataset.
– The ground truth was again generated by utilizing machine tags provided by event directories as well as an automatic cluster-based framework.
Evaluation measures:
1. F-score
2. NMI

SED 2013: The task
Two completely new challenges were defined:
1. Produce a complete clustering of the image dataset according to events.
   – Extension: Assign a set of videos to the clusters generated from the image dataset.
2. Classify event media as either representing a social event or not and, for those that do represent a social event, identify the type of event (eight event types were defined).

SED 2013: The dataset, ground truth and evaluation
Separate dataset, ground truth and evaluation for each challenge.
Challenge 1:
– Dataset: 427,370 pictures from Flickr and 1,327 videos from YouTube corresponding to 21,169 events.
– Ground truth: obtained from last.fm and upcoming machine tags.
– Evaluation: F-score and NMI.
Challenge 2:
– Dataset: 27,754 training images and 29,411 test images collected from Instagram.
– Ground truth: obtained by manual annotation.
– Evaluation: NMI.

Evolution of the task
Two distinct eras of the task:
1. First two years. Datasets contained both event and non-event images and the task was to retrieve sets of images matching the given criteria.
2. Third year. Broken into two subtasks (no filtering subtask though):
   1. Full clustering.
   2. Detection of event type in individual images.
Also, datasets have become larger and richer.

Approaches: First era, 2011-2012 (1/4)
At a very high level there are two types of approaches:
1. Matching images to event descriptions retrieved from online event directories.
2. Applying a sequence of filtering or classification and clustering steps on the datasets.

Approaches: First era, 2011-2012 (2/4)
Methods in the first class differ in the way that matching is carried out:
– Indexing and querying in Lucene.
– Probabilistic matching.
Methods in the second class are much more popular:
– Some utilize external sources, e.g. DBpedia or the Google geocoding API, to enrich the matching criteria.
– For most of them, time and location (sometimes inferred from the textual metadata when geo-tags are not available) are the primary criteria for clustering.
– Alternatively, some approaches treat the problem as a multimodal clustering problem utilizing a learned similarity metric.

Approaches: First era, 2011-2012 (3/4)
– For challenge 1, the best approach first classifies images into cities and then groups them into buckets containing photos from the same day and city.
– For challenge 2, two matching-based approaches achieved the best results (most likely because the type of event makes it more likely to find relevant info in online directories).
[Results table: F-score and NMI for Challenge 1 and Challenge 2; the numeric values are missing from this transcript. Participants: Brenner et al., Hintsa et al., Liu et al., Nguyen et al., Papadopoulos et al., Ruocco et al., Wang et al.]
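The day-and-city bucketing described for challenge 1 can be sketched as follows. This is only an illustration: the photo records, their field names (`id`, `city`, `taken`) and the function `bucket_by_city_and_day` are hypothetical, and the `city` value is assumed to have already been predicted by a separate city classifier that is not shown.

```python
from collections import defaultdict
from datetime import datetime


def bucket_by_city_and_day(photos):
    """Group photo ids into buckets keyed by (city, calendar day)."""
    buckets = defaultdict(list)
    for photo in photos:
        day = photo["taken"].date()  # drop the time of day
        buckets[(photo["city"], day)].append(photo["id"])
    return dict(buckets)


# Hypothetical photos; "city" stands in for the output of a city classifier.
photos = [
    {"id": "p1", "city": "Barcelona", "taken": datetime(2009, 5, 2, 18, 30)},
    {"id": "p2", "city": "Barcelona", "taken": datetime(2009, 5, 2, 21, 5)},
    {"id": "p3", "city": "Rome", "taken": datetime(2009, 5, 2, 20, 0)},
    {"id": "p4", "city": "Barcelona", "taken": datetime(2009, 5, 3, 9, 15)},
]

buckets = bucket_by_city_and_day(photos)
```

Each bucket is then a candidate event that later steps can filter or refine.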

Approaches: First era, 2011-2012 (4/4)
The best approach, by Vavliakis et al., involves the following steps:
1. City classification.
2. For the images of each city, topic modeling using LDA is performed.
3. The topic model is used to match the photos that are relevant to the queries.
4. Events are identified by finding, for each topic and city of interest, the days for which the number of images is above some threshold.
[Results table: F-score and NMI per challenge; the numeric values are missing from this transcript. Participants: Zeppelzauer et al., Vavliakis et al., Schinas et al., Brenner et al., Dao et al.]
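Step 4 of the approach above (finding the days on which a city and topic have more images than a threshold) can be sketched as a simple count. The input format, the threshold value and the function name `detect_events` are assumptions made for illustration; the city and topic labels are taken as given, standing in for the outputs of the city classifier and the LDA model.

```python
from collections import Counter


def detect_events(assignments, threshold=2):
    """Return (city, topic, day) triples with more than `threshold` images.

    `assignments` holds one (city, topic, day) triple per image.
    """
    counts = Counter(assignments)
    return sorted(key for key, count in counts.items() if count > threshold)


# Hypothetical per-image assignments; days are ISO date strings.
assignments = [
    ("Madrid", "soccer", "2012-04-21"),
    ("Madrid", "soccer", "2012-04-21"),
    ("Madrid", "soccer", "2012-04-21"),
    ("Madrid", "soccer", "2012-04-22"),
    ("Hamburg", "soccer", "2012-04-21"),
    ("Hamburg", "soccer", "2012-04-21"),
]

events = detect_events(assignments, threshold=2)
```

Only the Madrid soccer images of 2012-04-21 exceed the threshold here, so that triple is the single detected event.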

Approaches: Second era, 2013 (1/4)
For the first challenge, there are two main types of approaches:
1. Sequence of unimodal clustering operations.
2. Multimodal clustering using a learned similarity measure.
However, there are also some rather distinct approaches, e.g.:
1. An approach that applies a Chinese Restaurant Process to perform a stochastic clustering of images.
2. An approach that utilizes WordNet to compute appropriate semantic similarity measures.
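For readers unfamiliar with the Chinese Restaurant Process mentioned above, the sketch below samples cluster assignments from the plain CRP prior: each new item joins an existing cluster with probability proportional to its size, or opens a new one with probability proportional to a concentration parameter. The slides do not describe how the participant combined this with image features, so this is not their method; `crp_assignments`, `alpha` and `seed` are hypothetical names.

```python
import random


def crp_assignments(n_items, alpha=1.0, seed=0):
    """Sample cluster labels for n_items from a Chinese Restaurant Process prior."""
    rng = random.Random(seed)
    sizes = []   # sizes[k] = number of items currently in cluster k
    labels = []
    for _ in range(n_items):
        # Existing clusters are weighted by size, a new cluster by alpha.
        weights = sizes + [alpha]
        choice = rng.choices(range(len(weights)), weights=weights)[0]
        if choice == len(sizes):
            sizes.append(1)      # open a new cluster
        else:
            sizes[choice] += 1   # join an existing cluster
        labels.append(choice)
    return labels


labels = crp_assignments(20, alpha=1.0, seed=0)
```

The number of clusters is not fixed in advance, which is what makes the process attractive when the number of events in a collection is unknown.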

Approaches: Second era, 2013 (2/4)
– Results are better than in the 2 previous years (probably because a filtering step is not required).
– The best approach computes one affinity matrix per modality, averages them and uses the average for clustering as part of a DBSCAN or spectral clustering procedure.
[Results table, Challenge 1: F-score and NMI; the numeric values are missing from this transcript. Participants: Rafailidis et al., Samangooei et al., Schinas et al., Vizuete et al., Nguyen et al., Zeppelzauer et al., Sutanto et al., Wistuba et al., Papaoikonomou et al., Gupta et al., Brenner et al.]
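The fusion step of the best approach (one affinity matrix per modality, averaged) can be sketched as below. To keep the example dependency-free, the clustering step is a deliberately simpler stand-in: connected components of the thresholded average affinity, not DBSCAN or spectral clustering as used by the participants. The matrices, the threshold and the function names are all hypothetical.

```python
def average_affinities(matrices):
    """Element-wise mean of several equally sized square affinity matrices."""
    n = len(matrices[0])
    return [
        [sum(m[i][j] for m in matrices) / len(matrices) for j in range(n)]
        for i in range(n)
    ]


def threshold_clusters(affinity, threshold=0.5):
    """Stand-in clustering: connected components of the graph that links
    items whose affinity is at least `threshold`."""
    n = len(affinity)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and affinity[i][j] >= threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


# Hypothetical affinities for four photos in two modalities.
time_affinity = [
    [1.0, 0.9, 0.1, 0.2],
    [0.9, 1.0, 0.2, 0.1],
    [0.1, 0.2, 1.0, 0.8],
    [0.2, 0.1, 0.8, 1.0],
]
tag_affinity = [
    [1.0, 0.7, 0.3, 0.0],
    [0.7, 1.0, 0.0, 0.1],
    [0.3, 0.0, 1.0, 0.6],
    [0.0, 0.1, 0.6, 1.0],
]

fused = average_affinities([time_affinity, tag_affinity])
clusters = threshold_clusters(fused, threshold=0.5)
```

In practice the fused matrix would be handed to a clustering routine that accepts precomputed affinities or distances; the averaging itself is the part taken from the slide.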

Approaches: Second era, 2013 (3/4)
For the second challenge, all approaches adopt a classification procedure. They differ in the set of features that they utilize. For instance:
– One approach utilizes scalable Laplacian Eigenmaps to obtain, in a semi-supervised manner, an appropriate representation of the images.
– Another approach uses semantic similarity features based on WordNet.

Approaches: Second era, 2013 (4/4)
The best performing approach uses an SVM classifier and a very rich set of textual features, including a set of ontological features (visual features are not used).
[Results table, Challenge 2: F-score (per category) and F-score (event/non-event); the numeric values are missing from this transcript. Participants: Schinas, Nguyen, Sutanto, Brenner.]

Outlook for the SED task
– Remarkable number of participants in the last year and appearance of quite novel approaches.
– The SED task is organized in 2014 as well! Three challenges this year:
  – Full clustering.
  – Retrieval / filtering.
  – Summarization / labelling of events.
– Registration opens soon!

Outlook for the problem of social event detection
We haven't seen any approach for dealing with the problem of social event detection "in the wild":
– The image collections examined so far had a high ratio of event to non-event photos; applying the same methods to a random collection of images would most likely produce poor results.
– Classification of images as event or non-event related is important for dealing with the more general scenario.
– Additionally, accurate event/non-event classifiers may assist in obtaining more focused crawling mechanisms.
Other directions:
– Combination of agnostic approaches (such as clustering) and approaches that utilize event directories.
– More extensive usage of visual content, rather than mostly metadata.

Acknowledgments