WING Research Group Demos and Posters. Min-Yen Kan, Digital Libraries, 2nd CSAIL MIT Workshop.

WING Research Group Demos and Posters

Min-Yen Kan, Digital Libraries, 2nd CSAIL MIT Workshop

Demos

SlideSeer (M.-Y. Kan): Coordinating presentation slides and scholarly papers
 - Align set of slides to set of paragraphs
 - Difficult as alignment is not monotonic, nil alignments possible
 - JCDL 2007

LINC 2.0 (J. Prabawa)
 - How to build a better library catalog interface using AJAX
 - 4 key areas of user design change: overview + details, AJAX grid, tabs, suggestion bar
 - Usability studies for evaluation
 - JCDL 2007 short paper
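To make the two difficulties named for SlideSeer concrete (non-monotonic order and nil alignments), here is a minimal Python sketch. It is not the SlideSeer algorithm: the bag-of-words cosine similarity, the `nil_threshold` value, and all function names are assumptions chosen for illustration. Each slide independently picks its best-matching paragraph, so the result need not follow paper order, and a slide with no good match is aligned to nothing.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercased word counts; a deliberately crude text representation."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def align_slides(slides, paragraphs, nil_threshold=0.2):
    """Return, per slide, the index of its best paragraph or None (nil)."""
    para_vecs = [bag_of_words(p) for p in paragraphs]
    alignment = []
    for slide in slides:
        vec = bag_of_words(slide)
        scores = [cosine(vec, pv) for pv in para_vecs]
        best = max(range(len(scores)), key=scores.__getitem__) if scores else None
        # Hypothetical cut-off: weak matches become nil alignments.
        if best is None or scores[best] < nil_threshold:
            alignment.append(None)
        else:
            alignment.append(best)
    return alignment


slides = [
    "Evaluation: usability studies with library users",
    "Alignment of slides to paragraphs",
    "Thank you! Questions?",
]
paragraphs = [
    "We align each slide to a paragraph of the paper.",
    "Usability studies with users were carried out for evaluation.",
]
print(align_slides(slides, paragraphs))  # [1, 0, None]
```

In this toy input the first slide maps to the second paragraph and the second slide to the first, so the alignment is not monotonic, and the closing slide has no counterpart in the paper.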

Posters

Scenario Template Generation (L. Qiu)
 - Joint work with Tat-Seng Chua
 - Create template of slots for articles from multiple different event reports
 - Uses EM based context sensitive clustering

Phrase-based Statistical MT (H. Setiawan)
 - Joint work with Haizhou Li (I2R)
 - Focus on function words to reorder arguments (e.g., English preposition “above” to Chinese postposition “上”)

Record Linkage using Web resources (Y. F. Tan)
 - Joint work with Dongwon Lee (PSU)
 - How to best combine disparate info from various sources of web evidence

Math IR (J. Zhao)
 - Working on indexing, retrieving mathematical information
 - Key problem: co-ref between formula name “Pythagorean theorem” and equation form and variants

Graph-Based Text Summarization (Z. Lin)
 - Joint work with Wee Sun Lee
 - Using incrementally built graphs and PageRank to build better generic and update summarization systems
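As a rough illustration of how PageRank can rank sentences for summarization, the Python sketch below builds a sentence graph and scores it by power iteration. It is an assumption-laden toy, not the poster's system: the graph is built in one pass rather than incrementally, edge weights are Jaccard word overlap, and the names `pagerank` and `summarize` and the damping value are chosen for illustration only.

```python
import re


def tokens(sentence):
    """Set of lowercased words in a sentence."""
    return set(re.findall(r"[a-z0-9]+", sentence.lower()))


def pagerank(weights, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dense weighted adjacency matrix."""
    n = len(weights)
    if n == 0:
        return []
    rank = [1.0 / n] * n
    out_sums = [sum(row) for row in weights]
    for _ in range(iterations):
        new = [(1.0 - damping) / n] * n
        for i in range(n):
            for j in range(n):
                if out_sums[i] == 0:
                    # Node without outgoing edges: spread its rank evenly.
                    new[j] += damping * rank[i] / n
                else:
                    new[j] += damping * rank[i] * weights[i][j] / out_sums[i]
        rank = new
    return rank


def summarize(sentences, k=1):
    """Pick the k highest-ranked sentences, returned in document order."""
    sets = [tokens(s) for s in sentences]
    n = len(sentences)
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j and sets[i] and sets[j]:
                # Assumed edge weight: Jaccard overlap of the word sets.
                weights[i][j] = len(sets[i] & sets[j]) / len(sets[i] | sets[j])
    scores = pagerank(weights)
    top = sorted(range(n), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]


sentences = [
    "the cat sat on the mat",
    "the cat ate the fish",
    "the fish swam in the pond",
    "quantum computing is hard",
]
print(summarize(sentences, k=3))
```

The last sentence shares no words with the others, so it receives rank only from the teleport term and is left out of the three-sentence summary.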


SlideSeer: Aligned Presentation and Document Alignment
Min-Yen Kan, Web IR / NLP Group (WING)

Aligning slides to scholarly papers

Why do it?
 - Slides can be seen as a dual and/or summary of papers
 - Useful as a learning and comprehension aid
 - See OCW example in Leslie and Tomas’ lecture notes

What’s the goal?
 - A digital library of slides/papers and their fine-grained alignment to scholarly papers (an extension of CiteSeer or Rexa)

What data to use?
 - Seeded with CiteSeer data (thanks to CL Giles), merged in DBLP
 - Searched for corresponding presentation using Google API

SlideSeer Demo

 - Proof of concept
 - 10 presentation/paper pairs manually aligned
 - Allows focus on one medium with other in context
 - Allows full slideshow mode
 - Navigation using mouse, keyboard shortcuts
 - Search using Lucene IR package

Next steps

Planning to hook up current work in progress:
 - 2-stage CRF/SVM re-ranking citation segmentation algorithm
 - Automatic keyphrase extraction program
 - Automatic synthetic image classification
 - Automatic de-duplication module

Partnering with Simone Teufel (Cambridge, UK) to do argumentative zoning of documents
 - What is a citation used for?