Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy.

Slides:



Advertisements
Similar presentations
Generation of Multimedia TV News Contents for WWW Hsin Chia Fu, Yeong Yuh Xu, and Cheng Lung Tseng Department of computer science, National Chiao-Tung.
Advertisements

National Technical University of Athens Department of Electrical and Computer Engineering Image, Video and Multimedia Systems Laboratory
Abuse Testing Laboratory Management Laboratory Management.
Giri Palanisamy Oak Ridge National Laboratory & Lorrie Apple Johnson U.S. Department of Energy October 16, 2013.
Lorrie Apple Johnson Lead Librarian, Information Analysis & Services Office of Scientific and Technical Information (OSTI) National Academy of Sciences.
TECHNOLOGY FOR MOBILE ADVERTISING SEARCH & COMMERCE © 2007 Apptera Inc. Optimizing Software Architecture for Voice Search SpeechTek 2007.
ACCESSIBLE TECHNOLOGIES FOR SPEECH MANAGEMENT “Making media accessible to all” ITU workshop – Geneva October 2013.
Nexidia Confidential “Searching Audio and Video Sources On the Web” SpeechTEK West 2007.
1 Texmex – November 15 th, 2005 Strategy for the future Global goal “Understand” (= structure…) TV and other MM documents Prepare these documents for applications.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Web- and Multimedia-based Information Systems. Assessment Presentation Programming Assignment.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Docsoft:AV Automatic Closed Captioning and Transcribing Appliance July 9 th, 2007.
1 CS 502: Computing Methods for Digital Libraries Lecture 20 Multimedia digital libraries.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Voice-enabled Image Identification System Design Aashish P. Shrestha Ming Ming Zheng Multimedia Signal Processing, University of Bridgeport, Connecticut.
CH 11 Multimedia IR: Models and Languages
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
DIVINES – Speech Rec. and Intrinsic Variation W.S.May 20, 2006 Richard Rose DIVINES SRIV Workshop The Influence of Word Detection Variability on IR Performance.
Databases & Data Warehouses Chapter 3 Database Processing.
The WorldWideScience Alliance: An International Partnership to Improve Access to Scientific and Technical Information Lorrie A. Johnson United States Department.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
Science Research: Journey to 10,000 Sources Presented by: Abe Lederman, President and Founder Deep Web Technologies, Inc. Special Libraries Association.
Dataset Citation: From Pilot to Production Mark Martin Assistant Director, Office of Scientific and Technical Information U.S. Department of Energy.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
The PrestoSpace Project Valentin Tablan. 2 Sheffield NLP Group, January 24 th 2006 Project Mission The 20th Century was the first with an audiovisual.
1 CS 430 / INFO 430 Information Retrieval Lecture 23 Non-Textual Materials 2.
NoteSearch - Find what you’re looking for. Prototype Team B.
1 Federated Search (Emphasizing WorldWideScience.org) as a Transformational Technology Enabling Knowledge Discovery InterLending and Document Supply Conference.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
Prof. Thomas Sikora Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Integration Activities in “Tools for Tag Generation“
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and Audio Brandon Muramatsu Andrew McKinney
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
WorldWideScience.org: An International Knowledge-Sharing Model Brian A. Hitson Office of Scientific & Technical Information U.S. Department of Energy.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Dr. Walter L. Warnick Director Office of Scientific and Technical Information Office of Science ARPA-E June 24, 2010 Innovative Web Resources Can Advance.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Advancing Science: OSTI’s Current and Future Search Strategies Jeff Given IT Operations Manager Computer Protection Program Manager Office of Scientific.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Unlocking Audio/Video Content with Speech Recognition Behrooz Chitsaz Director, IP Strategy Microsoft Research Frank Seide Lead.
1 CS 430 / INFO 430 Information Retrieval Lecture 17 Metadata 4.
Competence Centre on Information Extraction and Image Understanding for Earth Observation PLATO for Information Mining in Satellite Imagery Soufiane RITAL,
Search and Annotation Tool for Oral History INTER-VIEWS Henk van den Heuvel, Centre for Language and Speech Technology (CLST) Radboud University Nijmegen,
Definition, purposes/functions, elements of IR systems Lesson 1.
Big Data: Every Word Managing Data Data Mining TerminologyData Collection CrowdsourcingSecurity & Validation Universal Translation Monolingual Dictionaries.
INTRODUCTION TO INFORMATION SYSTEMS LECTURE 9: DATABASE FEATURES, FUNCTIONS AND ARCHITECTURES PART (2) أ/ غدير عاشور 1.
Multi-Source Information Extraction Valentin Tablan University of Sheffield.
Digital Video Library - Jacky Ma.
Live Global Sports Events
Visual Information Retrieval
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
Chapter Five Web Search Engines
Introduction Multimedia initial focus
Tim Smith CERN Geneva, Switzerland
3.0 Map of Subject Areas.
ICSTI Annual Conference 2012
DIGITAL LIBRARY.
Cloud Platform Helps to Empower Citizens and Keep Costs in Check for Local Governments MINI-CASE STUDY “By moving Love Clean Streets to the Microsoft Azure.
Office of Scientific and Technical Information
Atelier Progress Report
Presentation transcript:

Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy

Multimedia Research Speech Search Face identification Object recognition Video browsing Semantic extraction (3D) Segmentation (3D) Image search

Speech as interface Speech as 1 st class content Mobile access Directory services Automation PC application Web service Text input Dictation Indexing Search Keyword extraction Transcription Meetings Voic s Closed Caption Translation Translating phone Speech Applications

Speech recognition Spectral Analysis Matching (Decoding) time alignment  most likely hypothesis W’=argmax (w 1..w N ) p(o t..o  |w 1..w N ) P(w 1..w N ) Acoustic Models p(o t..o  |phoneme) Dictionary P(phonemes|w) Grammar (Language Model) P(w 1..w N ) “Hello World” o 1..o T (w 1..w N )^

MAVIS technology Indexing automatic transcripts as text –Automatic transcription accuracy is only 50-80% MAVIS techniques –Word-level lattice indexing index word alternatives – robust to recognizer errors % accuracy improvement index timing – navigate to exact point in video –Vocabulary Adaptation Use NLP and Bing Search to expand word dictionary –Automatic keywords to expose to search engines Enables discovery of speech content through search engines Bi-product of vocabulary adaptation –See

MAVIS Architecture SQL Server(s) 1. Submit audio/video RSS 2. Retrieve AIB 3. Import AIB in SQL Web server(s) 4. Search/Retrieve results Store content to be processed in temporary Azure storage Do vocabulary adaptation using Bing Run recognition engine on content Store results or recognition process (AIB)

U.S. Department of Energy Office of Scientific and Technical Information (OSTI) Mission DOE invests > $10 billion/year in basic sciences, clean energy technology, and nuclear research. The immediate output from this investment is Information…Knowledge… R&D results OSTI’s mission is to accelerate scientific progress by accelerating access to this information.

OSTI’s Core Products Information Bridge Science Accelerator Science.gov

WorldWideScience.org

Emerging Forms of Scientific Information Require New Tools Numeric data, multimedia, and social media are emerging forms of scientific information Each form presents special opportunities and challenges

Search and Retrieval Challenges with Multimedia Science Information Lack of written transcripts, i.e. no “full text” to search Metadata, if available, is often minimal Scientific, technical, and medical terminology/vocabulary Videos can be long, often up to an hour or more

Video files collected from DOE’s National Laboratories RSS feeds with metadata and URLs sent to Microsoft Research Audio indexing performed via MAVIS Audio index blob (AIB) returned to OSTI and integrated with SQL servers Users can search for a precise term within the video, and be directed to the exact point in the video where the term was spoken OSTI and Microsoft Research Partnership

Demonstration of ScienceCinema ScienceCinema

Looking to the Future Additional content from DOE researchers Integration of multimedia searches into WorldWideScience.org by June High quality automatic closed captions Multilingual translation capabilities