Media Manager Mail Access Unified Messaging Barbara Hohlt UC Berkeley Ericsson Presentation August 22, 2000.

Slides:



Advertisements
Similar presentations
Unified Communications Bill Palmer ADNET Technologies, Inc.
Advertisements

Voice over Internet Protocol at Space Telescope Science Institute.
Service Encapsulation in ICEBERG Bhaskaran Raman ICEBERG, EECS, U.C.Berkeley Presentation at Ericsson, Sweden, June 2001.
Enabling Technologies UM, UC and Avaya Modular Messaging R 4
Building Applications Using SIP Scott Hoffpauir Vice President, Engineering Fall 1999 VON, Atlanta.
Universal Inbox: Personal Mobility and Service Mobility in an Integrated Network Bhaskaran Raman ICEBERG, EECS, U.C.Berkeley Home Phone Voice Mail Pager.
MXIE overview 5/4/ Update1. MXIE Media Exchange Interface for End Users 5/4/ Update2.
Problem Statement Requirement –Service integration and personalization Goals –Any-to-any capability –Extensibility: ease of adding new end-points –Scalability:
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Video Object Tracking and Replacement for Post TV Production LYU0303 Final Year Project Spring 2004.
Automatic Content Extraction for Voic Using Ninja Goal: Make voic more accessible Enable faster browsing of many voic s Access from different.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Building Applications Using SIP Scott Hoffpauir Vice President, Engineering Fall 1999 VON, Atlanta.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Electrical Engineering Department Software Systems Lab TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY Meeting recorder Application based on Software Agents.
The Case for ICEBERG Integrated services from diverse networks-- “PANS” (Potentially Any Network Services) Service infrastructure that allows user level.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Richard Furuta Texas A&M University Center for the Study of Digital Libraries and The Department of Computer Science Firing a transition.
Internet Telephony Helen J. Wang Network Reading Group, Jan 27, 99 Acknowledgement: Jimmy, Bhaskar.
1 An overview Always Best Connected Networks Dênio Mariz Igor Chaves Thiago Souto Aug, 2004.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Technion – Israel Institute of Technology Department of Electrical Engineering Software Lab Remote Mailbox based on.NET technology Michael and Eugene Shamis.
SNMP & MIME Rizwan Rehman, CCS, DU. Basic tasks that fall under this category are: What is Network Management? Fault Management Dealing with problems.
Overview of Search Engines
23 October 2002Emmanuel Ormancey1 Spam Filtering at CERN Emmanuel Ormancey - 23 October 2002.
Exchange 2007 Overview. What Will We Cover? New features in Microsoft® Exchange 2007 The Exchange Management Console The Exchange Management Shell New.
Track: Speech Technology Kishore Prahallad Assistant Professor, IIIT-Hyderabad 1Winter School, 2010, IIIT-H.
1 Outlook Lesson 1 Outlook Basics and Microsoft Office 2010 Introductory Pasewark & Pasewark.
Media Manager Mail Access Barbara Hohlt and Steve Czerwinski UC Berkeley Ericsson Presentation 2000.
Copyright John Wiley & Sons, Inc. Chapter 3 – Interactive Technologies HCI: Developing Effective Organizational Information Systems Dov Te’eni Jane.
Computer Networking From LANs to WANs: Hardware, Software, and Security Chapter 12 Electronic Mail.
A form of communication in which electronic messages are created and transferred between two or more devices connected to a network.
MCSE Guide to Microsoft Exchange Server 2003 Administration Chapter Four Configuring Outlook and Outlook Web Access.
© 2008 Cisco Systems, Inc. All rights reserved.CIPT1 v6.0—5-1 Implementing Media Resources, Features, and Applications Integrating Cisco Unified Communications.
Your personal Display Section Presence notes Display Picture & ‘Indicator’ Contacts Activity Feeds Conversations Phone Options Your Location Your Presence.
Postacademic Interuniversity Course in Information Technology – Module C1p1 Contents Data Communications Applications –File & print serving –Mail –Domain.
Universal Inbox: Extensible Personal Mobility and Service Mobility in an Integrated Network Bhaskaran Raman, Randy H. Katz, Anthony D. Joseph ICEBERG,
1.1 What is the Internet What is the Internet? The Internet is a shared media (coaxial cable, copper wire, fiber optics, and radio spectrum) communication.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Iptel not telip 1 03/19/99 Internet Telephony: not Telephony over Internet Jonathan Rosenberg Bell Laboratories Spring VoN 99.
February 2006Colby College ITS Introduction to Entourage 2008.
Computer and Information Science Ch1.3 Computer Networking Ch1.3 Computer Networking Chapter 1.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
Performix Business Services Converging to Unified Messaging With convergence of telephony, computer and multimedia communication becoming a realistic vision,
Design Decisions / Lessons Learned Monday 21 August : :35 Top-level design decisions Rationale for IP-based approach Why an infrastructure.
Microsoft Office Outlook 2013 Microsoft Office Outlook 2013 Courseware # 3252 Lesson 6: Organizing Information.
Introducing Microsoft Outlook Session Version 1.0 © 2011 Aptech Limited.
IT WEB TECHNOLOGY Prepared by, K.ABINAYA Lect/IT.
Accessible Media Using Video and Audio to meet the needs of a diverse populations Presented by Kaela Parks.
NetTech Solutions Troubleshooting Office Applications Lesson Seven.
Michael Möhlenhof T-Mobil;Systemtechnology page:1 T-Mobil Germany From Voic to Unified Messaging Concepts Presentation at the VMA Conference.
Presentation Title 1 1/27/2016 Lucent Technologies - Proprietary Voice Interface On Wireless Applications Protocol A PDA Implementation Sherif Abdou Qiru.
Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy.
30 Copyright © 2009, Oracle. All rights reserved. Using Oracle Business Intelligence Delivers.
1 Internet Telephony: Architecture and Protocols an IETF Perspective Authors:Henning Schulzrinne, Jonathan Rosenberg. Presenter: Sambhrama Mundkur.
IP Columbia Internet Real-Time Laboratory Department of Computer Science Columbia University.
800-MEDIA-MGR UID: UID: hohltb: Prefers Desktop mediamgr: Cluster locn. Bhaskar’s Cell-Phone.
Spring 2006 CPE : Application Layer_ 1 Special Topics in Computer Engineering Application layer: Some of these Slides are Based on Slides.
Discovering Computers 2011: Living in a Digital World Chapter 3
Internet Business Associate v2.0
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
Simon Tucker and Steve Whittaker University of Sheffield
Design Decisions / Lessons Learned
Inserting Graphics, Media, and Objects
Lesson 21 Enhancing Presentations with Multimedia Effects
Alan Shen Director Unify Square
ICEBERG Release Version 0
Problem Statement Communication devices Communication services
Part II Application Layer.
Presentation transcript:

Media Manager Mail Access Unified Messaging Barbara Hohlt UC Berkeley Ericsson Presentation August 22, 2000

Messages from many sources PSTN Phone Cell-Phone Desktop Pager MediaManager Mail Access

Project Overview Make messages more accessible –Get all types of messages –Access from different devices with different capabilities –Enable faster browsing of many voic s Media Mail services –A unified messaging infrastructure –Voic is encoded in MIME Transcoding services –Enhance voic interaction –Includes: skimmed audio, transcript, text/audio summary, and outline

Related Work Universal Inboxes/Unified Messaging –onebox.com –CoolMail.net –Lucent/Octel Unified Messenger –Stanford Mobile People Architecture Audio Content Extraction Techniques –SpeechSkimmer, MIT’s MultiMedia Lab [Arons95] –Auto-Summarization, Microsoft Research –CueVideo, IBM

Architecture Transcoder Service Voic ->Text Transcript Voic ->Text Summary Voic ->Text Outline ->Plain Audio -. GSM Audio Voic -> GSM Summary Voic ->Audio Summary Voic ->Skimmed Audio Mail Access Interface NinjaMail Client Folder Store Client Mail Access Interface POP Mail Access Interface IMAP Media Manager Interface Media Manager Service

Applications Conventional GUIs Context-Aware Applications Iceberg Universal Inbox Component Desktop MediaManager Mail Access A conventional desktop gui can contact the Media Manager directly and request messages as text. The Media Manager will return s and voic s as text.

Context-Aware Application Palm Device Desktop Redirection Proxy 1 1 palm device asks for a list of messages as text and selects a voic 2 2 requests a redirection from the proxy, which forwards the redirection request to the desktop 3 3 desktop asks for the voic and plays it MediaManager Mail Access

Bhaskar’s Cell-Phone Automatic Path Creation Service 800-MEDIA-MGR UID: Naming Service 1 1 Preference Registry mediamgr: Cluster locn Barbara’s PSTN Phone Universal Inbox Iceberg Universal Inbox MediaManager Mail Access

Architecture Transcoder Service Voic ->Text Transcript Voic ->Text Summary Voic ->Text Outline ->Plain Audio -. GSM Audio Voic -> GSM Summary Voic ->Audio Summary Voic ->Skimmed Audio Mail Access Interface NinjaMail Client Folder Store Client Mail Access Interface POP Mail Access Interface IMAP Media Manager Interface Media Manager Service

MediaManagerServiceIF getFolders( ) and getFoldersAs( ) –Given a username, returns a list of folder names –Returns the list as audio or gsm getList( ) and getListAs( ) –Given a username, foldername, and count –Returns a list of messages (sendername, title, date) –Returns the list as audio or gsm getMessage( ) –Given a Message Ref, returns the entire message getMessageContent( ) –Given a Content ID and return type –Returns one part of the message as the return type

Media Message –Media Reference id –Array of Content Objects Content Object –Content ID –Data Content ID –Media Reference id –Content Part index –Content Type Messages and Content Objects

Interface Example MediaManager Mail Access User asks for list of messages as GSM Media Manager returns a list of message headers Cell Phone sends a Content ID back Media Manager sends a voic Content Object Cell-Phone Media Message Header Content Object Content ID

Audio Tools Speech Recognition/Synthesis –Transcribe voic to text –IBM ViaVoice SDK and custom audio libs Natural Language Processing –Directed word spotting by “understanding” content –ViaVoice SRCL Pitch –Detecting important words by emphasized pitch Pause –Compression through pause removal Spurts –Retrieve sentence structure of voic

Transcoding Techniques Voice Mail ->Text TranscriptSpeech recognition Voice Mail ->Text Summary NLP, pitch detection and recognition Voice Mail ->Text Outline Pause detection and speech recognition E Mail ->Plain AudioSpeech synthesis E Mail ->GSM AudioSpeech synthesis and toast Voice Mail ->Skimmed AudioPause detection Voice Mail ->Audio Summary Text summary and speech synthesis Voice Mail ->GSM SummaryAudio summary and toast

Examples Original Voic “Hello, This is Barbara. How are you and the cats doing? I was wondering if you would feed them a little more the first time in case they eat too much. My number is (713) You can call me anytime. Have a very good holiday. Bye bye” Processed Voic Phyllis Barbara Area in the cat staring And then if you run but feed them A little more the first time in case they eat too much On my number is (713) You can call me anytime. Have every holiday Of light Translated Talk spurts (Pitch emphasized words in green) (Skimmed)(Just pitch) Translated using NLP Hello this is Barbara My number is (713)

Examples continued... Original Voic “Faced with a seemingly inevitable engineering task authors tend to adopt one of two strategies for adding new services to the Internet landscape: inflexible, highly tuned, hand-constructed services….” Processed Voic Translated Talk spurts (Pitch emphasized words in green) (Skimmed)(Just pitch) Translated using NLP Faced with a seemingly inevitable engineering task authors tend to adopt what it to strategies for adding new services to the internet landscape. Inflexible, highly Tate, had constructed services….”

Results Pause detection –Worked well for given applications –Playback speedup by 50-70% Pitch detection –Problems due to high pitch sounds and transitions Speech recognition –Performance decrease in conversational settings Natural Language Processing –Performed well with small grammar

Example: Adding GSM Acess Define a specific types, ie GSMAudio, GSMSummary Optionally create new Content Objects Add Content Object definition to MediaManager Add add gsm transcoder to TranscoderService

Detail: Adding GSM Access Add Content Object definition to MediaManager –Define GSMAUDIO and GSMSUMMARY –Add cases to createObject() in Content Object –Add cases to Media Manager Add GSM to Transcodeer –Add method toGSM() to Transcoder –Edit.config file External.transcoder.gsmrungsm –Edit related transcoders speechSynthesizer and audioSummary()

Implementing Other Mail Stores Examples: IMAP, POP, Microsoft Exchange Server Implement MailAccessIF –String [] getMAFolders( userName ) –MediaMessage [] getMAList( userName, folderName, count ) –MediaMessage getMAMessage( MediaRef ) –ContentObject getMAMessageContent( ContentID ) Add new protocol to Media Manager protocol table Optionally add protocol for users in to FolderStore

Conclusion Overall –System useful as navigational hints –To achieve total comprehension, need better voice recognition What works well –Skimming using pause removal –Detecting spurts for structure What needs work –Speech detection in conversational settings –Pitch emphasis needs refining Future Directions –Implementing more mail stores –Enhancing interfaces –Pause detection/word boundaries using speech detection –Developing voic grammars –Using NLP feedback with pitch emphasis detection –Improved speech detection in noisy environments

MediaManagerServiceIF String[] getFolders( userName ) byte[][] getFoldersAs( userName, returnType ) MediaMessage [] getList( userName, folderName, count ) byte[][] getListAs( userName, folderName, count, returnType ) MediaMessage getMessage( MediaRef ) ContentObject getMessageContent( ContentID, returnType )

Pitch Detection The Idea –A speaker’s pitch naturally changes when introducing topics or emphasizing words [Hirshberg92] –Use pitch increases as hints for “important” words Algorithm [Aaron95] –Determine pitch for each 20 ms frame (FFT with SHS) –Set emphasis threshold to be top 1% of pitch values (by histogram) –Mark 1 sec interval as emphasized if contains >=3 emphasized frames

Pause Detection Why is pause detection useful? –Removing pauses speedups playback Typically, 50-70% of original time [Foulke71] –Long pauses signify groups (talk spurts) Noise and soft sounds create difficulties Algorithm: Smoothed Histogram [Lamet81] –Calculate energy per 10 ms frame –Threshold based on smoothed histogram (5 dB after first peak) –Use heuristics to remove artifacts Average energy (dB) Percent of Frames

Results Pause detection –Worked well for given applications –Playback speedup by 50-70% Pitch detection –Problems due to high pitch sounds and transitions Speech recognition –Performance decrease in conversational settings Natural Language Processing –Performed well with small grammar

Conclusion Overall –System useful as navigational hints –To achieve total comprehension, need better voice recognition What works well –Skimming using pause removal –Detecting spurts for structure What needs work –Speech detection in conversational settings –Pitch emphasis needs refining Future Directions –Implementing more mail stores –Enhancing interfaces –Pause detection/word boundaries using speech detection –Developing voic grammars –Using NLP feedback with pitch emphasis detection –Improved speech detection in noisy environments

Works Cited [Arons95] B. Arons. Interactively Skimming Recorded Speech, Ph.D. dissertation, MIT [Foulke71] E. Foulke The Perception of Time Compressed Speech. Ch 4 in Perception of Language, edit by P.M. Kjeldergaaid, D.L. Horton, and J.J. Jenkins, Charles E. Merill Publishing Company, pp [Hirshberg92] J. Hirschberg and B. Grosz. Intonational Features of Local and Global Discourse. In Proceedings of the Speech and Natural Language workshop (Harriman, NY, Feb ). Morgan Kaufman Publishers, pp [Lamel81] L.F. Lamel, L.R. Rabiner, A.E. Rosenberg, and J.G. Wilpson. An Improved Endpoint Detector for Isolated Word Recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP-29, 4. (Aug, 1981),

Architecture Transcoder Service Voic ->Text Transcript Voic ->Text Summary Voic ->Text Outline ->Plain Audio -. GSM Audio Voic -> GSM Summary Voic ->Audio Summary Voic ->Skimmed Audio Mail Access Interface NinjaMail Mail Access Interface POP Mail Access Interface IMAP Client Folder Store Media Manager Service Media Manager Interface