SceneMaker: Multimodal Visualisation of Natural Language Film Scripts Dr. Minhua Eunice Ma School of Computing & Intelligent Systems Faculty of Computing.

Slides:



Advertisements
Similar presentations
National Technical University of Athens Department of Electrical and Computer Engineering Image, Video and Multimedia Systems Laboratory
Advertisements

The Three Ways of Reading a Film
Film Terms & Techniques
Extraction and Visualisation of Emotion from News Articles Eva Hanser, Paul Mc Kevitt School of Computing & Intelligent Systems Faculty of Computing &
HOMER: A Creative Story Generation System Student: Dimitrios N. Konstantinou Supervisor: Prof. Paul Mc Kevitt School of Computing and Intelligent Systems.
Irek Defée Signal Processing for Multimodal Web Irek Defée Department of Signal Processing Tampere University of Technology W3C Web Technology Day.
ENTERFACE’08 Multimodal Communication with Robots and Virtual Agents.
MediaHub: An Intelligent Multimedia Distributed Hub Student: Glenn Campbell Supervisors: Dr. Tom Lunney Prof. Paul Mc Kevitt School of Computing and Intelligent.
PGNET, Liverpool JMU, June 2005 MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Paul Mc Kevitt School of Computing.
Spring 2007COMP Design Teams Team Structure Interdisciplinary Teams.
Aug 24, Fall 2005ITCS4010/50101 Design Teams Team Structure Interdisciplinary Teams.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
MUSCLE movie data base is a multimodal movie corpus collected to develop content- based multimedia processing like: - speaker clustering - speaker turn.
Sep 14, Fall 2006IAT 4101 Design Teams Team Structure Interdisciplinary Teams.
IT 342 : Fundamentals of Multimedia Introduction & Multimedia Authoring.
Drama, Script and Screenplay Writing.  Drama in writing means creating a story intended to be acted out either on the stage or on the movie screen. 
Multimedia Enabling Software. The Human Perceptual System Since the multimedia systems are intended to be used by human, it is a pragmatic approach to.
Object Orientated Data Topic 5: Multimedia Technology.
Sunee Holland University of South Australia School of Computer and Information Science Supervisor: Dr G Stewart Von Itzstein.
Using Visual Literacy as a Stimulus to Support High Quality Literacy Teaching and Learning. Jane Denyer.
Television Production Team. Standard 7.0 Standard Text: Exhibit knowledge of the television production team. Learning Goal: Students will be able to understand.
Supporting Content Creation for Games Through Assistive Technologies Dr. Michael Katchabaw Department of Computer Science The University of Western Ontario.
CONFUCIUS: An Intelligent MultiMedia Storytelling Interpretation and Presentation System Minhua Eunice Ma Supervisor: Prof. Paul Mc Kevitt School of Computing.
AmbiLearn: an ambient intelligent multimodal learning environment for children Jennifer Hyndman Supervisors: Dr. Tom Lunney, Prof. Paul Mc Kevitt Intelligent.
GUI: Specifying Complete User Interaction Soft computing Laboratory Yonsei University October 25, 2004.
Animating Virtual Humans in Intelligent Multimedia Storytelling Minhua Eunice Ma and Paul Mc Kevitt School of Computing and Intelligent Systems Faculty.
Writing Movie Reviews. Pair Activity While watching the video, answer the following questions on a size 2: How did the two critics begin their review.
3 Aspects of Film Literary Elements Dramatic Elements
Steps Toward an AGI Roadmap Włodek Duch ( Google: W. Duch) AGI, Memphis, 1-2 March 2007 Roadmaps: A Ten Year Roadmap to Machines with Common Sense (Push.
Game Industry and The Future of Game Pertemuan 12 Matakuliah: T0944-Game Design and Programming Tahun: 2010.
Building character animation for intelligent storytelling with the H-Anim standard Minhua Eunice Ma and Paul Mc Kevitt School of Computing and Intelligent.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
Parser-Driven Games Tool programming © Allan C. Milne Abertay University v
MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney & Paul Mc Kevitt School of Computing and Intelligent Systems Faculty.
Entertainment Writing “Critical reporting”. “Critical Reporting” Critical reporting refers to the coverage of drama, music, art, and literature by print.
Pre Production Concept Story Development Visual Development Technical Direction Production Management.
Object Orientated Data Topic 5: Multimedia Technology.
SceneMaker Intelligent Multimodal Visualisation of Natural Language Scripts Eva Hanser Dipl.-Des. (FH), M.Sc. Prof. Paul Mc Kevitt, Dr. Tom Lunney, Dr.
ETM Toolkit: A Development Tool Based On Extended Topic Map Lu Jiang, Jun Liu, Zhaohui Wu, Qinghua Zheng, Yanan Qian Speaker: Zhaohui Wu Xi’an Jiaotong.
Temporal Relations in Visual Semantics of Verbs Minhua Eunice Ma and Paul Mc Kevitt School of Computing and Intelligent Systems Faculty of Engineering.
CONFUCIUS: an Intelligent MultiMedia storytelling interpretation & presentation system Minhua Eunice Ma Supervisor: Prof. Paul Mc Kevitt School of Computing.
SceneMaker: Automatic Visualisation of Screenplays School of Computing & Intelligent Systems Faculty of Computing & Engineering University of Ulster, Magee,
A Reusable Scripting Engine for Automating Cinematics and Cut-Scenes in Video Games M. McLaughlin and M. Katchabaw Department of Computer Science The University.
(or, “Who Does What”).  The “writer” is the person who writes the original story for the film – he or she might simply come up with the concept and basic.
MULTIMEDIA Hardware 4/24/2017.
1 MPML and SCREAM: Scripting the Bodies and Minds of Life-Like Characters Soft computing Laboratory Yonsei University October 27, 2004.
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture Arno Hartholt (ICT), Thomas Russ (ISI),
Toward a Unified Scripting Language 1 Toward a Unified Scripting Language : Lessons Learned from Developing CML and AML Soft computing Laboratory Yonsei.
1 1. Representing and Parameterizing Agent Behaviors Jan Allbeck and Norm Badler 연세대학교 컴퓨터과학과 로봇 공학 특강 학기 유 지 오.
Österreichisches Forschnungsinstitut für Artificial Intelligence Representational Lego for ECAs Brigitte Krenn.
A MBI L EARN Ambient Intelligent Multimodal Learning Environment for Children 100 day review December 2008 Jennifer Hyndman Supervisors: Dr. Tom Lunney,
INTRODUCTION GORT is a virtual 3D modeling environment for computer programmers. Its main area of focus is to aid in the education of programmers learning.
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
“The primary purpose of cinematic lighting is to support the story by contributing to the overall visual structure of the film.” From Advanced Renderman.
Intelligent MultiMedia Storytelling System (IMSS) - Automatic Generation of Animation From Natural Language Input By Eunice Ma Supervisor: Prof. Paul Mc.
PGNET, Liverpool JMU, June 2005 MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Paul Mc Kevitt School of Computing.
Directing FOR STAGE, FILM AND TV. What is the Director?  Director is responsible for integrating all the elements of a production: acting, sets, costumes,
IMSTD:Intelligent Multimedia System for teaching Databases By : NAZLIA OMAR Supervisors: Prof. Paul Mc Kevitt Dr. Paul Hanna School of Computing and Mathematical.
SceneMaker: Automatic Visualisation of Screenplays Eva Hanser Prof. Paul Mc Kevitt Dr. Tom Lunney Dr. Joan Condell School of Computing & Intelligent Systems.
Elements of Drama.
ENTERFACE’08 Multimodal Communication with Robots and Virtual Agents mid-term presentation.
A New Approach to Decision-making within an Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Aiden Mc Caughey, Paul Mc Kevitt.
What is a Storyboard Graphical (visual) representation of the action sequence to create a story Translates the words of the script to images Quite similar.
Literary Genres are a category or certain kind of literature or writing. These categories are identified by examining the characteristics of each piece.
Media Studies: Key Concepts.
Visual Information Retrieval
Motion Picture Language
Web 2.0 Tools GoAnimate For Schools
Presentation transcript:

SceneMaker: Multimodal Visualisation of Natural Language Film Scripts Dr. Minhua Eunice Ma School of Computing & Intelligent Systems Faculty of Computing & Engineering University of Ulster, Magee, Northern Ireland {p.mckevitt, tf.lunney, Eva Hanser Prof. Paul Mc Kevitt Dr. Tom Lunney Dr. Joan Condell School of Computing and Mathematics Faculty of Business, Computing and Law University of Derby, England

PRESENTATION OUTLINE Aims & Objectives Related Projects SceneMaker Design and Implementation Relation to Other Work Conclusion and Future Work

AIMS Automatically generate well-designed and affective virtual scenes from screenplays Realistic visualisation of emotional aspects Multimodal representation with 3D animation, speech, audio and cinematography Enhance believability of virtual actors and scene presentation : AIMS & OBJECTIVES Input: Screen- play SceneMaker System Output: Animation

OBJECTIVES Processing/inferencing emotions and semantic information within story context Common sense, affective and cinematic knowledge ontologies reflecting human cognitive reasoning rules Automatic genre recognition from text Design, implementation and evaluation of SceneMaker : AIMS & OBJECTIVES

Standardized format and language of screenplays Automatic annotation of formal screenplay elements (Jhala 2008) Semantic information on location, timing, props, actors, events, manners, dialogue and camera direction SEMANTIC TEXT PROCESSING : RELATED PROJECTS INT. M.I.T. HALLWAY -- NIGHT Lambeau and Tom come around a corner. His P.O.V. reveals a figure in silhouette blazing through the proof on the chalkboard. There is a mop and a bucket beside him. As Lambeau draws closer, reveal that the figure is Will, in his janitor's uniform. There is a look of intense concentration in his eyes. LAMBEAU Excuse me! WILL Oh, I'm sorry. LAMBEAU What're you doing? WILL (walking away) I'm sorry. Screenplay Extract from ‘Good Will Hunting (1997)’

Emotion recognition from text: keyword spotting, lexical affinity, statistical models, fuzzy logic rules, machine learning, commonsense knowledge, cognitive models XML-based annotations defining visual appearance of animated characters and scenes: BEAT – Behaviour Expression Animation Toolkit (Cassell et al. 2001) MSML – Movie Script Markup Language (Van Rijsselbergen et al. 2009) VISUAL AND EMOTIONAL SCRIPTING : RELATED PROJECTS

Automatic physical transformation and synchronisation of 3D models reflecting emotion Manner influences intensity, scale, force, fluency and timing of an action Multimodal annotated affective video or motion captured data (Gunes and Piccardi 2006) MODELLING AFFECTIVE BEHAVIOUR Personality & Emotion Engine (Su et al. 2007) Greta (Pelachaud 2005) : RELATED PROJECTS

WordsEye – Scene composition (Coyne and Sproat 2001) ScriptViz – Screenplay visualisation (Liu and Leung 2006) CONFUCIUS – Action, speech & scene animation (Ma 2006) CAMEO – Cinematic and genre visualisation (Shim and Kang 2008) VISUALISING 3D SCENES WordsEye CONFUCIUSScriptVizCAMEO : RELATED PROJECTS

Emotional speech synthesis (Schröder 2001) - Prosody rules Music recommendation systems - Categorisation of rhythm, chords, tempo, melody, loudness and tonality - Sad or happy music and genre membership (Cano et al. 2005) - Associations between emotions and music (Kuo et al. 2005) AUDIO GENERATION : RELATED PROJECTS

Context consideration through natural language processing, common sense knowledge and reasoning methods Extract genre and moods from screenplays Influence on all elements of visualisation Enhance naturalism and believability Text-to-animation software prototype, SceneMaker KEY OBJECTIVES : DESIGN AND IMPLEMENTATION

Animatio n Player Script Editor Screen- play Text & Language Processing Text & Language Processing Context Interpretation Context Interpretation Multimedia Generation Multimedia Generation } Genre Emotio n Action } ARCHITECTURE OF SCENEMAKER : DESIGN AND IMPLEMENTATION

SOFTWARE AND TOOLS : DESIGN AND IMPLEMENTATION LVSR (2) Lexical Visual Semantic Representation Script Format Ontology Unity (6) 3D Engine (JavaScript,XML) MSML (5) /SMIL Concept Net (3) Common Sense Knowledge Gate (1) ANNIE Onto-Gazetteer Genre Ontology RDFS/OWL Movie Ontology RDFS/OWL WordNet- Affect (4) Festival (7) Speech Synthesiser Natural Language Processing & Script Segmentation Context + Emotion Reasoning Event Synchronisation 3D Rendering + Multimedia 3D Models (3D Studio Max) Movie Script Automatic Sound & Music Selection (1) (2) Ma 2006 (3) Liu and Singh 2004 (4) Strapparava and Valitutti 2004 (5) Van Rijsselbergen et al (6) (7)

Evaluating 4 aspects of SceneMaker: EVALUATION OF SCENEMAKER AspectEvaluation Correctness of screenplay analysis & visual interpretation Hand-animating scenes Effectiveness of output scenes Existing feature film scenes Suitability for genre typeScenes of unknown scripts categorised by readers Functionality of interfaceTesting with drama students and directors : DESIGN AND IMPLEMENTATION

RELATION TO OTHER WORK

CONCLUSION AND FUTURE WORK Automatic expressive multi-media animation of screenplays Focus on: – automatic reasoning about story context and emotional interpretation – based on world knowledge and context memory – emotions influencing scene compositions and event execution – scene direction refined by genre-specifics Analysis of script format to access semantic information Automatic genre specification from script Heightened expressiveness, naturalness and artistic quality Assist directors, actors, drama students, script writers Future work: Implementation & Testing of SceneMaker

Thank you. QUESTIONS OR COMMENTS ?