Presentation is loading. Please wait.

Presentation is loading. Please wait.

Interacting with Virtual Humans Zsófia Ruttkay Associate Professor Human Media Interaction, Dept. of CS, University of Twente, NL

Similar presentations


Presentation on theme: "Interacting with Virtual Humans Zsófia Ruttkay Associate Professor Human Media Interaction, Dept. of CS, University of Twente, NL"— Presentation transcript:

1 Interacting with Virtual Humans Zsófia Ruttkay Associate Professor Human Media Interaction, Dept. of CS, University of Twente, NL http://hmi.ewi.utwente.nl/~zsofi/ SSIP 2009 Debrecen http://hmi.ewi.utwente.nl/~zsofi/ zsofi@cs.utwente.nl

2 2 Overview of the talk FAAnalysis of facial expressions of humans and cartoons Emotion Disc Challenges for IP Non-photorealistic facial animation - CharToon Full body Control with markup – GESTYLE Current work - conductor Intro About Virtual Humans Evaluation of personality cues

3 3 About Virtual Humans I More or less human-like synthetic characters to interact with usersMore or less human-like synthetic characters to interact with users In roles as: - information provider - tutor - sales assistant - ‘translator’ (sign language) - avatar in telepresence and VR - entertainment, games - also medium for experimentsIn roles as: - information provider - tutor - sales assistant - ‘translator’ (sign language) - avatar in telepresence and VR - entertainment, games - also medium for experiments Research since 15 years, recently own events: IVA, Gesture WS, AAMAS workshop seriesResearch since 15 years, recently own events: IVA, Gesture WS, AAMAS workshop series Book by J. Cassell et al. (2000, MIT Press)Book by J. Cassell et al. (2000, MIT Press)

4 4Examples Rea from MIT, Cassell et al. Steve from USC J. Rickel, L. Johnson et al.

5 5Examples OLGA – KTH

6 6Examples Max – Univ. of Bielefeld

7 7Examples Mission Exercise, USC ISI + ICT

8 8Examples Carmen’s Bright Ideas from USC ICT, Marsella et al.

9 9Examples USC – ISI, Adele, Shaw et al.

10 10Examples Let’s Face It! –M. Bartlett et al. Univ. San Diego, 2008.

11 11 Virtual conductor Virtual Conductor HMI, Univ. of Twente, The Netherlands

12 12 Virtual scientific institute Meta Institute for Computational Astrophysics Piet Hut http://www.ugotrade.com/2008/07/19/astrophysics-in-virtual-worlds-implementing-n- body-simulations-in-opensim/

13 13 Robots Hiroshi Ishiguro ATR Intelligent Robotics and Communications Laboratories http://www.irc.atr.jp/Geminoid/

14 14 Geminoids Geminoid by I. Ishiguro at ATR, Japan http://www.irc.atr.jp/Geminoid/

15 15 decision (what to do) About VHs - general architecture perception (input) action (output) nl processing (meta) speech understanding face interpretation gesture recognition environment monitoring (object, resources, situation) multimodal output related to environment user model affect model discourse model domain knowledge goals … real-time suspension of disbelief ‘good’ for …

16 16 About VHs – Design parameters Embodiment: head – torzo - fullEmbodiment: head – torzo - full ‘Look’: gender, age, profession, personality‘Look’: gender, age, profession, personality Design: realistic/not, 2d/3dDesign: realistic/not, 2d/3d Communication channels & directions speech, facial expressions, gestures, body languageCommunication channels & directions speech, facial expressions, gestures, body language What is to be expressed: emotional, cognitive, phys. state references to changing surrounding (incl. user)What is to be expressed: emotional, cognitive, phys. state references to changing surrounding (incl. user) Role: monologue – dialogue, relation to user: gender, cluture, socal statusRole: monologue – dialogue, relation to user: gender, cluture, socal status

17 17 About VHs – Hierarchy of behavior independent behavior independent behavior goal-oriented behavior goal-oriented behavior personality, emotions personality, emotions high-level control high-level control bidirectional mm comm. bidirectional mm comm. multimodal output multimodal output single modal output single modal output rendering, TTS,... rendering, TTS,...

18 18 About VHs - development cycle analysis of H-H interaction computational model implementation of VA evaluation of VA non-realism: animation, art What? How? realtime adoptable micro level in application mind body

19 19 About VHs – Some issues Mind and body issueMind and body issue Why to use VHs? - ease of use: broader user group, less load - more effect - more funWhy to use VHs? - ease of use: broader user group, less load - more effect - more fun Moral and legal issues: clones, manipulationMoral and legal issues: clones, manipulation To mimic humans, or less is more?To mimic humans, or less is more? Interdisciplinary researchInterdisciplinary research

20 20 About VHs – Do people take them? Illusion – do people believe, react to as if they were real?Illusion – do people believe, react to as if they were real? YES Clifford Nass: CASA paradigm Computers Are Social ActorsYES Clifford Nass: CASA paradigm Computers Are Social Actors Series of experiments on effect of micro and macro designSeries of experiments on effect of micro and macro design User’s characteristics matterUser’s characteristics matter

21 21 Overview of the talk FAAnalysis of facial expressions of humans and cartoons Emotion Disc Challenges for IP Non-photorealistic facial animation - CharToon Full body Control with markup – GESTYLE Current work - conductor Intro About Virtual Humans Evaluation of personality cues

22 22 Motivations for FA research Need for: expressive, animated synthetic faces Need for: expressive, animated synthetic facesexpressive Functions of facial (and other) nonverbal signals: - emotions - cognitive state - speech punctuation - dialog control - additional/redundant info (size, location, …) - personality and culture of speaker Functions of facial (and other) nonverbal signals: - emotions - cognitive state - speech punctuation - dialog control - additional/redundant info (size, location, …) - personality and culture of speaker All contribute to: comprehension, memory, effectivity … motivation, joy, engagement All contribute to: comprehension, memory, effectivity … motivation, joy, engagement

23 23 Questions concerning FA Which expressions to consider? Which expressions to consider? What are those expressions like? What are those expressions like? - static and dynamical characteristics - generic and individual characteristics What facial model to use? - (non-)photorealism - beyond realism/simplification of motions? What facial model to use? - (non-)photorealism - beyond realism/simplification of motions? Tool to make synthetic face models and animate them Tool to make synthetic face models and animate them

24 24 Facial anatomy about 44 pairs of musclesabout 44 pairs of muscles –bundle –sheet –circular physiological data missingphysiological data missing coarticulationcoarticulation many parametersmany parameters nonlinear effectsnonlinear effects

25 25 Psychology of facial expressions “meaningful” – physically possible - impossible“meaningful” – physically possible - impossible biological need/content/emotional state/discourse regulator/…biological need/content/emotional state/discourse regulator/… muscle activity – emotion sensing in brainmuscle activity – emotion sensing in brain difference between real and ‘on demand’ expressions data collection must be done with care: - spontaneous/stimulated expression - natural setting (e.g. in news) - real people vs. actor?difference between real and ‘on demand’ expressions data collection must be done with care: - spontaneous/stimulated expression - natural setting (e.g. in news) - real people vs. actor?

26 26 History …. 1882: Darwin: The Expression of the Emotions in Man and Animals1882: Darwin: The Expression of the Emotions in Man and Animals 1862: Duchenne: photos of expressions induced by electric signals1862: Duchenne: photos of expressions induced by electric signals

27 27

28 28 History: coding of facial expressions 1970 -...: Paul Ekman (U. of California)1970 -...: Paul Ekman (U. of California) 1989: 6 general emotional expressions : happiness, surprise, sadness, fear, disgust, anger1989: 6 general emotional expressions : happiness, surprise, sadness, fear, disgust, anger FACS [1978] coding system – still preferred in USFACS [1978] coding system – still preferred in US discretediscrete visual effectof (groups of) musclesvisual effectof (groups of) muscles

29 29 Coding of facial expressions: MPEG4 1999: ISO standard1999: ISO standard widely usedwidely used 68 normalised parameters (FAPs)68 normalised parameters (FAPs) continuouscontinuous black points x, y, (z) displblack points x, y, (z) displ min, max, neutral valuemin, max, neutral value

30 30 Emberi arckifejezések leirása: MPEG4 # FAP name FAP description Unit# FAP name FAP description Unit 31 raise_l_i_eyebrow Vertical displacement of left inner eyebrow ENS31 raise_l_i_eyebrow Vertical displacement of left inner eyebrow ENS 32 raise_r_i_eyebrow Vertical displacement of right inner eyebrow ENS32 raise_r_i_eyebrow Vertical displacement of right inner eyebrow ENS 33 raise_l_m_eyebrowVertical displacement of left middle eyebrow ENS33 raise_l_m_eyebrowVertical displacement of left middle eyebrow ENS 34 raise_r_m_eyebrow Vertical displacement of right middle eyebrow ENS34 raise_r_m_eyebrow Vertical displacement of right middle eyebrow ENS 35 raise_l_o_eyebrow Vertical displacement of left outer eyebrow ENS35 raise_l_o_eyebrow Vertical displacement of left outer eyebrow ENS 36 raise_r_o_eyebrow Vertical displacement of right outer eyebrow ENS36 raise_r_o_eyebrow Vertical displacement of right outer eyebrow ENS 37 squeeze_l_eyebrow Horizontal displacement of left eyebrow ES37 squeeze_l_eyebrow Horizontal displacement of left eyebrow ES

31 31 Analysis of facial expressions – own investigation 2002 On demand expressions of untrained subjects Track 15 FAPs marked on face Track 15 FAPs marked on face

32 32 The E-Space: PCA

33 33 The E-Space: 2 components’ graph (tracked) Hor: eyebrows Vert: mouth corners

34 34 The E-Space: CharToon data

35 35 The E-Space: CharToon data Experienced animatorExperienced animator 59 expression stills:59 expression stills: –Anger: Be_Careful_What_You_Say –Smile: Absolute_Joy, Adoration, Smile –Other: Doubt_suspicion

36 36 The E-Space: Graph (cartoon) Hor: eyebrowsHor: eyebrows Vert: stretch mouthVert: stretch mouth 3rd: smile/squeeze3rd: smile/squeeze (all a-symmetric)

37 37 The E-Space: Conclusion (stills) Tracker FAPs not always sufficient to distinguish (negative) emotionsTracker FAPs not always sufficient to distinguish (negative) emotions More symmetry in tracked expressionsMore symmetry in tracked expressions Animator uses more of the expression spaceAnimator uses more of the expression space smiling shape of the mouth and raising of the eyebrows are most significantsmiling shape of the mouth and raising of the eyebrows are most significant

38 38 Time curves of FAPs

39 39 Actuation of muscles during smile and anger (I. Essa, PhD at MIT, 1995)

40 40 The E-Space: Time curves

41 41 Co-actuation of muscles during smile (I. Essa, PhD at MIT, 1995)

42 42 Facial expression analysis - dynamism Time IS characteristic and importantTime IS characteristic and important Trapezoid activation function, variants (camel, …)Trapezoid activation function, variants (camel, …) Still open questions: shapes, constraints, …Still open questions: shapes, constraints, … No data(base) published (MS either)No data(base) published (MS either)

43 43 STAR on facial expression tracking Near real-time tracking of faps Other than basic 6 expressions Mixture, masking, cultural variants Dynamism of articulation Spontaneous expressions Bartlett, M.S., Littlewort, G.C., Frank, M.G., Lainscsek,C., Fasel, I., Movellan, J.R. (2006). Automatic Recognition of Facial Actions in Spontaneous Expressions. Journal of Multimedia 1(6) p. 22-35.Bartlett, M.S., Littlewort, G.C., Frank, M.G., Lainscsek,C., Fasel, I., Movellan, J.R. (2006). Automatic Recognition of Facial Actions in Spontaneous Expressions. Journal of Multimedia 1(6) p. 22-35. Cohn, J. F. & Ekman, P. Methods for measuring facial actions. In J.A. Harrigan, R. Rosenthal, & K. Scherer (Eds.), Handbook of nonverbal behavior research methods in the affective sciences. NY: Oxford.Cohn, J. F. & Ekman, P. Methods for measuring facial actions. In J.A. Harrigan, R. Rosenthal, & K. Scherer (Eds.), Handbook of nonverbal behavior research methods in the affective sciences. NY: Oxford. Tian, Y.L., Kanade, T., & Cohn, J.F. Facial expression analysis. In S.Z. Li & A.K. Jain (Eds.), Handbook of face recognition. NY: Springer.Tian, Y.L., Kanade, T., & Cohn, J.F. Facial expression analysis. In S.Z. Li & A.K. Jain (Eds.), Handbook of face recognition. NY: Springer. KTH, Sweeden – cheap and light hw for facial trackingKTH, Sweeden – cheap and light hw for facial tracking

44 Analysis of Mona Lisa smile: 83% disgust: 9% fear: 6% anger: 2% result of computer facial expression analysis UvA + Univ. of Illionis, Dec. 2005 What do the numbers mean? How were they obtained? Would it mean the same in the 1500s?

45 45 Overview of the talk FAAnalysis of facial expressions of humans and cartoons Emotion Disc Challenges for IP Non-photorealistic facial animation - CharToon Full body Control with markup – GESTYLE Current work - conductor Intro About Virtual Humans Evaluation of personality cues

46 46 Facial animation – model based Basic principle: Deformation principles – 2d skeleton, 3d real/virtual muscle, …Deformation principles – 2d skeleton, 3d real/virtual muscle, … Key positions: values of controllersKey positions: values of controllers Inbetweens by interpolationInbetweens by interpolationIssues: Reusable, parametrised repertoireReusable, parametrised repertoire BlendingBlending Superposition (talking while smiling)Superposition (talking while smiling) Visual speech - coarticulationVisual speech - coarticulation

47 47 Facial animation how to interpolate: linear, C 1, C 2how to interpolate: linear, C 1, C 2 how to choose t i (times for keyframes)how to choose t i (times for keyframes) t1t1 t2t2 t3t3 t4t4 t5t5 toto

48 48 3D model

49 49 3d facial modelling (CWI)

50 50 3d facial animation

51 51 MPEG4 defined animation for different faces

52 52 Visual speech

53 53Lip-sync

54 54 Some 2d head models

55 55 Some movies made with CharToon by A. Lelievre and Zs. Paal Lily Frans Nine faces Magician

56 56 Personality and culture in facial design

57 57 3D realistic3D realistic vs. 2D cartoon faces 2D cartoon 3D realistic2D cartoon

58 58 Component repertoire Unlimited freedom design faces from CharToon components design faces from CharToon components finetune dynamism finetune dynamism Repertoire of: facial feature components facial feature components (still) expressions and animations (still) expressions and animations Support to reuse the components: technical level (design, UI) technical level (design, UI) ‘mix & match’ recipes with respect to expected functionality ‘mix & match’ recipes with respect to expected functionality

59 59 Animating faces in CharToon I

60 60 Animating faces in CharToon II

61 61 Animating faces in CharToon III + constraints technology

62 62 Emotion Disc

63 63 Emotion Disc

64 64 Emotion Disc Cyclic neighborhood of 6 basic emotionsCyclic neighborhood of 6 basic emotionsCyclic neighborhood Cyclic neighborhood Tool to: - explore the expression space - control the expressionsTool to: - explore the expression space - control the expressions Experiments with expressive 2D facesExperiments with expressive 2D faces -how expressive they are? - contradicting expressions (2-3 discs) -how expressive they are? - contradicting expressions (2-3 discs)

65 65 From: Harold Schlosberg: The Description of Facial Expressions in Terms of Two Dimensions, Journal of Experimental Psychology, Vol. 44. No. 4. Oct. 1952, pp 229-237.

66 66 Overview of the talk FAAnalysis of facial expressions of humans and cartoons Emotion Disc Challenges for IP Non-photorealistic facial animation - CharToon Full body Control with markup – GESTYLE Current work - conductor Intro About Virtual Humans Evaluation of personality cues

67 67 Evaluation of talking heads Work with E. Krahmer at als.Work with E. Krahmer at als. Effect of facial signals to express: - emphasis --- eyebrow & pitch - personality --- eyebrow, gaze and speechEffect of facial signals to express: - emphasis --- eyebrow & pitch - personality --- eyebrow, gaze and speech See publications, experimental mat. on web pageSee publications, experimental mat. on web page

68 68 The evaluation research questions Do facial cues contribute to the perception of extraversion?Do facial cues contribute to the perception of extraversion? How do gaze + brows + speech cues add up?How do gaze + brows + speech cues add up? –inconsistency

69 69 Personality for VHs Personality is important:Personality is important: –to improve comprehension (Laurel 1993) –to like and trust and … the VA better (Nass and Lee 2000, Isbister and Nass 2000, …) –people attribute personality to agents CASA paradigm (Reeves and Nass 1996) –fun, variety

70 70 Personality for VHs Trait theory (e.g. Allport 1937)Trait theory (e.g. Allport 1937) “Big 5” (e.g. Wiggins 1997): Extroversion (dominance), Agreeableness, Conscientiousness, Emotional stability, Culture“Big 5” (e.g. Wiggins 1997): Extroversion (dominance), Agreeableness, Conscientiousness, Emotional stability, Culture Cues studied so far:Cues studied so far: –speech (pitch and duration) –gaze –posture (Isbister and Nass 2000) –language

71 71 The experiment GazeSpeechBrows Extrovert fixed on user 2 blink wide pitch range more variation higher tempo 2x accent 3x100 ms stages each Introvert “move S-W look aside move center” 2x low pitch range little variation 20% slower no movement 2 x 2 x 2 = 8 personality profiles

72 72 The experiment Examples of stimuli from http://www.cwi.nl/~zsofi/stim.htmlExamples of stimuli from http://www.cwi.nl/~zsofi/stim.html http://www.cwi.nl/~zsofi/stim.html is_ig_ib es_eg_ib is_ig_ib

73 73 Evaluation of VHs Hot topic, evaluation is a MUSTHot topic, evaluation is a MUST Zs. Ruttkay, C. Pelachaud: From Brows to Trust (Kluwer, 2004)Zs. Ruttkay, C. Pelachaud: From Brows to Trust (Kluwer, 2004)

74 74 Overview of the talk FAAnalysis of facial expressions of humans and cartoons Emotion Disc Challenges for IP Non-photorealistic facial animation - CharToon Full body Control with markup – GESTYLE Current work - conductor Intro About Virtual Humans Evaluation of personality cues

75 75 Research questions on gestures What gestures do people use?What gestures do people use? - references to environment & user (location, size, time, motion, shape,…) - to indicate actions and concepts - emotions, cognitive and physical state - dialog status What gestures should an VA use?What gestures should an VA use? - function - application domain - redundancy - expressivity - realization – non-repetitive, tuned - realization – non-repetitive, tuned Modelling and sw engineering issues: motion characteristics, performance, reusability, …Modelling and sw engineering issues: motion characteristics, performance, reusability, … As humans? YES, but …

76 76 Communicative functions emphasize take turn/ start talking a) raise voice b) look at listener c) raise eyebrow d) beat with hand e) 11 (?) combinations a) look at partner b) raise hand c) voice signal (hm- hm) d) … Multi-modal signals

77 77 Why to bother about style? Style is source of information on:Style is source of information on: –gender, age, physical state –personality, emotional state –ethnicity, profession –relationship between speaker and listener –situation (public/private, …) … variety and joyvariety and joy does matter in HCI (Nass et al. 2000, …)does matter in HCI (Nass et al. 2000, …)

78 78 Our objectives VA with style in nonverbal communication (and speech)VA with style in nonverbal communication (and speech) Declare style of VADeclare style of VA Generate styled conversation by VHsGenerate styled conversation by VHs Framework (can be tailored/extended)Framework (can be tailored/extended) Easy to use mark-up languageEasy to use mark-up language Experiments!Experiments!

79 79 GESTYLE: the big picture Style Declaration Meaning Tags Combine Style Dictionaries into a single one Map Meanings to Gesture Expressions Text with Gesture Expressions (relative time) Expand to Basic Gestures with absolute timing Text with Basic Gesture Tags (abs. time) Combined Style Dictionary Timing of speech Generate animation in rendering system Gesture Dictionary Text with Modifier Tags Modify Combined Style Dictionary Modified Combined Style Dictionary Style Dictionary 1 Meaning mapping 1 … Meaning mapping m 1 Style Dictionary 2 Meaning mapping 1 2 … Meaning mapping m 2 … Style Dictionary n Meaning mapping 1 n … Meaning mapping m n Modifier Tags input text with GESTYLE markups

80 80 GESTYLE: Style dictionary <GestureSpec> </GestureSpec><GestureSpec> </GestureSpec></Meaning>…

81 81 Definition and generation of gestures structural characteristics:structural characteristics: composed of basic gestures by par, seq and repeat operators performance (manner) characteristics:performance (manner) characteristics: intensity, duration, start_time, precision, jerkyness, symmetry, … parameters

82 82 GESTYLE: Combining styles Mechanism to resolve conflicting prescriptions on:Mechanism to resolve conflicting prescriptions on: –gestures –modalities –manner Preferences to be given in Style DeclarationPreferences to be given in Style Declaration Dynamical modifications of:Dynamical modifications of: –modality usage –manner –style dominance

83 83 GESTYLE: Gesture repertoire <GestureRepertoire>... </GestureRepertoire>

84 84 Two examles Hamlet: CharToon 2DHamlet: CharToon 2D –face, eyes, hands –.avi Yt: 3D H-anim humanoid for the WebYt: 3D H-anim humanoid for the Web –hand gestures only –real-time, on the fly generation of animation Speech by MindMaker’s FlexVoice TTS, styled too for Hamlet!Speech by MindMaker’s FlexVoice TTS, styled too for Hamlet!

85 85Hamlet HamletMarkUp1.xml HamletMarkUp2.xml StyleDictionaries.xml

86 86Yt hierarchical articulated body, 54 DOF of hand joints modular gesture definition written in STEP (VU)emphasisenumeratesorry_forgreeting

87 87 Cognition and action The Functional Behaviour Markup Languages initiative Intent Planning Behavior Planning Behavior Realization Behavior Repository (BML) FMLBML

88 88 What is BML? - Abstraction gesture head speech posture gaze facialexpression locomotion viseme eyes BML = “complex/multimodal” BML ≠ “simple/unimodal” (that’s at the level of body) torso legs Coordination

89 89 What is BML? - Example This is an example...

90 90 Work on BML On-line wiki, publication at IVA07 Join the small and active community IVA09 conference 14-16 Sept 2009 iva09.dfki.de BML ws: 13 Sept 2009

91 91 Overview of the talk FAAnalysis of facial expressions of humans and cartoons Emotion Disc Challenges for IP Non-photorealistic facial animation - CharToon Full body Control with markup – GESTYLE Current work - conductor Intro About Virtual Humans Evaluation of personality cues

92 92 Conducting with parametrised kinematic hand motion H. van Welbergen, HMI, UT work in progress

93 93 Conducting – body motion with physical model H. van Welbergen, HMI, UT work in progress

94 94 Fat/thin conductor H. van Welbergen, HMI, UT work in progress

95 95 Conductors 30 in real-time H. van Welbergen, HMI, UT work in progress

96 96 Image morphing - automatic identification of feature points - interpolation - per region (e.g. lip sync) - db/analysis of faces - combination of morphing and 3d transformations Challenges for IP Face and gesture recognition - elicitation of characteristics for modeling (off-line) - real-time recognition of conveyed meaning - user modeling for interaction control, on-line - user analysis to learn about how they react to VHs - multimodal recognition

97 97 Challenges for IP Image analysis and synthesis -NPAR (e.g. line drawing) from photos - exaggeration - ageing, tiredness and other effects Texturing - LOD

98 98 zsofi@cs.utwente.nl

99 99 From: J. Walker, L. Sproull, R. Subramani: Using a Human Face in an Interface, CHI’94, pp 85-91.


Download ppt "Interacting with Virtual Humans Zsófia Ruttkay Associate Professor Human Media Interaction, Dept. of CS, University of Twente, NL"

Similar presentations


Ads by Google