SteerBench: a benchmark suite for evaluating steering behaviors Authors: Singh, Kapadia, Faloutsos, Reinman Presented by: Jessica Siewert.

Slides:



Advertisements
Similar presentations
Summer Computing Workshop. Introduction to Variables Variables are used in every aspect of programming. They are used to store data the programmer needs.
Advertisements

ECE 491 Final Project Multiplayer Card Game Author: Tian Ma Presentation 5/4/2004.
A predictive Collision Avoidance Model for Pedestrian Simulation Author: Ioannis Karamouzas et al. Presented by: Jessica Siewert.
What is a CAT?. Introduction COMPUTER ADAPTIVE TEST + performance task.
Designing an Educational Program Kathy Stewart, MD David Feldstein, MD PCFDP11/13/10.
 Mankyu Sung Scalable, Controllable, Efficient and convincing crowd simulation (2005)  Michael Gleicher “I have a bad case of Academic Attention Deficit.
Crowd Simulation Sai-Keung Wong. Crowd Simulation A process of simulating the movement of a large number of entities or characters. While simulating these.
CUSTOMER NEEDS ELICITATION FOR PRODUCT CUSTOMIZATION Yue Wang Advisor: Prof. Tseng Advanced Manufacturing Institute Hong Kong University of Science and.
ITEC 451 Network Design and Analysis. 2 You will Learn: (1) Specifying performance requirements Evaluating design alternatives Comparing two or more systems.
John S Gero Agents – Agent Simulations AGENT-BASED SIMULATIONS.
Bart van Greevenbroek.  Authors  The Paper  Particle Swarm Optimization  Algorithm used with PSO  Experiment  Assessment  conclusion.
A Combinatorial Fusion Method for Feature Mining Ye Tian, Gary Weiss, D. Frank Hsu, Qiang Ma Fordham University Presented by Gary Weiss.
Real-time crowd motion planning: Scalable Avoidance and Group Behavior (2008) Authors: Yersin, Maïm, Morini, Thalman Presented by: Jessica Siewert.
Constraint-Based Motion Planning for Multiple Agents Luv Kohli COMP November 10, 2003 Progress Update.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
1 Software Testing and Quality Assurance Lecture 30 – Testing Systems.
Retrieval Evaluation. Introduction Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
On Fairness, Optimizing Replica Selection in Data Grids Husni Hamad E. AL-Mistarihi and Chan Huah Yong IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,
Principle of Functional Verification Chapter 1~3 Presenter : Fu-Ching Yang.
Review an existing website Usability in Design. to begin with.. Meeting Organization’s objectives and your Usability goals Meeting User’s Needs Complying.
Video Game Visualization Craig Prince. Outline Examine visualization in video games from 3 perspectives: 1.The Game Spectators 2.The Game Designers 3.The.
Session 6: Writing from Sources Audience: K-5 Teachers.
Curious Characters in Multiuser Games: A Study in Motivated Reinforcement Learning for Creative Behavior Policies * Mary Lou Maher University of Sydney.
Heuristic Search Heuristic - a “rule of thumb” used to help guide search often, something learned experientially and recalled when needed Heuristic Function.
Performance Management Measuring Performance Using Information to Improve Performance.
Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
Ioannis Karamouzas, Roland Geraerts, Mark Overmars Indicative Routes for Path Planning and Crowd Simulation.
2005/05/25 Unified Modeling Lanauage 1 Introduction to Unified Modeling Language (UML) – Part One Ku-Yaw Chang Assistant Professor.
Building Effective Assessments. Agenda  Brief overview of Assess2Know content development  Assessment building pre-planning  Cognitive factors  Building.
Using the Corridor Map Method for Path Planning for a Large Number of Characters Roland Geraerts, Arno Kamphuis, Ioannis Karamouzas, Mark Overmars MIG’08.
Information Visualization DH /02/21 Proposal for Project 1 Title Cool Photo of Student 1 Student 1 Cool Photo of Student 2 Student.
L – Modelling and Simulating Social Systems with MATLAB Lesson 5 – Introduction to agent-based simulations A. Johansson & W. Yu ©
What caught your eye at DEAS 2005? 40 DEAS 2005 Participants.
Robust Space-Time Footsteps for Agent-Based Steering By Glen Berseth 1, Mubbasir Kapadia 2, Petros Faloutsos 3 University of British Columbia 1, Rutgers.
Advanced Decision Architectures Collaborative Technology Alliance A Computational Model of Naturalistic Decision Making and the Science of Simulation Walter.
Evaluation of User Interface Design 4. Predictive Evaluation continued Different kinds of predictive evaluation: 1.Inspection methods 2.Usage simulations.
Confidential. This presentation is provided for the recipient only and cannot be reproduced or shared without Fair Isaac Corporation's express consent.
5/30/20161 Iterative Project Management Chapter 2 – How Do Iterative Projects Function? Part 1 Iterative Project Management / 01 - Iterative and Incremental.
The Evolution of ICT-Based Learning Environments: Which Perspectives for School of the Future? Reporter: Lee Chun-Yi Advisor: Chen Ming-Puu Bottino, R.
Playing GWAP with strategies - using ESP as an example Wen-Yuan Zhu CSIE, NTNU.
Relative Hidden Markov Models Qiang Zhang, Baoxin Li Arizona State University.
Adrian Treuille, Seth Cooper, Zoran Popović 2006 Walter Kerrebijn
Towards a Method For Evaluating Naturalness in Conversational Dialog Systems Victor Hung, Miguel Elvir, Avelino Gonzalez & Ronald DeMara Intelligent Systems.
MODES-650 Advanced System Simulation Presented by Olgun Karademirci VERIFICATION AND VALIDATION OF SIMULATION MODELS.
Crowds (and research in animation and games) CSE 3541 Matt Boggus.
Using Social Network Analysis Methods for the Prediction of Faulty Components Gholamreza Safi.
A Roadmap towards Machine Intelligence
Lucent Technologies - Proprietary 1 Interactive Pattern Discovery with Mirage Mirage uses exploratory visualization, intuitive graphical operations to.
INFOMGP Student names and numbers Papers’ references Title.
WHAT ARE PLANS FOR? Philip E. Agre David Chapman October 1989 CS 790X ROBOTICS Presentation by Tamer Uz.
Current research in Intelligence Agents Victor Govindaswamy.
Characterizing and Optimizing Game Level Difficulty Glen Berseth 1, M. Brandon Haworth 2, Mubbasir Kapadia 3, Petros Faloutsos 2 1 University of British.
Constraint-Based Motion Planning for Multiple Agents Luv Kohli COMP259 March 5, 2003.
Engineering the POlicy-making LIfe CYcle Dr Tina Balke.
Investigate Plan Design Create Evaluate (Test it to objective evaluation at each stage of the design cycle) state – describe - explain the problem some.
Revising Your Paper Paul Lewis With thanks to Mark Weal.
Crowds (and research in computer animation and games)
Lecture 09: Interaction pt. 2: Cost
Bob Wray, Charles Newton, Victor Hung, Norb Timpko 7 Jun 2017
Modelling and Simulating Social Systems with MATLAB
What is a CAT? What is a CAT?.
PostGraduate Research Excellence Symposium (PGReS) 2015
ITEC 451 Network Design and Analysis
Indicative Routes for Path Planning and Crowd Simulation
Cross-Layer Optimization for State Update in Mobile Gaming
Crowd Simulation (INFOMCRWS) - Introduction to Crowd Simulation
Crowds (and research in computer animation and games)
Workshop II UU Crowd Simulation Framework
Robot Motion Planning Project
Presentation transcript:

SteerBench: a benchmark suite for evaluating steering behaviors Authors: Singh, Kapadia, Faloutsos, Reinman Presented by: Jessica Siewert

Content of presentation Introduction Previous work The Method Assessment

Introduction – Context and motivation – Steering of agents – Objective comparison – Standard? – Test cases and scoring, user evaluation – Metric scoring – Demonstration

Introduction – Previous work There is not really anything like it yet (Nov ‘08)

Introduction - Promises Evaluate objectively Help researchers Working towards a standard for evaluation Take into account: – Cognitive decisions – Situation-specific aspects

The test cases – Simple validation scenarios – Basic one – on – one interactions – Agent interactions including obstacles – Group interactions – Large-scale scenarios

The user’s opinion Rank on overal score across test cases (comparing) Rank algorithms based on – a single case, or – one agent’s behavior Pass/fail Visually inspect results Examine detailed metrics of the performance

The metric Number of collisions Time efficiency Effort efficiency Penalties?

Movies…

Developments since then Ioannis Karamouzas, Peter Heil, Pascal Beek, Mark H. Overmars, A Predictive Collision Avoidance Model for Pedestrian Simulation, Proceedings of the 2nd International Workshop on Motion in Games, November 21-24, 2009, Zeist, The Netherlands Ioannis Karamouzas, Peter Heil, Pascal Beek, Mark H. Overmars, A Predictive Collision Avoidance Model for Pedestrian Simulation, Proceedings of the 2nd International Workshop on Motion in Games, November 21-24, 2009, Zeist, The Netherlands Shawn Singh, Mubbasir Kapadia, Billy Hewlett, Glenn Reinman, Petros Faloutsos, A modular framework for adaptive agent-based steering, Symposium on Interactive 3D Graphics and Games, February 18-20, 2011, San Francisco, California Shawn Singh, Mubbasir Kapadia, Billy Hewlett, Glenn Reinman, Petros Faloutsos, A modular framework for adaptive agent-based steering, Symposium on Interactive 3D Graphics and Games, February 18-20, 2011, San Francisco, California Suiping Zhou, Dan Chen, Wentong Cai, Linbo Luo, Malcolm Yoke Hean Low, Feng Tian, Victor Su-Han Tay, Darren Wee Sze Ong, Benjamin D. Hamilton, Crowd modeling and simulation technologies, ACM Transactions on Modeling and Computer Simulation (TOMACS), v.20 n.4, p.1-35, October 2010 Suiping Zhou, Dan Chen, Wentong Cai, Linbo Luo, Malcolm Yoke Hean Low, Feng Tian, Victor Su-Han Tay, Darren Wee Sze Ong, Benjamin D. Hamilton, Crowd modeling and simulation technologies, ACM Transactions on Modeling and Computer Simulation (TOMACS), v.20 n.4, p.1-35, October 2010

Experiments – Claim recall Evaluate objectively Help researchers Working towards a standard for evaluation

Assessment – good things All the measured variables seem logical (Too?) Extensive variable set, with option to expand Customized evaluation Cheating not allowed – collision penalties – fail constraint – goal constraint Layered set of test cases

Assessment The measurements all seem to be approximately the same User test makes the difference? Who are these users? Examine, inspect, all vage terms What about the objective of objectiveness?

Assessment How good is it to be general How general/specific is this method? Time efficiency VS. Effort efficiency Should it be blind for the algorithm itself? Penalties, fail and goal constraints not specified!

Assessment – scoring(1/2) The test cases are clearly specified. But it is not specified HOW a GOOD agent SHOULD react, though they say there is such a specification How can you get cognitive decisions out of only position, direction and a goal?

Assessment – scoring(2/2) “Scoring not intended to be a proof of an algorithm’s effectiveness.” How do you interpreted scores and who wins? – “B is slightly better on average, but A has the highest scores.”

Assessment – final questions Can this method become a standard? What if someone claims to be so innovative this standard does not apply to them? Nice first try, though! Getty images