Trust and Communications in HSCB Simulations
LT Shawn Pollock
Advisor: Dr. Chris Darken
Co-Advisor: LTC Jon Alt


ABSTRACT Trust plays a critical role in communications, strength of relationships, and information processing at the individual and group level. Cognitive social simulations show promise in providing an experimental platform for the examination of social phenomena such as trust formation. This work is a novel attempt at trust representation in a cognitive social simulation using reinforcement learning algorithms. Initial algorithm development was completed in a standalone social network simulation centered on a public commodity game. Following this testing, the algorithm was imported into the Cultural Geography model for large-scale test and evaluation.

CENTRAL RESEARCH QUESTION: What is the best method for modeling trust in HSCB simulations?

Breaking the central research question down leads to the following:
What is trust? In other words, how can we define trust concisely enough to develop a working computer model of it?
How does trust impact communications within a social network? More to the point, how can we tune our trust algorithm to produce the effects of trust as actually observed in real social networks?
To facilitate an implementation of trust within a social simulation, what will be the best format for communications between agents?
Once a model of trust has been implemented, what data requirements will there be to populate the model?

What is trust?

Related concepts: Reputation, Familiarity, Opportunity, Respect, Responsibility, Confidence

What is trust? Easy to speak about casually, very difficult to define precisely, and even more difficult to model. Our trust model is mostly concerned with communications, so we have adopted a definition of trust most useful to that end. DEFINITION: Trust is an agent's perception of another agent's adherence to an unspoken social contract, as well as how faithfully the other agent will conform to preconceived actions based on its past actions or perceived characteristics. – This social contract encompasses relationships between that agent and other individuals as well as between that agent and the larger group.

REPUTATION, PREJUDICE, AMBITION → TRUST. The key elements of trust that we have focused on in this implementation are reputation, prejudice, and ambition. Reputation: a measure of an agent's past actions as a predictor of its future actions. Prejudice: a baseline trust based on social similarity (homophily). Ambition: loosely defined to encompass the agent's desire for reward and its willingness to risk interacting with potentially untrustworthy agents. A subtle modification to our definition of trust: our model of trust is based on observable action, not on internal trust alone.
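As a concrete illustration, the three components might be blended into a single trust score. The linear weighting, the attribute-overlap similarity measure, and all parameter values below are assumptions for illustration, not the model's actual equations.

```python
def social_similarity(a, b):
    """Homophily proxy: fraction of demographic attributes two agents share."""
    shared = sum(1 for k in a if k in b and a[k] == b[k])
    return shared / max(len(a), 1)

def trust_estimate(reputation, prejudice, ambition,
                   w_rep=0.5, w_prej=0.3, w_amb=0.2):
    """Blend reputation (history), prejudice (similarity baseline), and
    ambition (risk tolerance) into one trust score in [0, 1]."""
    return w_rep * reputation + w_prej * prejudice + w_amb * ambition

# Two agents sharing 2 of 3 attributes yield a prejudice baseline of 2/3.
alice = {"tribe": "A", "religion": "X", "district": 3}
bob = {"tribe": "A", "religion": "Y", "district": 3}
score = trust_estimate(reputation=0.8,
                       prejudice=social_similarity(alice, bob),
                       ambition=0.5)
print(round(score, 3))  # 0.7
```

Any monotone combination would serve the same purpose; the linear form simply makes the relative influence of each component easy to read off.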

Applying Reinforcement Learning to the Problem
Reinforcement Learning (RL) is a form of machine learning that allows an agent to make informed decisions based on a given state and a selection of possible actions. As the agent takes a series of actions it encounters rewards, which reinforce the state-action pairs in its history. The form of reinforcement used in our algorithm is Q-Learning, in which reinforcements are determined as follows:

Q(s,a) ← Q(s,a) + α(r + γ max_a′ Q(s′,a′) − Q(s,a))

Agents use the Q-value Q(s,a) to determine their best choice among possible actions. There are many methods for choosing among actions given their Q-values. The particular method employed by our algorithm is a softmax or Boltzmann selection technique, in which the probability mass function of a given action is:

P(a|s) = e^(Q(s,a)/t) / Σ_a′ e^(Q(s,a′)/t)

where t is a temperature parameter controlling how greedily the agent exploits high-valued actions.
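The two formulas above can be sketched in code. The dictionary-based Q-table, the state/action encoding, and the parameter values are illustrative assumptions, not the implementation used in the model.

```python
import math

def q_update(Q, s, a, r, s_next, actions, alpha=0.1, gamma=0.9):
    """Q-learning backup: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(Q.get((s_next, a2), 0.0) for a2 in actions)
    old = Q.get((s, a), 0.0)
    Q[(s, a)] = old + alpha * (r + gamma * best_next - old)

def boltzmann_probs(Q, s, actions, t=1.0):
    """Boltzmann (softmax) selection: P(a|s) = exp(Q(s,a)/t) / sum_a' exp(Q(s,a')/t)."""
    weights = [math.exp(Q.get((s, a), 0.0) / t) for a in actions]
    z = sum(weights)
    return {a: w / z for a, w in zip(actions, weights)}

actions = ["increase", "keep", "decrease"]
Q = {}
# A reward of 1.0 nudges the Q-value for ("sender1", "increase") upward.
q_update(Q, "sender1", "increase", r=1.0, s_next="sender1", actions=actions)
print(Q[("sender1", "increase")])            # 0.1
print(boltzmann_probs(Q, "sender1", actions))
```

Note how the temperature t shapes exploration: high t flattens the distribution toward uniform random choice, while low t concentrates probability on the highest-valued action.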

Reinforcement Learning Drives the Trust Algorithm

Algorithm Overview
Incoming Comm: The state is defined by the sender and the subject. The agent chooses to increase, decrease, or keep steady its trust level in the sender. If the sender's new trust level exceeds a minimum value, the information is received and processed, and beliefs are changed based on the new information. Beliefs drive the agent's other actions and have a direct effect on the agent's happiness. Happiness is calculated and used as the reward signal for the trust algorithm.
Outgoing Comm: The state is defined by each possible receiver and the subject (one pass through the algorithm for each receiver). The agent chooses to increase, decrease, or keep steady its trust level in the possible receiver. If the receiver's new trust level exceeds a minimum value, the information is sent, and the receiver's beliefs are changed based on the new information. Beliefs drive the receiver's other actions and have a direct effect on the happiness of the receiver and its neighbors (including the sender). Happiness is calculated and used as the reward signal for the trust algorithm.
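The incoming-communication step can be sketched as follows. The threshold value, step sizes, and dictionary-based agent representation are illustrative assumptions, not the model's actual constants.

```python
TRUST_MIN = 0.5                                   # assumed acceptance threshold
TRUST_MOVES = {"increase": +0.1, "keep": 0.0, "decrease": -0.1}

def receive_comm(agent, sender, subject, info, choose_move):
    """Adjust trust in the sender, then accept the message only if the new
    trust level clears the minimum; return whether it was accepted."""
    state = (sender, subject)                     # state: sender + subject
    move = choose_move(state)                     # RL policy picks the trust move
    agent["trust"][state] = agent["trust"].get(state, 0.5) + TRUST_MOVES[move]
    if agent["trust"][state] >= TRUST_MIN:
        agent["beliefs"][subject] = info          # beliefs change on acceptance
        return True
    return False

agent = {"trust": {}, "beliefs": {}}
accepted = receive_comm(agent, "neighbor", "security", "patrols increased",
                        choose_move=lambda s: "increase")
print(accepted, agent["beliefs"])   # True {'security': 'patrols increased'}
```

The outgoing case is symmetric: the same trust move and threshold test run once per candidate receiver, gating whether the message is sent at all.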

Putting Trust to the Test There are many classic trust games that test human subjects' responses to unique trust scenarios. Our test is a variant of the Public Commodity (PC) game, which has been studied in depth for many years. In real experiments, the semi-stable long-run average contribution is relatively low, but nonzero.
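For reference, one round of a standard public-goods game can be written as follows; the endowment and multiplier values are illustrative, not those used in our experiments.

```python
def play_round(contributions, endowment=10.0, multiplier=1.6):
    """Each agent keeps (endowment - contribution) and receives an equal
    share of the multiplied public pot."""
    pot = sum(contributions) * multiplier
    share = pot / len(contributions)
    return [endowment - c + share for c in contributions]

# Full cooperation beats universal free-riding, but a lone free rider
# still out-earns the contributors: the classic social dilemma.
print(play_round([10, 10, 10, 10]))  # [16.0, 16.0, 16.0, 16.0]
print(play_round([0, 10, 10, 10]))   # [22.0, 12.0, 12.0, 12.0]
```

With a multiplier below the group size, zero contribution is the dominant strategy each round, which is why the low-but-nonzero contributions seen in human experiments are the interesting benchmark for a trust model to reproduce.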

Overview of Testing Results In standalone testing we find that the average contributions to the public pot are predictably low, but nonzero. We also find that when we add a penalty to the reward signal for agents changing their initial beliefs, we see shifting behavior in the contribution levels.

Application to Cultural Geography
The Cultural Geography model is a social simulation, developed in Simkit, in which a small society of autonomous agents interacts with each other, with external agents, and with commodity providers. The agents make decisions about particular courses of action, one of which is the choice to communicate with other agents. The trust algorithm is inserted into this decision as a filter, to aid the agent in choosing whom to communicate with as well as which agents to trust communications from. As agents successfully or unsuccessfully acquire commodities (food, water, gas, etc.) or observe events (IEDs, etc.), they refine their belief structure. The agents' beliefs are represented as Bayesian networks that give the agents issue stances, such as their opinion of the local insurgency or of the coalition forces in their town. The effects of the trust filter can be seen by watching the evolution of the issue stances both with and without the trust filter in place.

CONCLUSION Based on these results, it is clear that reinforcement learning can go a long way toward simulating trust behavior in social simulations, but much work remains. Future work will seek to incorporate a more cognitive approach working in concert with the central reinforcement learning algorithm.