1 Decision Support Algorithms for Port of Entry Inspection Fred S. Roberts DIMACS Center, Rutgers University In collaboration with Los Alamos National.

Slides:



Advertisements
Similar presentations
On the Optimality of Probability Estimation by Random Decision Trees Wei Fan IBM T.J.Watson.
Advertisements

Naïve Bayes. Bayesian Reasoning Bayesian reasoning provides a probabilistic approach to inference. It is based on the assumption that the quantities of.
A Hierarchical Multiple Target Tracking Algorithm for Sensor Networks Songhwai Oh and Shankar Sastry EECS, Berkeley Nest Retreat, Jan
A. Darwiche Sensitivity Analysis in Bayesian Networks Adnan Darwiche Computer Science Department
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
Huffman code and ID3 Prof. Sin-Min Lee Department of Computer Science.
Planning under Uncertainty
1 Homeland Security: What can Mathematics Do? Examples from Work at CCICADA.
1 Algorithms for Port of Entry Inspection: Finding Optimal Binary Decision Trees Fred S. Roberts Rutgers University.
1 Sometimes it Pays to be Greedy: Greedy Algorithms in Economic Epidemiology Fred Roberts, DIMACS.
1 Algorithms for Port of Entry Inspection for WMDs Fred S. Roberts DyDAn Center Rutgers University.
1 Algorithms for Port of Entry Inspection for WMDs Fred S. Roberts DIMACS Center, Rutgers University.
1 Algorithms for Port of Entry Inspection: Finding Optimal Binary Decision Trees Fred S. Roberts Rutgers University.
1 Algorithms for Port of Entry Inspection for WMDs Fred S. Roberts DyDAn Center Rutgers University.
Math443/543 Mathematical Modeling and Optimization
1 Decision Support for Port of Entry Inspection Fred S. Roberts DyDAn Center Rutgers University.
1 Algorithms for Port of Entry Inspection for WMDs Fred S. Roberts DIMACS Center, Rutgers University Philip D. Stroud Los Alamos National Laboratory.
LEARNING DECISION TREES
Cost-Based Plan Selection Choosing an Order for Joins Chapter 16.5 and16.6 by:- Vikas Vittal Rao ID: 124/227 Chiu Luk ID: 210.
1 Branch and Bound Searching Strategies 2 Branch-and-bound strategy 2 mechanisms: A mechanism to generate branches A mechanism to generate a bound so.
Adapted by Doug Downey from Machine Learning EECS 349, Bryan Pardo Machine Learning Clustering.
ICS 273A Intro Machine Learning
1 Port Resilience Fred S. Roberts Director of CCICADA Rutgers University.
Generalized Inspection Policies Ken Zhao Adv: Endre Boros.
Why Model? Fred S. Roberts Department of Mathematics and DIMACS (Center for Discrete Mathematics and Theoretical Computer Science) Rutgers University Piscataway,
Today Logistic Regression Decision Trees Redux Graphical Models
How to Fuse Independent Sensors Fast? (Nuclear Detection) Endre Boros MSIS & RUTCOR, Rutgers University Joint work with Noam Goldberg,
1 Boolean Functions I’ve Known and Not Known: Memories of Peter L. Hammer Fred Roberts Rutgers University.
Package Transportation Scheduling Albert Lee Robert Z. Lee.
Data Selection In Ad-Hoc Wireless Sensor Networks Olawoye Oyeyele 11/24/2003.
Approximation Algorithms Pages ADVANCED TOPICS IN COMPLEXITY THEORY.
LEARNING DECISION TREES Yılmaz KILIÇASLAN. Definition - I Decision tree induction is one of the simplest, and yet most successful forms of learning algorithm.
Inferring Decision Trees Using the Minimum Description Length Principle J. R. Quinlan and R. L. Rivest Information and Computation 80, , 1989.
Constraint Satisfaction Problems (CSPs) CPSC 322 – CSP 1 Poole & Mackworth textbook: Sections § Lecturer: Alan Mackworth September 28, 2012.
Universit at Dortmund, LS VIII
Minimum Average Routing Path Clustering Problem in Multi-hop 2-D Underwater Sensor Networks Presented By Donghyun Kim Data Communication and Data Management.
1 Learning Chapter 18 and Parts of Chapter 20 AI systems are complex and may have many parameters. It is impractical and often impossible to encode all.
Computational Intelligence: Methods and Applications Lecture 12 Bayesian decisions: foundation of learning Włodzisław Duch Dept. of Informatics, UMK Google:
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Mobile Agent Migration Problem Yingyue Xu. Energy efficiency requirement of sensor networks Mobile agent computing paradigm Data fusion, distributed processing.
CS690L Data Mining: Classification
On optimal quantization rules for some sequential decision problems by X. Nguyen, M. Wainwright & M. Jordan Discussion led by Qi An ECE, Duke University.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
MACHINE LEARNING 10 Decision Trees. Motivation  Parametric Estimation  Assume model for class probability or regression  Estimate parameters from all.
1 Branch and Bound Searching Strategies Updated: 12/27/2010.
Decision Trees Binary output – easily extendible to multiple output classes. Takes a set of attributes for a given situation or object and outputs a yes/no.
1 Utilizing Shared Vehicle Trajectories for Data Forwarding in Vehicular Networks IEEE INFOCOM MINI-CONFERENCE Fulong Xu, Shuo Gu, Jaehoon Jeong, Yu Gu,
Operational Research & ManagementOperations Scheduling Economic Lot Scheduling 1.Summary Machine Scheduling 2.ELSP (one item, multiple items) 3.Arbitrary.
Operations Research The OR Process. What is OR? It is a Process It assists Decision Makers It has a set of Tools It is applicable in many Situations.
Learning to Detect Faces A Large-Scale Application of Machine Learning (This material is not in the text: for further information see the paper by P.
Bayesian networks and their application in circuit reliability estimation Erin Taylor.
IT Applications for Decision Making. Operations Research Initiated in England during the world war II Make scientifically based decisions regarding the.
1 Random Disambiguation Paths Al Aksakalli In Collaboration with Carey Priebe & Donniell Fishkind Department of Applied Mathematics and Statistics Johns.
Operations Research Models and Methods Advanced Operations Research Reference: Operations Research Models and Methods, Operations Research Models and Methods,
Classification and Regression Trees
1 Decision Trees Greg Grudic (Notes borrowed from Thomas G. Dietterich and Tom Mitchell) [Edited by J. Wiebe] Decision Trees.
Highway Risk Mitigation through Systems Engineering.
Branch and Bound Searching Strategies
Recoverable Service Parts Inventory Problems -Ibrahim Mohammed IE 2079.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Transportation, Assignment, and Network Models 9 To accompany Quantitative Analysis for Management, Twelfth Edition, by Render, Stair, Hanna and Hale Power.
Anifuddin Azis LEARNING. Why is learning important? So far we have assumed we know how the world works Rules of queens puzzle Rules of chess Knowledge.
Chapter 7. Classification and Prediction
k-Nearest neighbors and decision tree
Ch9: Decision Trees 9.1 Introduction A decision tree:
Analytics and OR DP- summary.
Introduction to Operations Research
Saket Anand David Madigan Richard Mammone Saumitr Pathak Fred Roberts
Optimization Problems in Economic Epidemiology
Presentation transcript:

1 Decision Support Algorithms for Port of Entry Inspection Fred S. Roberts DIMACS Center, Rutgers University In collaboration with Los Alamos National Laboratory Preliminary Support: Office of Naval Research

2 Port of Entry Inspection Algorithms Goal: Find ways to intercept illicit nuclear materials and weapons destined for the U.S. via the maritime transportation system Currently inspecting only small % of containers arriving at ports Even inspecting 8% of containers in Port of NY/NJ might bring international trade to a halt (Larrabbee 2002)

3 Port of Entry Inspection Algorithms Aim: Develop decision support algorithms that will help us to “optimally” intercept illicit materials and weapons subject to limits on delays, manpower, and equipment Find inspection schemes that minimize total “cost” including “cost” of false positives and false negatives Mobile Vacis: truck- mounted gamma ray imaging system

4 Sequential Decision Making Problem Stream of containers arrives at a port The Decision Maker’s Problem: Which to inspect? Which inspections next based on previous results? Approach: –“decision logics” –combinatorial optimization methods –Builds on ideas of Stroud –and Saeger at LANL –Need for new models – and methods

5 Sequential Decision Making Problem Containers arriving to be classified into categories. Simple case: 0 = “ok”, 1 = “suspicious” Inspection scheme: specifies which inspections are to be made based on previous observations

6 Sequential Decision Making Problem Containers have attributes, each in a number of states Sample attributes: –Does ship’s manifest set off an “alarm”? –What is the neutron or Gamma emission count? Is it above threshold? –Does a radiograph image come up positive? –Does an induced fission test come up positive? Gamma ray detector

7 Sequential Decision Making Problem Simplest Case: Attributes are in state 0 or 1 Then: Container is a binary string like So: Classification is a decision function F that assigns each binary string to a category F(011001) If attributes 2, 3, and 6 are present, assign container to category F(011001).

8 Sequential Decision Making Problem If there are two categories, 0 and 1, decision function F is a boolean function. Example: F(000) = F(111) = 1, F(abc) = 0 otherwise This classifies a container as positive iff it has none of the attributes or all of them. 1 =

9 Sequential Decision Making Problem Given a container, test its attributes until know enough to calculate the value of F. An inspection scheme tells us in which order to test the attributes to minimize cost. Even this simplified problem is hard computationally. Hard Questions

10 Binary Decision Tree Approach Sensors measure presence/absence of attributes. Binary Decision Tree: –Nodes are sensors or categories (0 or 1) –Two arcs exit from each sensor node, labeled left and right. –Take the right arc when sensor says the attribute is present, left arc otherwise

11 Binary Decision Tree Approach Reach category 1 from the root only through the path a 0 to a 1 to 1. Container is classified in category 1 iff it has both attributes a 0 and a 1. Corresponding boolean function F(11) = 1, F(10) = F(01) = F(00) = 0. Figure 1

12 Binary Decision Tree Approach Reach category 1 from the root by: a 0 L to a 1 R a 2 R 1 or a 0 R a 2 R1 Container classified in category 1 iff it has a 1 and a 2 and not a 0 or a 0 and a 2 and possibly a 1. Corresponding boolean function F(111) = F(101) = F(011) = 1, F(abc) = 0 otherwise. Figure 2

13 Binary Decision Tree Approach This binary decision tree corresponds to the same boolean function F(111) = F(101) = F(011) = 1, F(abc) = 0 otherwise. However, it has one less observation node a i. So, it is more efficient if all observations are equally costly and equally likely. Figure 3

14 Binary Decision Tree Approach Even if the boolean function F is fixed, the problem of finding the “optimal” binary decision tree for it is very hard (NP-complete). For small n = number of attributes, can try to solve it by brute force enumeration. Even for n = 4, not practical. (n = 4 at Port of Long Beach-Los Angeles) Port of Long Beach

15 Binary Decision Tree Approach Promising Approaches: Heuristic algorithms, approximations to optimal. Special assumptions about the boolean function F. Example: For “monotone” boolean functions, integer programming formulations give promising heuristics.

16 Cost Functions Above analysis: Only uses number of sensors Using a sensor has a cost: –Unit cost of inspecting one item with it –Fixed cost of purchasing and deploying it –Delay cost from queuing up at the sensor station Unit Cost Complication: How many nodes of the decision tree are actually visited during average container’s inspection? Depends on “distribution” of containers.

17 Cost Functions: Delay Costs Stochastic process of containers arriving Distribution of delay times for inspections Use queuing theory to find average delay times under different models

18 Cost Functions Cost of false positive: Cost of additional tests. –If it means opening the container, it’s very expensive. Cost of false negative: Complex issue.

19 Cost Functions One Approach to False Positives/Negatives and Sensor Errors: Modeling Sensor Operation Threshold Model: Sensors have different discriminating power Many use counts See if count exceeds threshold

20 Cost Functions Threshold Model: Sensor discriminating power K, threshold T Attribute present if counts exceed T Calculate fraction of objects in each category whose readings exceed T Seek threshold values that minimize all costs: inspection, false positive/negative Simulation approach Mathematical modeling

21 Complications Sensor errors – probabilistic approach More than two values of an attribute (present, absent, present with probability > 75%, etc.) Inferring the boolean function from observations (partially defined boolean functions)

22 Complications Machine learning approaches are promising: –Bayesian binary regression –Splitting strategies –Pruning learned decision trees

23 Research Team Endre Boros, Rutgers, Operations Research Elsayed Elsayed, Rutgers, Industrial and Systems Engineering Paul Kantor, Rutgers, School of Information and Library Studies Sallie Keller-McNulty, Los Alamos, Statistical Sciences Group Alex Kogan, Rutgers, Business School Paul Lioy, Rutgers/UMDNJ, Environmental and Occupational Health and Sciences Institute David Madigan, Rutgers, Statistics Richard Mammone, Rutgers, Center for Advanced Information Processing S. Muthukrishnan, Rutgers, Computer Science Feng Pan, Los Alamos, Energy and Infrastructure Analysis Group Richard Picard, Los Alamos, Statistical Sciences Group Fred Roberts, Rutgers, DIMACS Center Kevin Saeger, Los Alamos, Homeland Security Phillip Stroud, Los Alamos, Systems Engineering and Integration Group

24 For Further Information Fred Roberts Director of DIMACS Chair, Rutgers University Homeland Security Research Initiative Co-chair, New Jersey Universities Homeland Security Research Consortium