RCQ-ACS: RDF Chain Query Optimization Using an Ant Colony System WI 2012 Alexander Hogenboom Erasmus University Rotterdam Ewout Niewenhuijse.

Slides:



Advertisements
Similar presentations
A Comparison Study for Novelty Control Mechanisms Applied to Web News Stories 2012 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2012)
Advertisements

G5BAIM Artificial Intelligence Methods
Polarity Analysis of Texts using Discourse Structure CIKM 2011 Bas Heerschop Erasmus University Rotterdam Frank Goossen Erasmus.
Learning Semantic Information Extraction Rules from News The Dutch-Belgian Database Day 2013 (DBDBD 2013) Frederik Hogenboom Erasmus.
VEHICLE ROUTING PROBLEM
Exploiting Discourse Structure for Sentiment Analysis of Text OR 2013 Alexander Hogenboom In collaboration with Flavius Frasincar, Uzay Kaymak, and Franciska.
Ant colonies for the traveling salesman problem Eliran Natan Seminar in Bioinformatics (236818) – Spring 2013 Computer Science Department Technion - Israel.
Determining Negation Scope and Strength in Sentiment Analysis SMC 2011 Paul van Iterson Erasmus School of Economics Erasmus University Rotterdam
Exploiting Emoticons in Sentiment Analysis SAC 2013 Daniella Bal Erasmus University Rotterdam Flavius Frasincar Erasmus University.
Gizem ALAGÖZ. Simulation optimization has received considerable attention from both simulation researchers and practitioners. Both continuous and discrete.
Path Planning with the humanoid robot iCub Semester Project 2008 Pantelis Zotos Supervisor: Sarah Degallier Biologically Inspired Robotics Group (BIRG)
RCQ-GA: RDF Chain Query Optimization using Genetic Algorithms BNAIC 2009 Alexander Hogenboom, Viorel Milea, Flavius Frasincar, and Uzay Kaymak Erasmus.
Artificial Intelligence
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
Ant Colonies As Logistic Processes Optimizers
Ants-based Routing Marc Heissenbüttel University of Berne
RRT-Connect path solving J.J. Kuffner and S.M. LaValle.
Ant Colony Optimization Optimisation Methods. Overview.
Optimizing RDF Chain Queries using Genetic Algorithms DBDBD 2010 Alexander Hogenboom, Viorel Milea, Flavius Frasincar, and Uzay Kaymak Erasmus University.
Detecting Economic Events Using a Semantics-Based Pipeline 22nd International Conference on Database and Expert Systems Applications (DEXA 2011) September.
Planning operation start times for the manufacture of capital products with uncertain processing times and resource constraints D.P. Song, Dr. C.Hicks.
Ant Colony Optimization Algorithms for the Traveling Salesman Problem ACO Kristie Simpson EE536: Advanced Artificial Intelligence Montana State.
Ant Colony Optimization to Resource Allocation Problems Peng-Yeng Yin and Ching-Yu Wang Department of Information Management National Chi Nan University.
D Nagesh Kumar, IIScOptimization Methods: M1L4 1 Introduction and Basic Concepts Classical and Advanced Techniques for Optimization.
Analyzing Sentiment in a Large Set of Web Data while Accounting for Negation AWIC 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam.
Ant Colony Optimization: an introduction
Ant Colony Optimization (ACO): Applications to Scheduling
FORS 8450 Advanced Forest Planning Lecture 19 Ant Colony Optimization.
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
EVOLVING ANTS Enrique Areyan School of Informatics and Computing Indiana University January 24, 2012.
A News-Based Approach for Computing Historical Value-at-Risk International Symposium on Management Intelligent Systems 2012 (IS-MiS 2012) Frederik Hogenboom.
Travelling Salesman Problem: Convergence Properties of Optimization Algorithms Group 2 Zachary Estrada Chandini Jain Jonathan Lai.
CSM6120 Introduction to Intelligent Systems Other evolutionary algorithms.
Genetic Algorithms and Ant Colony Optimisation
EE4E,M.Sc. C++ Programming Assignment Introduction.
Swarm Computing Applications in Software Engineering By Chaitanya.
Ontology Updating Driven by Events Dutch-Belgian Database Day 2012 (DBDBD 2012) November 21, 2012 Frederik Hogenboom Jordy Sangers.
Swarm Intelligence 虞台文.
Ant Colony Optimization. Summer 2010: Dr. M. Ameer Ali Ant Colony Optimization.
Optimization of multi-pass turning operations using ant colony system Authors: K. Vijayakumar, G. Prabhaharan, P. Asokan, R. Saravanan 2003 Presented by:
Object Oriented Programming Assignment Introduction Dr. Mike Spann
Ant Colony Optimization Algorithms for TSP: 3-6 to 3-8 Timothy Hahn February 13, 2008.
Discrete optimization of trusses using ant colony metaphor Saurabh Samdani, Vinay Belambe, B.Tech Students, Indian Institute Of Technology Guwahati, Guwahati.
The Application of The Improved Hybrid Ant Colony Algorithm in Vehicle Routing Optimization Problem International Conference on Future Computer and Communication,
Inga ZILINSKIENE a, and Saulius PREIDYS a a Institute of Mathematics and Informatics, Vilnius University.
Thursday, May 9 Heuristic Search: methods for solving difficult optimization problems Handouts: Lecture Notes See the introduction to the paper.
Erasmus University Rotterdam Introduction Content-based news recommendation is traditionally performed using the cosine similarity and TF-IDF weighting.
Towards Cross-Language Sentiment Analysis through Universal Star Ratings KMO 2012 Malissa Bal Erasmus University Rotterdam Flavius.
Building a Distributed Full-Text Index for the Web by Sergey Melnik, Sriram Raghavan, Beverly Yang and Hector Garcia-Molina from Stanford University Presented.
Ant colony optimization. HISTORY introduced by Marco Dorigo (MILAN,ITALY) in his doctoral thesis in 1992 Using to solve traveling salesman problem(TSP).traveling.
Lexico-semantic Patterns for Information Extraction from Text The International Conference on Operations Research 2013 (OR 2013) Frederik Hogenboom
Scalable Keyword Search on Large RDF Data. Abstract Keyword search is a useful tool for exploring large RDF datasets. Existing techniques either rely.
Optimization Problems
5 Fundamentals of Ant Colony Search Algorithms Yong-Hua Song, Haiyan Lu, Kwang Y. Lee, and I. K. Yu.
1 Ann Nowé Nature inspired agents to handle interaction in IT systems Ann Nowé Computational modeling Lab Vrije Universiteit Brussel.
Ant Colony Optimization Andriy Baranov
Biologically Inspired Computation Ant Colony Optimisation.
What is Ant Colony Optimization?
Metaheuristics for the New Millennium Bruce L. Golden RH Smith School of Business University of Maryland by Presented at the University of Iowa, March.
Optimization Problems
CSCI 4310 Lecture 10: Local Search Algorithms
School of Computer Science & Engineering
Probabilistic Data Management
Subject Name: Operation Research Subject Code: 10CS661 Prepared By:Mrs
Advanced Artificial Intelligence Evolutionary Search Algorithm
Ant colonies for traveling salesman problem
Optimization Problems
Metaheuristic methods and their applications. Optimization Problems Strategies for Solving NP-hard Optimization Problems What is a Metaheuristic Method?
Ant Colony Optimization
traveling salesman problem
Presentation transcript:

RCQ-ACS: RDF Chain Query Optimization Using an Ant Colony System WI 2012 Alexander Hogenboom Erasmus University Rotterdam Ewout Niewenhuijse Erasmus University Rotterdam Flavius Frasincar Erasmus University Rotterdam Frederik Hogenboom Erasmus University Rotterdam December 5, 2012

Introduction (1) The Semantic Web allows for an ever-growing amount of data to be stored in many heterogeneous, yet interconnected sources Fast query engines are needed for efficient querying of large amounts of data, typically represented by means of the Resource Description Framework (RDF) WI 2012December 5, 2012

Introduction (2) A major challenge lies in optimizing query paths: the order in which distinct parts of a query are evaluated Existing solutions for Semantic Web: –Two-phase optimization (2PO): Iterative Improvement (II) Simulated Annealing (SA) –Genetic Algorithm (GA) Ant Colony Optimization (ACO) appears to be a feasible alternative for the dynamic Semantic Web WI 2012December 5, 2012

RDF and Query Paths (1) An RDF model is a collection of facts declared in RDF Facts are triples in the form of a node-arc-node link consisting of a subject, a predicate, and an object RDF sources can be queried using SPARQL WI 2012December 5, 2012

RDF and Query Paths (2) We consider a subset of SPARQL queries: chain queries, where a query path is followed by performing joins between its subpaths of length 1 Example RDF chain query: 1. PREFIXc: 2. PREFIXo: 3. SELECT?partner 4. WHERE {c:NL o:exportPartner ?expPartner. 5. ?expPartner o:country ?partner. 6. ?partner o:dependentArea ?area. 7.?area o:internationalDispute ?conflict. 9. } WI 2012December 5, 2012

RDF and Query Paths (3) WI 2012 Left-deep query tree Bushy query tree December 5, 2012

RDF Query Path Optimization (1) Challenge: determine the right order in which the joins should be computed Optimize the overall response time Explore a solution space with query paths Solution space size exponential in number of concepts WI 2012December 5, 2012

RDF Query Path Optimization (2) Solutions are associated with data transmission and processing costs Data processing costs are the sum of all join costs, which are influenced by the cardinalities of each operand and the join method used (nested-loop) Neighboring solutions in the solution space can be identified using transformation rules WI 2012December 5, 2012

RDF Query Path Optimization (3) WI 2012 Join commutativityJoin associativity Left join exchangeRight join exchange December 5, 2012

RDF Query Path Optimization (4) Exploring the solution space by means of 2PO: –Using II, local optima are found by walking through the solution space (from random starting points), while only taking steps yielding improvement in solution quality –The best local optimum thus found is used as starting point for SA: a walk through the solution space, where moves not yielding improvement are accepted with a declining probability A GA has proven to outperform 2PO As ACO has proven to outperform GAs in solving other complex problems and ACO can deal with continuously changing environments, ACO is a promising alternative WI 2012December 5, 2012

RDF Chain Query Optimization with Ants (1) Artificial ants explore a solution space by iteratively: –Constructing a path from a starting point to an ending point –Updating pheromone traces marking their paths Steps depend on pheromone traces and local heuristics Ant Colony System (ACS) is a faster converging ACO variant, differing from classic ACO algorithms in that: –Ants occasionally simply take the step with the highest probability rather than possibly taking less likely steps –Pheromone traces are only deposited on parts of the best-so- far solution instead of on all paths taken by all ants –Pheromone evaporation only takes place on paths visited by ants, rather than on all paths WI 2012December 5, 2012

RDF Chain Query Optimization with Ants (2) We model the solution space based on an ordinal number scheme for encoding chain queries The encoding scheme iteratively joins two concepts in an ordered list of concepts, while saving the result on the position of first appearing concept Example: –(t1, t2, t3, t4): join 2 and 4 –(t1, t2t4, t3): join 2 and 1 –(t2t4t1, t3): join 2 and 1 –(t3t2t4t1) Encoded solution: ((2,4),(2,1),(2,1)) WI 2012December 5, 2012

RDF Chain Query Optimization with Ants (3) WI 2012December 5, 2012

Evaluation (1) We evaluate RDF chain query optimization (RCQ) by means of 2PO, a GA, and ACS on an RDF version of the CIA World Factbook (over 100,000 triples) The full solution space is considered Each algorithm is assessed in terms of execution time and solution quality, for chain queries varying in length from 3 to 20 predicates (2 to 19 joins) Each experiment is iterated 100 times We assess significance of performance differences by means of a paired, two-sided Wilcoxon signed rank test WI 2012December 5, 2012

Evaluation (2) WI 2012December 5, 2012

Evaluation (3) WI 2012December 5, 2012

Conclusions We have proposed an ACS approach in which artificial ants identify low-cost query paths guided by previously encountered solutions and local heuristics Our approach significantly outperforms existing work when optimizing RDF chain queries consisting of up to approximately 10 joins WI 2012December 5, 2012

Future Work Optimize parameters and perform a sensitivity analysis Make our ACS approach more scalable Evaluate our method in a real-world query execution engine in a dynamic, distributed setting WI 2012December 5, 2012

Questions? Alexander Hogenboom Erasmus School of Economics Erasmus University Rotterdam P.O. Box 1738, NL-3000 DR Rotterdam, the Netherlands WI 2012December 5, 2012