5/12/2015PhD seminar CS BGU Counting subgraphs Support measures for graphs Natalia Vanetik.

Slides:



Advertisements
Similar presentations
Maximum flow Main goals of the lecture:
Advertisements

CSE 211 Discrete Mathematics
 Theorem 5.9: Let G be a simple graph with n vertices, where n>2. G has a Hamilton circuit if for any two vertices u and v of G that are not adjacent,
1 Decomposing Hypergraphs with Hypertrees Raphael Yuster University of Haifa - Oranim.
Lecture 5 Graph Theory. Graphs Graphs are the most useful model with computer science such as logical design, formal languages, communication network,
gSpan: Graph-based substructure pattern mining
Theory of Computing Lecture 18 MAS 714 Hartmut Klauck.
Greedy Algorithms Greed is good. (Some of the time)
A Separator Theorem for Graphs with an Excluded Minor and its Applications Paul Seymour Noga Alon Robin Thomas Lecturer : Daniel Motil.
Lecture 22: April 18 Probabilistic Method. Why Randomness? Probabilistic method: Proving the existence of an object satisfying certain properties without.
NP-Completeness: Reductions
1 Partition Into Triangles on Bounded Degree Graphs Johan M. M. van Rooij Marcel E. van Kooten Niekerk Hans L. Bodlaender.
One of the most important problems is Computational Geometry is to find an efficient way to decide, given a subdivision of E n and a point P, in which.
Approximation Algorithms Chapter 5: k-center. Overview n Main issue: Parametric pruning –Technique for approximation algorithms n 2-approx. algorithm.
CS774. Markov Random Field : Theory and Application Lecture 17 Kyomin Jung KAIST Nov
Optimization of Pearl’s Method of Conditioning and Greedy-Like Approximation Algorithm for the Vertex Feedback Set Problem Authors: Ann Becker and Dan.
Rajat K. Pal. Chapter 3 Emran Chowdhury # P Presented by.
Complexity 16-1 Complexity Andrei Bulatov Non-Approximability.
Computability and Complexity 23-1 Computability and Complexity Andrei Bulatov Search and Optimization.
Complexity 15-1 Complexity Andrei Bulatov Hierarchy Theorem.
Computability and Complexity 15-1 Computability and Complexity Andrei Bulatov NP-Completeness.
Branch and Bound Similar to backtracking in generating a search tree and looking for one or more solutions Different in that the “objective” is constrained.
1 2 Introduction In last chapter we saw a few consistency tests. In this chapter we are going to prove the properties of Plane-vs.- Plane test: Thm[RaSa]:
Perfect Graphs Lecture 23: Apr 17. Hard Optimization Problems Independent set Clique Colouring Clique cover Hard to approximate within a factor of coding.
Is the following graph Hamiltonian- connected from vertex v? a). Yes b). No c). I have absolutely no idea v.
1 CSE 417: Algorithms and Computational Complexity Winter 2001 Lecture 23 Instructor: Paul Beame.
SubSea: An Efficient Heuristic Algorithm for Subgraph Isomorphism Vladimir Lipets Ben-Gurion University of the Negev Joint work with Prof. Ehud Gudes.
The community-search problem and how to plan a successful cocktail party Mauro SozioAris Gionis Max Planck Institute, Germany Yahoo! Research, Barcelona.
New Algorithm DOM for Graph Coloring by Domination Covering
4/17/2017 Section 8.5 Euler & Hamilton Paths ch8.5.
Mining Graphs with Constrains on Symmetry and Diameter Natalia Vanetik Deutsche Telecom Laboratories at Ben-Gurion University IWGD10 workshop July 14th,
Computer Science 1 Web as a graph Anna Karpovsky.
Minimum Spanning Trees. Subgraph A graph G is a subgraph of graph H if –The vertices of G are a subset of the vertices of H, and –The edges of G are a.
CSV: Visualizing and Mining Cohesive Subgraphs Nan Wang Srinivasan Parthasarathy Kian-Lee Tan Anthony K. H. Tung School of Computing National University.
Fixed Parameter Complexity Algorithms and Networks.
I/O-Efficient Graph Algorithms Norbert Zeh Duke University EEF Summer School on Massive Data Sets Århus, Denmark June 26 – July 1, 2002.
Approximating Minimum Bounded Degree Spanning Tree (MBDST) Mohit Singh and Lap Chi Lau “Approximating Minimum Bounded DegreeApproximating Minimum Bounded.
MCS 312: NP Completeness and Approximation algorithms Instructor Neelima Gupta
Graph Reconstruction Conjecture. Proposed by S.M. Ulan & P.J. Kelly in 1941: The conjecture states that every graph with at least 3 vertices is reconstructible;
INHERENT LIMITATIONS OF COMPUTER PROGRAMS CSci 4011.
Edge-disjoint induced subgraphs with given minimum degree Raphael Yuster 2012.
1 Steiner Tree Algorithms and Networks 2014/2015 Hans L. Bodlaender Johan M. M. van Rooij.
The Tutte Polynomial Graph Polynomials winter 05/06.
CSE 024: Design & Analysis of Algorithms Chapter 9: NP Completeness Sedgewick Chp:40 David Luebke’s Course Notes / University of Virginia, Computer Science.
An Efficient Algorithm for Enumerating Pseudo Cliques Dec/18/2007 ISAAC, Sendai Takeaki Uno National Institute of Informatics & The Graduate University.
Chapter 3. Community Detection and Evaluation May 2013 Youn-Hee Han
1 The number of orientations having no fixed tournament Noga Alon Raphael Yuster.
Maximum density of copies of a graph in the n-cube John Goldwasser Ryan Hansen West Virginia University.
Additional NP-complete problems
CS774. Markov Random Field : Theory and Application Lecture 02
Introduction to Graph Theory
Graph Colouring L09: Oct 10. This Lecture Graph coloring is another important problem in graph theory. It also has many applications, including the famous.
Planar Graphs Graph Coloring
Mining Top-K Large Structural Patterns in a Massive Network Feida Zhu 1, Qiang Qu 2, David Lo 1, Xifeng Yan 3, Jiawei Han 4, and Philip S. Yu 5 1 Singapore.
Fixed parameter algorithms for protein similarity search under mRNA structure constrains A joint work by: G. Blin, G. Fertin, D. Hermelin, and S. Vialette.
Computing Branchwidth via Efficient Triangulations and Blocks Authors: F.V. Fomin, F. Mazoit, I. Todinca Presented by: Elif Kolotoglu, ISE, Texas A&M University.
1 CS612 Algorithms for Electronic Design Automation CS 612 – Lecture 8 Lecture 8 Network Flow Based Modeling Mustafa Ozdal Computer Engineering Department,
1 Assignment #3 is posted: Due Thursday Nov. 15 at the beginning of class. Make sure you are also working on your projects. Come see me if you are unsure.
Graph Indexing From managing and mining graph data.
Chapter 11. Chapter Summary  Introduction to trees (11.1)  Application of trees (11.2)  Tree traversal (11.3)  Spanning trees (11.4)
+ GRAPH Algorithm 2 Dikompilasi dari banyak sumber.
 2005 SDU Lecture15 P,NP,NP-complete.  2005 SDU 2 The PATH problem PATH = { | G is a directed graph that has a directed path from s to t} s t
Ning Jin, Wei Wang ICDE 2011 LTS: Discriminative Subgraph Mining by Learning from Search History.
Gspan: Graph-based Substructure Pattern Mining
Finding Dense and Connected Subgraphs in Dual Networks
What is the next line of the proof?
Graphs Chapter 13.
Algorithms (2IL15) – Lecture 7
Existence of 3-factors in Star-free Graphs with High Connectivity
Trevor Brown DC 2338, Office hour M3-4pm
Presentation transcript:

5/12/2015PhD seminar CS BGU Counting subgraphs Support measures for graphs Natalia Vanetik

PhD seminar CS BGU 5/12/2015 This research was carried out under the supervision of Prof. Eyal S. Shimony and Prof. Ehud Gudes. Published in DAMI Journal vol. 13(2), September 2006.

PhD seminar CS BGU 5/12/2015 Multiflows in graphs Counting functions in graphs Research directions

PhD seminar CS BGU 5/12/2015 Problem description Let D and G be graphs. We need to measure statistical significance of G as a subgraph of D. Observe instances (isomorphic copies) of G within D. D D D G GG G G G G G G has zero significanceG has high significanceG has some significance

PhD seminar CS BGU 5/12/2015 Definition A counting function on graphs that measures statistical significance of one graph G as a subgraph of another graph D is called a support measure. It is obvious that when G is not a subgraph of D, this function should return 0. Otherwise, it should return value greater than 0.

PhD seminar CS BGU 5/12/2015 Traditional support measure An item-set X in relational model is a set of tuples (f 1,v 1 ),…,(f n,v n ) where f i are the names of fields and v i are values. A transaction T supports X if the value of f i in it equals to v i for every i=1…n. A support of an item-set X is the number of transactions in the database that support X.

PhD seminar CS BGU 5/12/2015 Admissibility It is important, especially for graph mining, that support measure is admissible or has a downward closure property or antimonotonicity: support of a graph cannot be smaller than support of its supergraph. support of a graph cannot be larger than support of any of its subgraphs.

PhD seminar CS BGU 5/12/2015 Motivation Significant amount of data in the world is graph- like and not relational. Graph data is usually represented by one or more large graphs. Transaction-like graph datasets are rare. Traditional support definition is not admissible. Admissible support measures are required for mining the graph data and other tasks.

PhD seminar CS BGU 5/12/2015 Instance graph We observe all the subgraphs of G in D, called instances. Instances are thought to be connected if they have an edge/node/subgraph in common. A graph with instances of G as nodes and edges between every pair of connected vertices is called the instance graph of G in D.

PhD seminar CS BGU 5/12/2015 Instance graph: an example G D Instance graph of G in D

PhD seminar CS BGU 5/12/2015 Intuitive support measures Just count the instances. Perform some sort of a weighted count. G Count D (G)=3 G Wcount D (G) = Count D (G) / 3 =1 D D

PhD seminar CS BGU 5/12/2015 The problem with intuitive approach is… …that these measures are not admissible: Count D (G)=3 Count D (g)=1 G g D D Wcount D (G)=1+1=2 Wcount D (g)= 1/2+1/2+1/2=3/2 G g

PhD seminar CS BGU 5/12/2015 What is going on? A counting function can be viewed as acting on the instance graph. A graph g and its supergraph G have different instance graphs I g and I G, and I g is obtained from I G by a series of graph operations. If a counting function does not decrease under these operations, it is admissible (for specific G and g, at least).

PhD seminar CS BGU 5/12/2015 Operations on instance graphs We narrowed it down to the following three operations on instance graphs:  clique contraction,  node addition,  edge deletion.

PhD seminar CS BGU 5/12/2015 Clique contraction A clique is contracted into a single node. Another node is incident to the new one only if it was incident to all the nodes in the clique. Intuition behind it: G G g

PhD seminar CS BGU 5/12/2015 Node addition A new node and some edges incident to this node are added. Intuition behind it: G G g g g

PhD seminar CS BGU 5/12/2015 Edge removal An edge is removed. Intuition behind it: G G gg

PhD seminar CS BGU 5/12/2015 The main result Theorem. A support measure on graphs is admissible if and only if it does not decrease under following operations on instance graphs: 1. clique contraction, 2. edge removal, 3. node addition.

PhD seminar CS BGU 5/12/2015 Sufficiency To prove sufficiency for these three operations, we need to show that for every graph D and every pair of graphs G and g, s.t. g is a subgraph of G, the instance graph I g of g is obtained from the instance graph I G of G by these operations alone.

PhD seminar CS BGU 5/12/2015 Sufficiency: proof outline The proof is constructive (algorithmic). The main idea is to build a pair of mappings, first from instances of G to instances of g and second from instances of g to instances of G. Perform clique contractions and node additions to obtain a vertex set of I g from a vertex set of I G. Perform edge deletions as necessary.

PhD seminar CS BGU 5/12/2015 Necessity To prove the necessity, we need to show that for every graph H and every operation (from the above list) that produces a graph h, there exist a database graph D and a pair of its subgraphs G and g, where g is a subgraph of G, so that H=I G and h=I g.

PhD seminar CS BGU 5/12/2015 Necessity: proof outline The proof is constructive. Specific graphs G and g are constructed. For convenience, these graphs are labeled. Intersection types for instances of G and g in D are defined. D is constructed accordingly.

PhD seminar CS BGU 5/12/2015 Necessity: the patterns a … … a a a b c dd dd G a … a a a b g Top Bottom Legs Arms Legs

PhD seminar CS BGU 5/12/2015 Necessity: intersection Following intersection types are allowed in D: Bottom overlap: all legs of two instances overlap. Leg overlap: two instances have exactly one leg in common. Arm overlap: two instances have exactly one arm in common.

PhD seminar CS BGU 5/12/2015 Bottom overlap: for clique contraction a … … a a a b c dd dd G1G1 … c dd dd … c dd dd G3G3 G2G2

PhD seminar CS BGU 5/12/2015 Leg overlap: for node addition a … … a a a b c dd dd G1G1 … … a a b c dd dd G2G2

PhD seminar CS BGU 5/12/2015 Arm overlap: for edge removal a … … a a a b c dd dd G1G1 … … a a b c d d G2G2 a a

PhD seminar CS BGU 5/12/2015 Necessity: proof outline Use instances of G to construct the database graph D. Prove that no additional instances of G arise from the overlaps. Show that the instance graph of g arises from the instance graph of G by applying the chosen operation.

PhD seminar CS BGU 5/12/2015 MIS measure MIS measure is the size of maximum independent set (anti-clique) in the instance graph. It satisfies the necessity conditions (direct admissibility proof is also available). It was used in several papers (Han, Kuramochi etc.) No other admissible support measure have been found to date.

PhD seminar CS BGU 5/12/2015 MIS: example G D Instance graph I G of G in D MIS(I G )=1

PhD seminar CS BGU 5/12/2015 Extensions  Necessary and sufficient conditions can be re-formulated for different pattern intersection types (for example, a common node can be considered an intersection).

PhD seminar CS BGU 5/12/2015 Open problems and conjectures Is computation of an admissible support measure an NP-hard problem, regardless of the measure chosen? Is any admissible support measure a function on MIS size? What kind of a function?

PhD seminar CS BGU 5/12/2015 Thank you!

PhD seminar CS BGU 5/12/2015 Questions?