1 A Throughput-Efficient Packet Classifier with n Bloom filters Authors: Heeyeol Yu and Rabi Mahapatra Publisher: IEEE GLOBECOM 2008 proceedings Present:

Slides:



Advertisements
Similar presentations
A Memory-optimized Bloom Filter using An Additional Hashing Function Author: Mahmood Ahmadi, Stephan Wong Publisher: IEEE GLOBECOM 2008 Presenter: Yu-Ping.
Advertisements

1 An Efficient, Hardware-based Multi-Hash Scheme for High Speed IP Lookup Hot Interconnects 2008 Socrates Demetriades, Michel Hanna, Sangyeun Cho and Rami.
1 Blooming Trees: Space-Efficient Structures for Data Representation Author: Domenico Ficara, Stefano Giordano, Gregorio Procissi, Fabio Vitucci Publisher:
Segmented Hash: An Efficient Hash Table Implementation for High Performance Networking Subsystems Sailesh Kumar Patrick Crowley.
Exploiting Graphics Processors for High- performance IP Lookup in Software Routers Author: Jin Zhao, Xinya Zhang, Xin Wang, Yangdong Deng, Xiaoming Fu.
Author: Francis Chang, Wu-chang Feng, Kang Li Publisher: INFOCOM 2004 Presenter: Yun-Yan Chang Date: 2010/12/01 1.
Hit or Miss ? !!!.  Cache RAM is high-speed memory (usually SRAM).  The Cache stores frequently requested data.  If the CPU needs data, it will check.
1 Author: Ioannis Sourdis, Sri Harsha Katamaneni Publisher: IEEE ASAP,2011 Presenter: Jia-Wei Yo Date: 2011/11/16 Longest prefix Match and Updates in Range.
IP Address Lookup for Internet Routers Using Balanced Binary Search with Prefix Vector Author: Hyesook Lim, Hyeong-gee Kim, Changhoon Publisher: IEEE TRANSACTIONS.
1 Design of Bloom Filter Array for Network Anomaly Detection Author: Jieyan Fan, Dapeng Wu, Kejie Lu, Antonio Nucci Publisher: IEEE GLOBECOM 2006 Presenter:
Fast Filter Updates for Packet Classification using TCAM Authors: Haoyu Song, Jonathan Turner. Publisher: GLOBECOM 2006, IEEE Present: Chen-Yu Lin Date:
1 A Tree Based Router Search Engine Architecture With Single Port Memories Author: Baboescu, F.Baboescu, F. Tullsen, D.M. Rosu, G. Singh, S. Tullsen, D.M.Rosu,
Scalable IPv6 Lookup/Update Design for High-Throughput Routers Authors: Chung-Ho Chen, Chao-Hsien Hsu, Chen -Chieh Wang Presenter: Yi-Sheng, Lin ( 林意勝.
1 Energy Efficient Multi-match Packet Classification with TCAM Fang Yu
Efficient Multidimensional Packet Classification with Fast Updates Author: Yeim-Kuan Chang Publisher: IEEE TRANSACTIONS ON COMPUTERS, VOL. 58, NO. 4, APRIL.
1 Partition Filter Set for Power- Efficient Packet Classification Authors: Haibin Lu, MianPan Publisher: IEEE GLOBECOM 2006 Present: Chen-Yu Lin Date:
Parallel-Search Trie-based Scheme for Fast IP Lookup
Two stage packet classification using most specific filter matching and transport level sharing Authors: M.E. Kounavis *,A. Kumar,R. Yavatkar,H. Vin Presenter:
SSA: A Power and Memory Efficient Scheme to Multi-Match Packet Classification Fang Yu 1 T. V. Lakshman 2 Martin Austin Motoyama 1 Randy H. Katz 1 1 EECS.
An Efficient IP Lookup Architecture with Fast Update Using Single-Match TCAMs Author: Jinsoo Kim, Junghwan Kim Publisher: WWIC 2008 Presenter: Chen-Yu.
Software-Based Cache Coherence with Hardware-Assisted Selective Self Invalidations Using Bloom Filters Authors : Thomas J. Ashby, Pedro D´ıaz, Marcelo.
Fast binary and multiway prefix searches for pachet forwarding Author: Yeim-Kuan Chang Publisher: COMPUTER NETWORKS, Volume 51, Issue 3, pp , February.
Hash, Don’t Cache: Fast Packet Forwarding for Enterprise Edge Routers Minlan Yu Princeton University Joint work with Jennifer.
1 Route Table Partitioning and Load Balancing for Parallel Searching with TCAMs Department of Computer Science and Information Engineering National Cheng.
Sarang Dharmapurikar With contributions from : Praveen Krishnamurthy,
PARALLEL TABLE LOOKUP FOR NEXT GENERATION INTERNET
Fast and deterministic hash table lookup using discriminative bloom filters  Author: Kun Huang, Gaogang Xie,  Publisher: 2013 ELSEVIER Journal of Network.
IKI 10100: Data Structures & Algorithms Ruli Manurung (acknowledgments to Denny & Ade Azurat) 1 Fasilkom UI Ruli Manurung (Fasilkom UI)IKI10100: Lecture8.
CS212: DATA STRUCTURES Lecture 10:Hashing 1. Outline 2  Map Abstract Data type  Map Abstract Data type methods  What is hash  Hash tables  Bucket.
Author : Ozgun Erdogan and Pei Cao Publisher : IEEE Globecom 2005 (IJSN 2007) Presenter : Zong-Lin Sie Date : 2010/12/08 1.
(TPDS) A Scalable and Modular Architecture for High-Performance Packet Classification Authors: Thilan Ganegedara, Weirong Jiang, and Viktor K. Prasanna.
LayeredTrees: Most Specific Prefix based Pipelined Design for On-Chip IP Address Lookups Author: Yeim-Kuau Chang, Fang-Chen Kuo, Han-Jhen Guo and Cheng-Chien.
ETRI Bloom Filter-based Flat Name Resolution System for ICN ICNRG Interim meeting, Paris Jungha Hong, Woojik Chun, and Heeyoung Jung September 27, 2014.
Complexity of algorithms Algorithms can be classified by the amount of time they need to complete compared to their input size. There is a wide variety:
Author: Haoyu Song, Fang Hao, Murali Kodialam, T.V. Lakshman Publisher: IEEE INFOCOM 2009 Presenter: Chin-Chung Pan Date: 2009/12/09.
Efficient Packet Classification with Digest Caches Francis Chang Wu-chang Feng Wu-chi Feng Kang Li.
Peacock Hash: Deterministic and Updatable Hashing for High Performance Networking Sailesh Kumar Jonathan Turner Patrick Crowley.
Fast Packet Classification Using Bloom filters Authors: Sarang Dharmapurikar, Haoyu Song, Jonathan Turner, and John Lockwood Publisher: ANCS 2006 Present:
Multi-Field Range Encoding for Packet Classification in TCAM Author: Yeim-Kuan Chang, Chun-I Lee and Cheng-Chien Su Publisher: INFOCOM 2011 Presenter:
Author : Guangdeng Liao, Heeyeol Yu, Laxmi Bhuyan Publisher : Publisher : DAC'10 Presenter : Jo-Ning Yu Date : 2010/10/06.
A Formal Analysis of Conservative Update Based Approximate Counting Gil Einziger and Roy Freidman Technion, Haifa.
On Adding Bloom Filters to Longest Prefix Matching Algorithms
Author : Christopher J. Martinez, Devang K. Pandya, and Wei-Ming Lin Publisher/Conf : IEEE/ACM TRANSACTIONS ON NETWORKING Speaker : Chen Deyu Data :
StrideBV: Single chip 400G+ packet classification Author: Thilan Ganegedara, Viktor K. Prasanna Publisher: HPSR 2012 Presenter: Chun-Sheng Hsueh Date:
Segmented Hash: An Efficient Hash Table Implementation for High Performance Networking Subsystems Sailesh Kumar Patrick Crowley.
1 Power-Efficient TCAM Partitioning for IP Lookups with Incremental Updates Author: Yeim-Kuan Chang Publisher: ICOIN 2005 Presenter: Po Ting Huang Date:
PARALLEL-SEARCH TRIE- BASED SCHEME FOR FAST IP LOOKUP Author: Roberto Rojas-Cessa, Lakshmi Ramesh, Ziqian Dong, Lin Cai Nirwan Ansari Publisher: IEEE GLOBECOM.
Author: Heeyeol Yu and Rabi Mahapatra
High-Speed Policy-Based Packet Forwarding Using Efficient Multi-dimensional Range Matching Lakshman and Stiliadis ACM SIGCOMM 98.
Cross-Product Packet Classification in GNIFS based on Non-overlapping Areas and Equivalence Class Author: Mohua Zhang, Ge Li Publisher: AISS 2012 Presenter:
Parallel tree search: An algorithmic approach for multi- field packet classification Authors: Derek Pao and Cutson Liu. Publisher: Computer communications.
Cuckoo Filter: Practically Better Than Bloom Author: Bin Fan, David G. Andersen, Michael Kaminsky, Michael D. Mitzenmacher Publisher: ACM CoNEXT 2014 Presenter:
IP Routing table compaction and sampling schemes to enhance TCAM cache performance Author: Ruirui Guo a, Jose G. Delgado-Frias Publisher: Journal of Systems.
1 Space-Efficient TCAM-based Classification Using Gray Coding Authors: Anat Bremler-Barr and Danny Hendler Publisher: IEEE INFOCOM 2007 Present: Chen-Yu.
Hierarchical packet classification using a Bloom filter and rule-priority tries Source : Computer Communications Authors : A. G. Alagu Priya 、 Hyesook.
Author : Lynn Choi, Hyogon Kim, Sunil Kim, Moon Hae Kim Publisher/Conf : IEEE/ACM TRANSACTIONS ON NETWORKING Speaker : De yu Chen Data :
Author : Masanori Bando and H. Jonathan Chao Publisher : INFOCOM, 2010 Presenter : Jo-Ning Yu Date : 2011/02/16.
TOPIC 5 ASSIGNMENT SORTING, HASH TABLES & LINKED LISTS Yerusha Nuh & Ivan Yu.
Packet Classification Using Multi- Iteration RFC Author: Chun-Hui Tsai, Hung-Mao Chu, Pi-Chung Wang Publisher: 2013 IEEE 37th Annual Computer Software.
Range Hash for Regular Expression Pre-Filtering Publisher : ANCS’ 10 Author : Masanori Bando, N. Sertac Artan, Rihua Wei, Xiangyi Guo and H. Jonathan Chao.
BUFFALO: Bloom Filter Forwarding Architecture for Large Organizations Minlan Yu Princeton University Joint work with Alex Fabrikant,
Author: Heeyeol Yu; Mahapatra, R.; Publisher: IEEE INFOCOM 2008
Statistical Optimal Hash-based Longest Prefix Match
Packet Classification Using Coarse-Grained Tuple Spaces
Hash Functions for Network Applications (II)
Lecture 1: Bloom Filters
Author: Yi Lu, Balaji Prabhakar Publisher: INFOCOM’09
A flow aware packet sampling mechanism for high speed links
A SRAM-based Architecture for Trie-based IP Lookup Using FPGA
Authors: Ding-Yuan Lee, Ching-Che Wang, An-Yeu Wu Publisher: 2019 VLSI
Presentation transcript:

1 A Throughput-Efficient Packet Classifier with n Bloom filters Authors: Heeyeol Yu and Rabi Mahapatra Publisher: IEEE GLOBECOM 2008 proceedings Present: Chen-Yu Lin Date: April,29, 2009

Outline Introduction A Multi-Tiered Packet Classifier with n BFs Bloom filter theory Building a multi-tiered packet classifier Insert in a MPC Query in a MPC Simulation result 2

Introduction Due to memory efficiency and fast lookup, Bloom filters (BFs) have been widely used for packet classification in a high-speed router. However, in a parallel packet classifier (PPC) of n parallel BFs, using all n BFs for a lookup is not throughput efficient in a high speed router. In this paper, we propose a multi-tiered packet classifier (MPC) for high throughput with the same memory size as a PPC. A PPC with n BFs needs θ(n) BF access complexity for a lookup. Our MPC is geared to have the complexity far less than θ(n). 3

Introduction A BF is essentially a generalized hash mechanism on a key set with k hash functions for approximate membership testing. A BF of an m-bit vector has a limitation of a false positive (f-positive). Thus, to sustain a extremely low f-positive rate and resolve it are necessary in a high speed packet classification. A PPC accesses n BFs for one lookup every cycle. A MPC accesses n BFs for several lookups every cycle with the same BF’s memory amount as that in a PPC. 4

Introduction Probing all BFs for a packet lookup in a cycle is not throughput efficient, because only one BF is actually associated with the lookup. the rest BFs can do other lookups in the cycle. Thus, to distribute lookup requests to their corresponding BFs without probing the irrelevant BFs for a lookup is throughput efficient in packet classification. 5

A multi-tiered packet classifier with n BFs Bloom filter theory A legacy BF for representing a set S of n items (keys) is described by an m-bit array memory with initially set to 0. A BF uses k independent hash functions h 0, …, h k-1 within the range [0,m-1]. For insertion of a key e, the bits indexed by h i (e) are set to 1. (0 ≦ i ≦ k-1). To query the key y, k bits by h i (y) should all be 1. If so, the BF returns “yes” about the query. If not, then clearly the key y is not a member of S. Each hash function corresponds to one random lookup in an m-bit memory, and a BF with k hash functions needs the exact same k of memory read ports in an m-bit memory. A BF is considered as a high computation element. 6

A multi-tiered packet classifier with n BFs Building a multi-tiered packet classifier 7

A multi-tiered packet classifier with n BFs Build a MPC To build 2-tiered BFs, the total PPC memory is split into 2 parts: A pre-stage of small-sized BFs with one read port. A post-stage of large-sized BFs with k-1 read ports. Then, a small-sized BF is logically connected to two large-sized BFs. So that, a forest of binary trees is built. A MPC in place of a PPC configured in a linear form. 8

A multi-tiered packet classifier with n BFs Insert in a MPC Insert operation of a key in a BF on layer 1 is as simple as insertion of the key in a legacy BF. On layer j, if a key to hash is assigned to, the key is given to for insert procedure. For example: The memory write complexity of one key insertion : 9 Can be pipelined

A multi-tiered packet classifier with n BFs We proof that an MPC uses the same memory size as PPC by mathematical. Processing a lookup in small-sized BFs of one read port does not provide a higher throughput in large-sized BFs on a lower layer. The number of read ports in the small-sized BF needs to be the same as buffer size (b). As suggested in [3], using mini-BFs with few read ports is the solution without degrading lookup accuracy. Breaking a BF into mini-BFs only gives the possibility of fabricating BFs for packet processing, not the benefit of high throughput. 10

A multi-tiered packet classifier with n BFs Query in a MPC In this forest, a lookup starts from parent BFs in the pre-stage to children BFs in post-stage. The lookup can proceed to two children BFs in a post-stage, if the lookup in a parent BF return positive (or return ‘yes’). There is the possibility of not reaching the two large-sized BFs irrelevant to the lookup, compared to a PPC. The rest idle large-sized BFs in the post-stage can be utilized for other lookups. 11

A multi-tiered packet classifier with n BFs Unlike insert operation where only involved BFs need to be accessed, query operation needs to access all BFs to find which BFs return ‘yes’. Because except one BF, the rest BFs give f-positive leading to misclassification. The irrelevant BFs in MPC are geared not to be probed. The BF access complexity in processing a lookup with n BFs is far less than n. (compared with PPC) By distributing a group of lookups to corresponding BFs in a cycle at best, a higher throughput is achieved in an MPC. 12

A multi-tiered packet classifier with n BFs true-path (t-path) : In query operation among a forest, a true path occurs. All BFs on the t-path should return ‘yes’ for a given key as a legacy BF return ‘yes’ because each BF has the key as a member. false-path (f-path) : The number of f-paths means the number of packet misclassification. 13

Simulation result We measure PPC and MPC throughput with IP trace from NLANR PMA[10]. Average throughput ratio of 2TPC against PPC 14 PURSDAFRGPSC 19.4k29.5k39.7k37.9k

Simulation result Average throughput ratio of 2TPC against PPC In all different numbers of BFs, the larger is the buffer size (b), the higher throughput ratio is. At most 2.0 times throughput was observed in PSC. 15