Column Oriented Database By: Deepak Sood Garima Chhikara Neha Rani Vijayita Gumber.


1 Column Oriented Database By: Deepak Sood Garima Chhikara Neha Rani Vijayita Gumber

2 Columnar Database Systems Store data by column, keeping all the values of an attribute together. Handle fixed-length data. 2-D data represented at the conceptual level is mapped to a 1-D data structure at the physical level.
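
The 2-D to 1-D mapping can be illustrated with a minimal sketch. The three-row table and the function names below are hypothetical, chosen only for illustration; a real system would lay out bytes on disk pages rather than Python lists.

```python
# Hypothetical three-row table (id, name, region), used only for illustration.
rows = [
    (1, "Asha", "Mumbai"),
    (2, "Ravi", "Delhi"),
    (3, "Meena", "Mumbai"),
]


def to_row_layout(table):
    """Flatten tuple by tuple: all attributes of row 0, then row 1, and so on."""
    return [value for row in table for value in row]


def to_column_layout(table):
    """Flatten column by column: all values of attribute 0, then attribute 1, and so on."""
    return [value for column in zip(*table) for value in column]


row_layout = to_row_layout(rows)
column_layout = to_column_layout(rows)
print(row_layout)
print(column_layout)
```

In the column layout, the values of one attribute are contiguous, which is what lets a query skip the attributes it does not need.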

3 In a row store, data is stored on disk tuple by tuple. In a column store, data is stored on disk column by column. Row Store: (+) Easy to add/modify a record. (-) Might read in unnecessary data. Column Store: (+) Only need to read in relevant data. (-) Tuple writes require multiple accesses.

4 Row Store and Column Store Most queries do not process all the attributes of a relation. For example, the query SELECT c.name, c.address FROM CUSTOMER AS c WHERE c.region = 'Mumbai'; processes only three attributes of the CUSTOMER relation (name, address and region), but the relation can have many more than three attributes. Column stores are more I/O efficient for read-only queries because they read only the attributes that a query accesses.
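
As a rough sketch of why this saves I/O, the query above can be evaluated over a table held as one list per column, recording which columns are actually read. The `customer` data, the extra `id` and `phone` attributes, and the function name are assumptions made for the example.

```python
# Hypothetical CUSTOMER relation stored column by column (one list per attribute).
customer = {
    "id": [1, 2, 3],
    "name": ["Asha", "Ravi", "Meena"],
    "address": ["12 Hill Rd", "4 Lake St", "9 Sea View"],
    "region": ["Mumbai", "Delhi", "Mumbai"],
    "phone": ["111", "222", "333"],
}


def select_name_address(columns, region):
    """Return (name, address) pairs for a region and the columns that were read."""
    touched = set()

    def read(column_name):
        # Stand-in for a disk read of one column.
        touched.add(column_name)
        return columns[column_name]

    # Evaluate the predicate on the region column only.
    positions = [i for i, r in enumerate(read("region")) if r == region]
    names = read("name")
    addresses = read("address")
    result = [(names[i], addresses[i]) for i in positions]
    return result, sorted(touched)


result, touched = select_name_address(customer, "Mumbai")
print(result)
print(touched)
```

Only three of the five columns are read; a row store would have fetched every attribute of every tuple it scanned.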

5 Why Column Store? Faster: fetches only the columns required by a query. Better cache effects. Better compression. Data warehousing applications perform mostly read operations. Row-oriented stores have the overhead of seeking through all columns. A column store can be slower for some applications, such as OLTP with many row inserts.

6 Query Execution - Operators Select: Same as relational algebra, but produces a bit string. Project: Same as relational algebra. Join: Joins projections according to predicates. Aggregation: SQL-like aggregates. Sort: Sorts all columns of a projection. Decompress: Converts a compressed column to its uncompressed representation.

7 Query Execution - Operators Mask(Bitstring B, Projection Cs): Emits only those values whose corresponding bits are 1. Concat: Combines one or more projections sorted in the same order into a single projection. Permute: Permutes a projection according to the ordering defined by a join index. Bitstring operators: Band (bitwise AND), Bor (bitwise OR), Bnot (complement).
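
The bit-string operators can be sketched as follows. This is a simplified illustration, assuming bit strings are plain lists of 0/1 and a projection is a single column; the function signatures and sample columns are hypothetical and not taken from any particular system.

```python
def select(column, predicate):
    """Select: evaluate a predicate per value and return a bit string."""
    return [1 if predicate(value) else 0 for value in column]


def band(a, b):
    """Band: bitwise AND of two bit strings of equal length."""
    return [x & y for x, y in zip(a, b)]


def bor(a, b):
    """Bor: bitwise OR of two bit strings of equal length."""
    return [x | y for x, y in zip(a, b)]


def bnot(a):
    """Bnot: complement of a bit string."""
    return [1 - x for x in a]


def mask(bits, column):
    """Mask: emit only the values whose corresponding bit is 1."""
    return [value for bit, value in zip(bits, column) if bit]


# Hypothetical columns of one projection, aligned by position.
region = ["ASIA", "EUROPE", "ASIA", "ASIA"]
year = [1992, 1995, 1999, 1996]
revenue = [100, 200, 300, 400]

in_asia = select(region, lambda r: r == "ASIA")
in_range = select(year, lambda y: 1992 <= y <= 1997)
both = band(in_asia, in_range)
selected_revenue = mask(both, revenue)
print(both, selected_revenue)
```

Each predicate touches only its own column, and the revenue column is read only at the positions that survive both predicates.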

8 Column-store simulation in a row-store 1. Vertical Partitioning: Each column is a relation. 2. Index-Only: A B+ tree on each column. 3. Materialized Views: An optimal set of views for every query.

9 Column-Oriented Execution Four optimization techniques are introduced to improve the performance of column-stores: Compression, Late Materialization, Block Iteration, Invisible Join.
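
As one example of the first technique, the sketch below uses run-length encoding (RLE), a common column compression scheme. The slide does not say which scheme is meant, so RLE is an assumption here, and the function names and sample column are hypothetical.

```python
from itertools import groupby


def rle_encode(column):
    """Compress a column into (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(column)]


def rle_decode(runs):
    """Decompress (value, run_length) pairs back into the original column."""
    return [value for value, length in runs for _ in range(length)]


def count_equal(runs, target):
    """Count matching values directly on the compressed form, without decoding."""
    return sum(length for value, length in runs if value == target)


# Hypothetical sorted region column; sorting produces long runs.
region = ["ASIA", "ASIA", "ASIA", "EUROPE", "EUROPE", "ASIA"]
runs = rle_encode(region)
print(runs)
print(count_equal(runs, "ASIA"))
```

Values within a column share a type and often repeat, which is why column layouts tend to compress better than row layouts, and some operators can work on the compressed form directly.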

10 Find the total revenue from Asian customers who purchase a product supplied by an Asian supplier between 1992 and 1997, grouped by nation of the customer, nation of the supplier and year of transaction.

11 Phase 1 Invisible Join

12 Phase 2 Invisible Join

13 Phase 3 Invisible Join
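
The figures for the three phases are not part of this transcript. The sketch below follows the invisible join as it is usually described: phase 1 applies each predicate to its dimension table to collect matching keys, phase 2 probes the fact table's foreign-key columns to build bit strings and ANDs them, and phase 3 uses the surviving positions to look up the dimension values needed for grouping. The table names, column names, and data are all hypothetical.

```python
# Hypothetical star schema, stored column by column.
customer = {"custkey": [1, 2, 3], "region": ["ASIA", "EUROPE", "ASIA"],
            "nation": ["INDIA", "FRANCE", "CHINA"]}
supplier = {"suppkey": [1, 2], "region": ["ASIA", "AMERICA"],
            "nation": ["JAPAN", "BRAZIL"]}
date = {"datekey": [19920101, 19980101, 19950615], "year": [1992, 1998, 1995]}
lineorder = {
    "custkey": [1, 2, 3, 1, 3, 1],
    "suppkey": [1, 1, 2, 1, 1, 1],
    "orderdate": [19920101, 19920101, 19950615, 19980101, 19950615, 19920101],
    "revenue": [100, 200, 300, 400, 500, 50],
}


def invisible_join(lineorder, customer, supplier, date):
    # Phase 1: apply each predicate to its dimension table, keep matching keys.
    cust_keys = {k for k, r in zip(customer["custkey"], customer["region"]) if r == "ASIA"}
    supp_keys = {k for k, r in zip(supplier["suppkey"], supplier["region"]) if r == "ASIA"}
    date_keys = {k for k, y in zip(date["datekey"], date["year"]) if 1992 <= y <= 1997}

    # Phase 2: probe each foreign-key column, build bit strings, AND them.
    cust_bits = [k in cust_keys for k in lineorder["custkey"]]
    supp_bits = [k in supp_keys for k in lineorder["suppkey"]]
    date_bits = [k in date_keys for k in lineorder["orderdate"]]
    positions = [i for i, (c, s, d) in enumerate(zip(cust_bits, supp_bits, date_bits))
                 if c and s and d]

    # Phase 3: at surviving positions only, fetch dimension values and aggregate.
    cust_nation = dict(zip(customer["custkey"], customer["nation"]))
    supp_nation = dict(zip(supplier["suppkey"], supplier["nation"]))
    year_of = dict(zip(date["datekey"], date["year"]))
    totals = {}
    for i in positions:
        key = (cust_nation[lineorder["custkey"][i]],
               supp_nation[lineorder["suppkey"][i]],
               year_of[lineorder["orderdate"][i]])
        totals[key] = totals.get(key, 0) + lineorder["revenue"][i]
    return totals


totals = invisible_join(lineorder, customer, supplier, date)
print(totals)
```

The dimension attributes used for grouping are fetched only for fact rows that pass every predicate, which is the late-materialization idea applied to a join.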

14 Applications Analyzing unorganized BIG DATA with improved granularity. Data Warehouses and Business Intelligence. Online Analytical Processing. Data Marts Development. Data Mining.

15 THANK YOU !!

