Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lightning Queries Miguel Branco. Obs. 1: Eating our own (dog) food Data Database Obs. 2: Data Deluge How many of you use databases to store your own data?

Similar presentations


Presentation on theme: "Lightning Queries Miguel Branco. Obs. 1: Eating our own (dog) food Data Database Obs. 2: Data Deluge How many of you use databases to store your own data?"— Presentation transcript:

1 Lightning Queries Miguel Branco

2 Obs. 1: Eating our own (dog) food Data Database Obs. 2: Data Deluge How many of you use databases to store your own data? Which one are we going to “move”?

3 “Lightning Fast” Queries? Time Loading “Overhead” Preparation “Overhead”

4 In-situ databases! Large collections of files Integration with existing tools Multiple data formats Changing areas of interest … lack of trust in database vendors … databases “forever owning” the data Databases that operate directly on raw data files

5 Great, but what about … … performance? competitive

6 Trap FS calls to maintain caches Scan FS buffer to build caches or maps Year,Make,Model,Description,Price 1997,Ford,E350,"ac, abs, moon",3000.00 1999,Chevy,"Venture ""Extended Edition""","",4900.00 1999,Chevy,"Venture ""Extended Edition, Very Large""","",5000.00 1996,Jeep,Grand Cherokee,"MUST SELL! air, moon roof, loaded",4799.00 != Data Loading User does not need to control when, what, how or where data is cached Blocks of raw files Row-store Col-store Best suited for … raw file format user queries Indices over … raw files cached data Positional Maps Caching File System integration Indices In-Situ Database Usability Data Deluge Positional Maps + Caching + Indices + File System Integration

7 miguel.branco@epfl.ch EPFL


Download ppt "Lightning Queries Miguel Branco. Obs. 1: Eating our own (dog) food Data Database Obs. 2: Data Deluge How many of you use databases to store your own data?"

Similar presentations


Ads by Google