Presentation is loading. Please wait.

Presentation is loading. Please wait.

PSoup: A System for streaming queries over streaming data

Similar presentations


Presentation on theme: "PSoup: A System for streaming queries over streaming data"— Presentation transcript:

1 PSoup: A System for streaming queries over streaming data
Sirish Chandrasekaran Michael J. Franklin The VLDB Journal (2003) 12: Study presented by: Hasmukh Patel Frequently, presenters must deliver material of a technical nature to an audience unfamiliar with the topic or vocabulary. The material may be complex or heavy with detail. To present technical material effectively, use the following guidelines from Dale Carnegie Training®. Consider the amount of time available and prepare to organize your material. Narrow your topic. Divide your presentation into clear segments. Follow a logical progression. Maintain your focus throughout. Close the presentation with a summary, repetition of the key steps, or a logical conclusion. Keep your audience in mind at all times. For example, be sure data is clear and information is relevant. Keep the level of detail and vocabulary appropriate for the audience. Use visuals to support key points or steps. Keep alert to the needs of your listeners, and you will have a more receptive audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

2 CS561: Advance Databases (Spring-2004), WPI
Plan of presentation Why PSoup? Applications for PSoup PSoup Architecture PSoup data structures PSoup in action Eddy in Psoup Summary In your opening, establish the relevancy of the topic to the audience. Give a brief preview of the presentation and establish value for the listeners. Take into account your audience’s interest and expertise in the topic when choosing your vocabulary, examples, and illustrations. Focus on the importance of the topic to your audience, and you will have more attentive listeners. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

3 Why one more database system like PSoup?
Conventional database system: Queries are streamed over static data Results are generated for a distinct point in time data Continuous query system: Data is streamed over static queries Results are generated continuously How to handle stream of queries over streams of data This is the problem domain where PSoup helps If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

4 Candidate applications for PSoup
Data Recharging: PDA like devices periodically connect to the network to refresh their data content Monitoring: Periodically getting health status (various stats) of network switches If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

5 PSoup Architecture (QPE)
If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. PSoup is memory based and part of TelegraphCQ built at UC Berkeley. (Source: Original paper) CS561: Advance Databases (Spring-2004), WPI 5/13/2019

6 PSoup Architecture (QPE)
Extends SQL by adding window clause ‘Begin- End’ e.g. Select r.c, s.y From data_stream r, data_stream s Where (r.a = 5 and r.b > s.x) Begin (Now – 20) End (Now) Standing Query Clause (SQC) If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. Window Clause CS561: Advance Databases (Spring-2004), WPI 5/13/2019

7 PSoup Data structures used
Stem: STatE Module Keeps state of data and queries Provides insert and probe methods over their One Stem for each stream and one Stem for all queries Data Stem Query Stem Tupple-id R.a R.b 71 1 72 5 4 73 9 74 8 75 12 6 Query-id Predicate 11 R.a = 5 12 R.a > 2 and R.a <= 10 13 R.a = 4 and R.b > 5 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

8 PSoup Data structures used
Data index: Index tree on each search attribute of a stream R.a 10 40 4 20 30 70 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. 1 2 3 4 6 8 10 50 60 CS561: Advance Databases (Spring-2004), WPI 5/13/2019

9 PSoup Data structures used
Predicate index: Red-Black tree Balanced binary search tree on values of predicate attribute. Keeps query ids in link list (for each relational op) attached to each node. (I still do not understand how this tree helps!!) For complex predicates keeps link list instead of tree 6 <= < = > 13 11 R.a = 5 12 R.a > 5 and R.a <= 10 13 R.a > 8 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. 8 2 <= < = 11 > 12 <= 12 < = > 1 5 7 10 CS561: Advance Databases (Spring-2004), WPI 5/13/2019

10 PSoup Data structures used
Result structure: Indicates which tuples satisfy which queries Can be a sparse matrix (bit map, shown here) or link list assigned to each query In bitmap rows are ordered by the timestamp (physical tuple-id), columns are ordered by query-id 11 12 13 71 72 T 73 74 75 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

11 PSoup in action (Current state)
Query Stem Query-id Predicate 11 R.a = 5 12 R.a > 2 and R.a <= 10 13 R.a = 4 and R.b > 5 Data Stem 11 12 13 71 72 T 73 74 75 Tupple-id R.a R.b 71 1 72 5 4 73 9 74 8 75 12 6 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

12 PSoup in action (New Query)
Query Stem Query-id Predicate 11 R.a = 5 12 R.a > 2 and R.a <= 10 13 R.a = 4 and R.b > 5 14 R.A < 5 and R.b = 8 Data Stem 11 12 13 14 71 72 T 73 74 75 Tupple-id R.a R.b 71 1 72 5 4 73 9 74 8 75 12 6 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

13 PSoup in action (New Tuple)
Query Stem Query-id Predicate 11 R.a = 5 12 R.a > 2 and R.a <= 10 13 R.a = 4 and R.b > 5 14 R.a < 5 and R.b = 8 Data Stem 11 12 13 14 71 72 T 73 74 75 76 Tupple-id R.a R.b 71 1 72 5 4 73 9 74 8 75 12 6 76 3 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

14 PSoup in action (Current State) Join scenario
Query Stem Query-id Predicate 11 R.a = S.a and S.b > 2 12 R.a >= 5 and R.b > S.a and S.b = 3 Result – Q11 Result – Q12 Data Stem - R Data Stem - S 81 82 83 84 85 71 T 72 73 74 75 81 82 83 84 85 71 T 72 73 74 75 Tupple-id R.a R.b 71 5 15 72 1 2 73 7 10 74 4 75 3 Tupple-id S.a S.b 81 5 3 82 4 83 6 84 8 85 2 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

15 PSoup in action (New Query) Join scenario
Query Stem S-Tuple-id Predicate 81 R.a = 8 and R.b = 3 82 R.a =6 and R.b = 4 83 R.a =9 and R.b = 3 84 R.a = 9 and R.b = 8 85 R.a = 5 and R.b = 3 Query-id Predicate 11 R.a = S.a and S.b > 2 12 R.a >= 5 and R.b > S.a and S.b = 3 13 R.a = S.a+3 and R.b = S.b and S.a < 10 75 1 2 3 Result – Q11 Result – Q12 Data Stem - R Data Stem - S 81 82 83 84 85 71 T 72 73 74 75 81 82 83 84 85 71 T 72 73 74 75 Tupple-id R.a R.b 71 5 15 72 1 2 73 7 10 74 4 75 3 Tupple-id S.a S.b 81 5 3 82 4 83 6 84 8 85 2 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. Result – Q13 81 82 83 84 85 71 72 73 74 75 T 4 CS561: Advance Databases (Spring-2004), WPI 5/13/2019

16 PSoup in action (New Tuple) Join scenario
Query Stem R-Tuple-id Predicate 76 6 =S.a and S.b > 2 7 > S.a and S.b = 3 6 = S.a+3 and 7 = S.b and S.a < 10 Query-id Predicate 11 R.a = S.a and S.b > 2 12 R.a >= 5 and R.b > S.a and S.b = 3 13 R.a = S.a+3 and R.b = S.b and S.a < 10 83, 84 81, 83 3 2 1 Result – Q11 Result – Q12 Data Stem - R Data Stem - S 81 82 83 84 85 71 T 72 73 74 75 76 81 82 83 84 85 71 T 72 73 74 75 76 Tupple-id R.a R.b 71 5 15 72 1 2 73 7 10 74 4 75 3 76 6 Tupple-id S.a S.b 81 5 3 82 4 83 6 84 8 85 2 If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. Result – Q13 81 82 83 84 85 71 72 73 74 75 T 76 4 CS561: Advance Databases (Spring-2004), WPI 5/13/2019

17 CS561: Advance Databases (Spring-2004), WPI
Eddy in PSoup Provides Stream prefix consistent state (put tuples in result structure in the order in which they enter) Stores new tuples in New Tuple Pool (NTP) Stores temporary tuples in TTP Begins by picking tuple from NTP and then processes all tuples in TTP, before it picks another new tuple from NTP Serializes the effects of new tuples on the Result structure in the order in which they enter the system If you have several points, steps, or key ideas use multiple slides. Determine if your audience is to understand a new idea, learn a process, or receive greater depth to a familiar concept. Back up each point with adequate explanation. As appropriate, supplement your presentation with technical support data in hard copy or on disc, , or the Internet. Develop each point adequately to communicate with your audience. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

18 CS561: Advance Databases (Spring-2004), WPI
Summary: Treats Query and Data streams analogously Performs multi-query evaluation by joining query-stem with data-stems Supports queries that requires access to both data that arrived prior to the query and also data that enter later Separates computation of the result from the delivery by materializing the results Determine the best close for your audience and your presentation. Close with a summary; offer options; recommend a strategy; suggest a plan; set a goal. Keep your focus throughout your presentation, and you will more likely achieve your purpose. CS561: Advance Databases (Spring-2004), WPI 5/13/2019

19 CS561: Advance Databases (Spring-2004), WPI
Questions??? CS561: Advance Databases (Spring-2004), WPI 5/13/2019

20 CS561: Advance Databases (Spring-2004), WPI
My sincere thanks to Prof. Elke for giving me this opportunity of study and Thanks to you friends for listening me CS561: Advance Databases (Spring-2004), WPI 5/13/2019


Download ppt "PSoup: A System for streaming queries over streaming data"

Similar presentations


Ads by Google