Presentation is loading. Please wait.

Presentation is loading. Please wait.

When Data Management Systems Meet Approximate Hardware: Challenges and Opportunities Author: Bingsheng He (Nanyang Technological University, Singapore)

Similar presentations


Presentation on theme: "When Data Management Systems Meet Approximate Hardware: Challenges and Opportunities Author: Bingsheng He (Nanyang Technological University, Singapore)"— Presentation transcript:

1 When Data Management Systems Meet Approximate Hardware: Challenges and Opportunities Author: Bingsheng He (Nanyang Technological University, Singapore) Speaker: Jiong He (Nanyang Technological University, Singapore) 1

2 What is Approximate Hardware? Approximate hardware can trade off the accuracy of results for increased performance, reduced energy consumption, or both. Computer architecture researchers have proposed various emerging designs on approximate hardware, such as CPU, main memory and storage. 2

3 What is Approximate Hardware? 3 Precise CPU Approximate CPU Precise SSD Approximate SSD Multiplier using 8-bit mantissas reduces over 78% of energy than a full 24-bit multiplier. Improves write latencies by 1.7 × on average by trading off less than 10% of accuracy

4 A Radical Vision: Data Management on Approximate Hardware ApproxiDB is a radical data management system with its design, implementation and optimization aware of approximate hardware. – ApproxiDB will run on a hybrid machine consisting of both approximate hardware and precise hardware. – Enabling techniques: approximate query processing and precise query processing. – Several examples demonstrate the effectiveness of ApproxiDB in performance and energy consumption. 4

5 Outline Motivations Our proposal: ApproxiDB Open problems Summary 5

6 Hardware Evolution Drives Database Architectural Innovations 6 Disk-based databases Open-source: PostgreSQL, MySQL Most commercial databases Taught in current textbooks Main memory databases Built on large-sized main memory More efficient in data accessing and algorithm optimization Hot research topic

7 Hardware Evolution Drives Database Architectural Innovations (con’t) 7 (new) Parallel databases Emerge with CMP, SMP and SMT Hardware-conscious optimizations Query co-processor Hardware: GPUs, FPGA, etc. Efficient query processing with massively parallel processors Examples: GPUQP, Ocelot, etc.

8 Prediction is Always Difficult, especially about the Future 8 What is the next? Approximate hardware!

9 Approximate Hardware: An Example with Solid State Storage 9 Figures are reproduced from [14] in our paper. 1.Guard bands are to separate different analog values so that they can safely represent digital values. 2.Solid state storages adopt iterative program-and-verify (P&V). (a) Precise MLC

10 Approximate Hardware: An Example with Solid State Storage 10 Figures are reproduced from [14] in our paper. (a) Precise MLC (b) Approximate MLC The guard band is reduced -> a smaller number of P&V iterations to achieve the acceptable accuracy

11 What’s New? O1: The data are inherently imprecise, and thus can tolerate loss of accuracy. – For example, the reading from a temperature sensor may not need the accuracy to “last decimal”. O2: The query processing itself can tolerate loss of accuracy, and the result can be imprecise/approximate. – Approximate query processing O3: Although the query processing requires precise final result, a hybrid execution on precise hardware and approximate hardware could have better performance/energy consumption than the execution with precise hardware only. – The focus of this talk. 11

12 Outline Motivations Our proposal: ApproxiDB Open problems Summary 12

13 Our Proposal: Approximate and Refine Design for hybrid hardware (including both precise hardware and approximate hardare). Approximate-and-refine consists of two steps: – Step 1: use approximate hardware to obtain intermediate results (superset of the final results) within some query processing steps. – Step 2: use precise hardware to refine the intermediate results and obtain the final precise results. We show two examples (selection and merge sort) to illustrate this paradigm. 13

14 Example 1: Selection 5.7 4.3 5.7 10.510.11.112.55.78.54.38.2 (b) Selection on hybrid processor R.x Approximate execution Precise execution Precise storage Approximate storage 5.7 10.510.11.112.55.78.54.38.2 (a) Selection on precise CPU R.x Select tuples where: R.x > 4.5 and R.x < 5.9 (a) cost: 1 * 8 = 8 (b) cost:0.5 * 8+ 1 * 2 = 6

15 Example 2: Merge Sort 10.510.11.112.55.78.54.38.2 (c) Sort on precise storage r1 r2 r3 r4 r5 r6 r7 r8 1.15.710.510.1 r2 r1 8.5 r5 r6 4.38.2 r7 r8 12.5 r3 r4 12.5 r1 10.510.11.1 r3 r2r4 5.78.24.38.5 r7 r5 r8 r6 10.510.11.112.55.78.24.38.5 r3 r7 r5 r8 r6 r2 r1 r4 10.510.11.112.55.78.54.38.2 (d) Sort on hybrid storage r1 r2 r3 r4 r5 r6 r7 r8 10.410.01.212.45.88.44.48.3 r2 r1 r3 r4 r5 r6 r7 r8 10.39.91.012.25.78.44.48.3 r3 r2 r1 r4 r7 r5 r8 r6 10.29.80.912.45.58.24.48.4 r3 r7 r5 r6 r8 r2 r1 r4 10.510.11.112.55.78.24.38.5 r3 r7 r5 r8 r6 r2 r1 r4 Approximate execution Precise execution Precise storage Approximate storage Sort input in ascending order

16 Initial Design of ApproxiDB 16 Precise storage Approximate storage Storage DBMS (ApproxiDB) Precise CPU Approximate CPU Cost estimation Query optimizer Query operators (e.g., joins and sort) Access methods (e.g., scans) Other Components in DBMS CPU

17 Extensions to existing DBMS Allow users to specify what should be stored in approximate storage as well as accuracy requirement. We propose four query processing modes: – Precise storage + precise query processing – Approximate storage + precise query processing – Precise storage + approximate query processing – Approximate storage + approximate query processing 17

18 Extensions to Existing DBMS (con’t) The cost model needs to consider new factors like the tradeoff between performance/energy consumption and accuracy. Query optimizer should revisit the physical operator implementation and query processing executions to optimally utilize the hybrid system. 18

19 Outline Motivations Our proposal: ApproxiDB Open problems Summary 19

20 Open Problems We are facing a lot of open problems in ApproxiDB. – Automatic physical design to ease burden of users. – Problems in multi-level approximate hardware. – Synthetize techniques in probabilistic databases in ApproxiDB. – Query-level tradeoff between accuracy and performance/energy. 20

21 Outline Motivations Our proposal: ApproxiDB Open problems Summary 21

22 Summary We sketch a radical vision of ApproxiDB on hybrid hardware with both approximate hardware and precise hardware. We demonstrate our initial design of ApproxiDB to exploit those optimization opportunities. We conjecture that approximate hardware can be one of the interesting driving forces in database community in the future. 22

23 Q & A Thank you. Our research group: Xtra Computing Group http://pdcc.ntu.edu.sg/xtra/ http://pdcc.ntu.edu.sg/xtra/ 23


Download ppt "When Data Management Systems Meet Approximate Hardware: Challenges and Opportunities Author: Bingsheng He (Nanyang Technological University, Singapore)"

Similar presentations


Ads by Google