Advanced data management Jiaheng Lu Department of Computer Science Renmin University of China

1 Advanced data management Jiaheng Lu Department of Computer Science Renmin University of China

2 Course purpose: Taught in English. The objective is to expose graduate students to exciting data management topics.

3 Course contents: cloud computing and cloud data management; XML data management; column-store databases; data processing in bioinformatics.

4 Lecturer: Academic experience. 2006.9 ~ 2008.6: University of California, Irvine, postdoctoral researcher. 2002.8 ~ 2006.8: National University of Singapore, PhD candidate. 1998.9 ~ 2001.1: Shanghai Jiao Tong University, Master's candidate.

5 University of California, Irvine

6 Postdoctoral research: data integration in medical systems [US patent]; approximate string search [ICDE08].

7 National University of Singapore

8 Course grading: report 30%; Google App Engine 30%; in-class presence and quiz 40%.

9 2014-8-16 Any questions or comments?

10 Cloud computing


12 Why do we use cloud computing?

13 Case 1: Write a file and save it. If the computer goes down, the file is lost. In the cloud, files are always stored and never lost.

14 Why do we use cloud computing? Case 2: Use IE: download, install, use. Use QQ: download, install, use. Use C++: download, install, use. ... Instead, get the service from the cloud.


16 What is cloud and cloud computing? Cloud: on-demand resources or services over the Internet, with the scale and reliability of a data center.

17 What is cloud and cloud computing? Cloud computing is a style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet. Users need not have knowledge of, expertise in, or control over the technology infrastructure in the "cloud" that supports them.

18 The architecture of cloud computing system

19 Characteristics of cloud computing. Virtual: software, databases, Web servers, operating systems, storage, and networking are provided as virtual servers. On demand: processors, memory, network bandwidth, and storage can be added and removed as needed.
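The "on demand" characteristic can be illustrated with a simple scaling rule. This is a minimal sketch, not any provider's actual algorithm: the function name `instances_needed`, the per-instance capacity, and the lower and upper bounds are all hypothetical values chosen for illustration.

```python
import math


def instances_needed(requests_per_sec, capacity_per_instance,
                     min_instances=1, max_instances=20):
    """Return how many virtual servers to run for the current load.

    Hypothetical rule: enough instances to cover the load, clamped
    between a floor and a ceiling.
    """
    if capacity_per_instance <= 0:
        raise ValueError("capacity_per_instance must be positive")
    wanted = math.ceil(requests_per_sec / capacity_per_instance)
    return max(min_instances, min(max_instances, wanted))


if __name__ == "__main__":
    # Resources are added as load grows and removed as it shrinks.
    for load in (0, 120, 950, 5000):
        count = instances_needed(load, capacity_per_instance=100)
        print(load, "req/s ->", count, "instances")
```

With an assumed capacity of 100 requests per second per instance, a load of 950 req/s asks for 10 instances, while a load of 5000 req/s is capped at the assumed ceiling of 20.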

20 Types of cloud service: IaaS (Infrastructure as a Service); PaaS (Platform as a Service); SaaS (Software as a Service).

21 SaaS: Software delivery model. No hardware or software to manage. Service delivered through a browser. Customers use the service on demand. Instant scalability.

22 SaaS examples. Your current CRM package is not managing the load, or you simply don't want to host it in-house: use a SaaS provider such as ... Your email is hosted on an Exchange server in your office and it is very slow: outsource this using Hosted Exchange.

23 PaaS: Platform delivery model. Platforms are built upon infrastructure, which is expensive. Estimating demand is not a science! Platform management is not fun!

24 PaaS examples. You need to host a large file (5 MB) on your website and make it available to 35,000 users for only two months: use CloudFront from Amazon. You want to start storage services on your network for a large number of files and you do not have the storage capacity: use Amazon S3.

25 IaaS: Computer infrastructure delivery model. A platform virtualization environment. Computing resources, such as storage and processing capacity. Virtualization taken a step further.

26 IaaS examples. You want to run a batch job but you don't have the infrastructure necessary to run it in a timely manner: use Amazon EC2. You want to host a website, but only for a few days: use Flexiscale.

27 Cloud computing and other computing techniques


29 An Industry Transformed Delgo

30 Shrek, Delgo, and Others. Why did DreamWorks use this? Upsides? Downsides?

31 Grid Computing & Cloud Computing. They share a lot in common: intention, architecture, and technology. They differ in: programming model, business model, compute model, applications, and virtualization.

32 Grid Computing & Cloud Computing. The problems are mostly the same: manage large facilities; define methods by which consumers discover, request, and use resources provided by the central facilities; implement the often highly parallel computations that execute on those resources.

33 Grid Computing & Cloud Computing: Virtualization. Grid: grids do not rely on virtualization as much as clouds do; each individual organization maintains full control of its resources. Cloud: virtualization is an indispensable ingredient for almost every cloud.


35 Any questions or comments?

36 Google App Engine

37 Google App Engine. Does one thing well: running web apps. Simple app configuration. Scalable. Secure.

38 App Engine Does One Thing Well. App Engine handles HTTP(S) requests, nothing else. Think RPC: request in, processing, response out. Works well for the web and AJAX; also for other services. App configuration is dead simple. No performance tuning needed.
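The "request in, processing, response out" model can be sketched with a plain WSGI callable from the Python standard library. This is a stand-in for illustration, not App Engine's own API: the names `application` and `call_once` and the `name` query parameter are hypothetical, and the request is simulated in-process rather than served over HTTP.

```python
from urllib.parse import parse_qs
from wsgiref.util import setup_testing_defaults


def application(environ, start_response):
    """One request in, some processing, one response out."""
    params = parse_qs(environ.get("QUERY_STRING", ""))
    name = params.get("name", ["world"])[0]
    body = ("Hello, %s!" % name).encode("utf-8")
    headers = [
        ("Content-Type", "text/plain; charset=utf-8"),
        ("Content-Length", str(len(body))),
    ]
    start_response("200 OK", headers)
    return [body]


def call_once(app, query=""):
    """Simulate a single HTTP GET against a WSGI app (no network needed)."""
    environ = {}
    setup_testing_defaults(environ)  # fills in the required WSGI keys
    environ["QUERY_STRING"] = query
    captured = {}

    def start_response(status, headers, exc_info=None):
        captured["status"] = status
        captured["headers"] = headers

    body = b"".join(app(environ, start_response))
    return captured["status"], body.decode("utf-8")


if __name__ == "__main__":
    print(call_once(application))
    print(call_once(application, "name=class"))
```

The handler keeps no state between calls, which is what lets a platform run as many copies of it as the load requires.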

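39 App Engine Architecture [diagram: requests and responses (req/resp) pass through a Python VM process containing the app and the stdlib, with a read-only file system (R/O FS); stateful APIs: memcache, datastore; stateless APIs: mail, images, urlfetch]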
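One common way for a stateless request handler to combine a cache and a persistent store, the two stateful services named on this slide, is a read-through cache. The slide does not describe this pattern, so the sketch below is an assumption for illustration only: `FakeDatastore` and `cached_get` are hypothetical names, and a plain dict stands in for the cache rather than any real memcache or datastore API.

```python
class FakeDatastore:
    """Hypothetical stand-in for a persistent store; counts reads."""

    def __init__(self, rows):
        self._rows = dict(rows)
        self.reads = 0

    def get(self, key):
        self.reads += 1
        return self._rows.get(key)


def cached_get(key, cache, store):
    """Try the cache first; on a miss, read the store and fill the cache."""
    if key in cache:
        return cache[key]
    value = store.get(key)
    if value is not None:
        cache[key] = value
    return value


if __name__ == "__main__":
    store = FakeDatastore({"user:1": "Alice"})
    cache = {}  # a dict stands in for the cache service
    print(cached_get("user:1", cache, store), store.reads)  # miss: reads the store
    print(cached_get("user:1", cache, store), store.reads)  # hit: no new read
```

The second lookup is served from the cache, so the store's read counter stays at 1.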

40 How to use Google App Engine: download Java 6; download Eclipse and the Google plugin; register a Google user account; create an application (Python or Java) and upload the code.


42 In-class quiz. Please answer all questions. You may be asked to answer a question later. Your performance will affect your final score.

43 Study Google App Engine ava/gettingstarted/
