Research on cloud computing application in the peer-to-peer based video-on-demand systems 2012.04.13 Speaker : 吳靖緯 MA0G0101 2011 3rd International Workshop.

Slides:



Advertisements
Similar presentations
Distributed Processing, Client/Server and Clusters
Advertisements

MAP REDUCE PROGRAMMING Dr G Sudha Sadasivam. Map - reduce sort/merge based distributed processing Best for batch- oriented processing Sort/merge is primitive.
Digital Library Service – An overview Introduction System Architecture Components and their functionalities Experimental Results.
MapReduce Online Created by: Rajesh Gadipuuri Modified by: Ying Lu.
MapReduce Online Veli Hasanov Fatih University.
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
Speaker: Li-Wei Wu Advisor: Dr. Kai-Wei Ke 1.  Introduction  Coolstreaming  Proposed system  Simulation  Conclusion  Reference 2.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Network+ Guide to Networks, Fourth Edition
Network Operating Systems Users are aware of multiplicity of machines. Access to resources of various machines is done explicitly by: –Logging into the.
Introduction  What is an Operating System  What Operating Systems Do  How is it filling our life 1-1 Lecture 1.
Figure 1.1 Interaction between applications and the operating system.
Interpret Application Specifications
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
Network+ Guide to Networks, Fourth Edition Chapter 1 An Introduction to Networking.
.NET Mobile Application Development Introduction to Mobile and Distributed Applications.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
Computer Networks IGCSE ICT Section 4.
Lecture 2 – MapReduce CPE 458 – Parallel Programming, Spring 2009 Except as otherwise noted, the content of this presentation is licensed under the Creative.
Hands-On Microsoft Windows Server 2008 Chapter 8 Managing Windows Server 2008 Network Services.
Google Distributed System and Hadoop Lakshmi Thyagarajan.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Applying Twister to Scientific Applications CloudCom 2010 Indianapolis, Indiana, USA Nov 30 – Dec 3, 2010.
Network+ Guide to Networks, Fourth Edition Chapter 1 An Introduction to Networking.
NetworkProtocols. Objectives Identify characteristics of TCP/IP, IPX/SPX, NetBIOS, and AppleTalk Understand position of network protocols in OSI Model.
An Architecture for Video Surveillance Service based on P2P and Cloud Computing Yu-Sheng Wu, Yue-Shan Chang, Tong-Ying Juang, Jing-Shyang Yen speaker:
Introduction To Computer System
Version 4.0. Objectives Describe how networks impact our daily lives. Describe the role of data networking in the human network. Identify the key components.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Chapter 6 Operating System Support. This chapter describes how middleware is supported by the operating system facilities at the nodes of a distributed.
CH2 System models.
Authors: Jiann-Liang Chenz, Szu-Lin Wuy,Yang-Fang Li, Pei-Jia Yang,Yanuarius Teofilus Larosa th International Wireless Communications and Mobile.
MapReduce: Simplified Data Processing on Large Clusters Jeffrey Dean and Sanjay Ghemawat.
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
Introduction to Apache Hadoop Zibo Wang. Introduction  What is Apache Hadoop?  Apache Hadoop is a software framework which provides open source libraries.
Introduction to Hadoop and HDFS
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
MediaGrid Processing Framework 2009 February 19 Jason Danielson.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
MACIASZEK, L.A. (2001): Requirements Analysis and System Design. Developing Information Systems with UML, Addison Wesley Chapter 6 - Tutorial Guided Tutorial.
An Architecture for Distributed High Performance Video Processing in the Cloud Speaker : 吳靖緯 MA0G IEEE 3rd International Conference.
Performance Evaluation of Image Conversion Module Based on MapReduce for Transcoding and Transmoding in SMCCSE Speaker : 吳靖緯 MA0G IEEE.
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
NETWORK HARDWARE AND SOFTWARE MR ROSS UNIT 3 IT APPLICATIONS.
Presented by: Katie Woods and Jordan Howell. * Hadoop is a distributed computing platform written in Java. It incorporates features similar to those of.
Research of P2P Architecture based on Cloud Computing Speaker : 吳靖緯 MA0G0101.
HADOOP DISTRIBUTED FILE SYSTEM HDFS Reliability Based on “The Hadoop Distributed File System” K. Shvachko et al., MSST 2010 Michael Tsitrin 26/05/13.
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
Data Communications and Networks Chapter 9 – Distributed Systems ICT-BVF8.1- Data Communications and Network Trainer: Dr. Abbes Sebihi.
Authors: Jiann-Liang Chenz, Szu-Lin Wuy, Yang-Fang Li, Pei-Jia Yang,
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
INTRODUCTION TO HADOOP. OUTLINE  What is Hadoop  The core of Hadoop  Structure of Hadoop Distributed File System  Structure of MapReduce Framework.
Presented by Deepak Varghese Reg No: Introduction Application S/W for server load balancing Many client requests make server congestion Distribute.
BIG DATA/ Hadoop Interview Questions.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Lecture 13 Parallel Processing. 2 What is Parallel Computing? Traditionally software has been written for serial computation. Parallel computing is the.
Domain Name System The Technology Context Presentation.
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
Cloud based Open Source Backup/Restore Tool
Storage Virtualization
MapReduce Computing Paradigm Basics Fall 2013 Elke A. Rundensteiner
An Introduction to Computer Networking
Ch 4. The Evolution of Analytic Scalability
Network+ Guide to Networks, Fourth Edition
Chapter 2: Operating-System Structures
Chapter 2: Operating-System Structures
MapReduce: Simplified Data Processing on Large Clusters
Presentation transcript:

Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop on Intelligent Systems and Applications (ISA), On page(s): 1- 4, May 2011 Authors: SUN Bing-jue, WU Kai-jun

Outline Introduction Related work System design Simulation and result analysis Conclusion 2

Introduction In the peer-to-peer based video-on-demand systems, when the number of users online reaches a certain scale. It will greatly influence the bandwidth and the speed of response server. According to the usage analysis of P2P/VOD video system, user demand video via the cache server which will generate a cache file for the next user call. 3

Introduction This raises the question that if the original video files have So it is possible to generate 1,000 cache files which remain in the cache server within the server's disk. When the buffer disk is full, it will clear the cache files automatically and move the new file to the disk, but in the process of deleting and writing files, the client appears the phenomenon of the video buffer, which is blocking in the play. 4

Introduction Traditional method is to use load-balancing with multiple cache servers to solve this problem. Because load-balancing server can’t distinguish between user needs, resulting in more than one cache servers stored the same cache files in the video. According to the thinking of "cloud computing", the paper designed to work a cross platform of cache server cluster based on operation stored of Hadoop synchronization which applied in P2P/VoD. 5

Related work Figure 1 is a flow chart of Hadoop for data processing and transmission of map, reduce data phase. 6

Related work Hadoop processing needs to be done by using 3 data transfer process: file input to the map, map transfer to reduce, reduce output to HDFS. When they run in the same time, system manage the data segmentation, job scheduling, load balancing, fault tolerance and network communications. MapReduce provides a simple interface that enables researchers to design parallel and distributed applications easily. 7

Related work The data flow analysis on demand Map read data from the server in the way to pull the data. The output data of Map side is first written to local disk, it notified JobTracker when the task is completed. 8

Related work The integration of computing and storage 9

System design The whole system is divided into hardware and software. Hardware components include: Web system modules, file server system modules, DNS server, storage module and the composition of cache server cluster. The design of software components: the operating environment of server will be mainly based on Windows 2003 server operating system and FreeBSD operating systems. 10

System design Database store data by using a distributed way of SQLserver. Parts of the site use ASP.net compiler to achieve the management of background data and the view of front static page. Play system of video program uses the design of P2P technology. 11

System design Video services implement the process in the system which was shown in the following. Internet users first access to the video website for inquiring the video information, then send the video request. Web server inquire the information of video address in the database after it obtain video request, and then send the requests of video data to the file server. 12

System design File server send video data for users in a variety of ways: Direct transmission, transmission through the cache server, users through other Internet video services transmitted by P2P. The eventual realization of the video data streams transmitted parallelly to the client, complete high-definition video playback. 13

Simulation and result analysis The experimental data from access logs of our school VOD system user. The average daily traffic of VOD system is up to 1000 requests / day, the system of the user logs records the various interactive operations in detail, a unique user ID correspond to one visit, but the user's interactive operation, one visit may contain multiple user’s requests. 14

Simulation and result analysis The experiment compare the traditional system of load balancing plus 3 sets of buffer server with Hadoop cluster, the results shown in Table 1. 15

Simulation and result analysis Figure4 shows that when the traditional manner executes, execution time almost linearly increases. 16

Simulation and result analysis Storage occupancy rate of Cache server is true as shown in Figure 5. 17

Simulation and result analysis Theoretically, for a particular cluster, the trend of execution time that is slowly grow, with the growth of input data, will not last. From this experiment can be seen, when the configuration of parameters accord with the scale of cluster, Hadoop framework compared to the traditional way has a huge advantage for time- consuming aspects of data processing in large. 18

Conclusion The simulation of P2P/VoD video systems which designed based on "Hadoop" technology theoretically realizes unlimited expansion and unlimited data computing power. This design does not require too much hardware investment, and transform the existing equipment of network transmission, you can achieve fast data transmission. And facing the possibly increases of future traffic, the implementation does not change the system structure and upgrading core equipment, just add the appropriate equipment to meet the demand. 19

Conclusion If such P2P/VoD system which is promoted widely in the major institutions can form a large cluster of the streaming media. These cluster nodes joined together to form a huge video network through the network between the cache server farms, streaming media will become the main platform for online teaching. 20