Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades Evangelos P. Markatos ICS-FORTH Heraklion,

Similar presentations


Presentation on theme: "1 One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades Evangelos P. Markatos ICS-FORTH Heraklion,"— Presentation transcript:

1 1 One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades danton@ics.forth.gr Evangelos P. Markatos markatos@ics.forth.gr ICS-FORTH Heraklion, Crete, Hellas Constantine Dovrolis dovrolis@cc.gatech.edu College of Computing Georgia Tech By X.F. Chen

2 2 Content  File Sharing & OCH File Sharing & OCH  Clients Characteristics Clients Characteristics  Service Architectuer Service Architectuer  Rapidshare V.S. Bittorrent Rapidshare V.S. Bittorrent  Content Indexing Sites Content Indexing Sites  Conclusion Conclusion

3 3 File Sharing One of the most popular Internet user activities 60-70% of total traffic volume Since 2005, a large number of One-Click Hosting (OCH) services have made their appearance –Mainly used for file-sharing

4 4 What's OCH Provide file hosting services at no cost Provide unique URLs to the uploader that she can share with her friends & communities Provide no indexing for the hosted files

5 5 Upload Phase

6 6 Download Phase

7 7 Collected Data Two monitoring points –Monitor1: NREN, ~10K total users (~750 IP) –Monitor2: University, ~1K total users (~450 IP) BACK

8 8 Flow Sizes 90% of the flows < 150KBytes –Probably page access flows Upload flows range from several MB to 2GB Two main drop – maximum upload file size limit

9 9 Free Vs. Paying Clients Rate-limits free user downloads to 0.2Mb/s – 2.0Mb/s Only 20% of the users experience greater download throughputs  Subscribers

10 10 File Popularity  Unique downloaders per file 75% of the files downloaded only once Only 0.05% downloaded by more than 5 users  Caching will be useless in OCH BACK

11 11 Service Architecture  Try to infer the architecture of the RapidShare service by answering: What is the total number of servers used by RapidShare? Where are these servers located? How many copies of a file in RapidShare? How does RapidShare balance the load between servers?  How is this architecture different from traditional content distribution networks?

12 12 Total Number of Servers Used 5,291 distinct server IP addresses 36 /24 subnets 8 different ISPs Large increase in number of servers during Sep'08 Infrastructure Update

13 13 Server Location  Discover the geographical location of the server infrastructure Performed a number of traceroutes from different planetlab locations Used minimum RTT to infer distance from landmarks

14 14 Server Location cont. min-RTT values show a single central datacenter Datacenter closest to central-European

15 15 Content Replication  What is the number of servers that store each file? Used TOR as a geographically distributed downloader –421 different exit nodes Requested 22,000 RapidShare file URLs  Each file indexed by exactly 1 server Each file served by exactly 12 servers (group)

16 16 Server Upload Balancing  Which server group will host a newly uploaded file? 50000 file upload requests Log upload group-id  Recently added groups have a higher likelihood of being selected as the upload group

17 17 Server Download Balancing  Which download server of that group will be used upon a download request? 1000 back-to-back file download requests Log download server  Indexing servers are less likely to be selected as download server

18 18 OCH services vs. CDNs  One-Click Hosting services Data-center in a single location Focus on large transfers that are less sensitive to delay Get revenues from Premium users Content replicated on multiple servers  Content Distribution Networks Multiple geographically distributed servers so as to minimize delay observed by client Aim to minimize Web transaction delay Get revenues from large content producers Content replicated on multiple servers BACK

19 19 Challenging the P2P Paradigm  P2P has been (and continues to be) the most popular File-Sharing mechanism  BitTorrent Vs. RapidShare.com –Download Throughput –Content Availability

20 20 BT Vs. RS: Download Throughput RS subscribers outperforms open BitTorrent trackers in terms of throughput Free users experience comparable download experience

21 21 BT Vs. RS: Content Availability Searched for a number of different files in both network  Rapidshare.com holds at least as much objects as BitTorrent BACK

22 22 Content Indexing Websites  Form an important component for the emergence of OCH services  Crawled 4 different Indexing Websites Identify the contributors of the traffic Identify the size of the shared object Identify the types of shared object

23 23 Indexing WebSites Less than 20% of the files are not available Only a small number of users upload content Name# Indexed Objects RS Hosted Objects # of Stale Files # of Uploaders egydown.com972787134 (17%)N/A rapidmega.info942893116 (13%)9 rslinks.org121241184164 (0.5%)21 rapidshareindex.com54327365227052 (19.3%)18

24 24 Content Contributors A small number of the users is responsible for most of the content uploaded

25 25 Shared Objects At most 60% objects consist of a single URL Users share mostly Videos and Applications

26 26 Copyrighted Material Manually observed 100 most recent objects uploaded in each WebSite. In all cases more than 84% of the Objects are copyrighted. BACK

27 27 Conclusions  For OCH by this paper: Responsible for a significant share of daily Web traffic Most files are downloaded only once All servers at multihomed single datacenter (V.S. CDN) Free users experience similar performance with BitTorrent users & Subscribers (~20%) experience better performance Most users do not contribute on sharing files (only download)  Question: Will OCH services be an alternative to P2P for file- sharing in the future?

28 28 Backup slides


Download ppt "1 One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades Evangelos P. Markatos ICS-FORTH Heraklion,"

Similar presentations


Ads by Google