P2P Distributed Computing Platform: C.Y. ‘Connor’ Park KISTI Supercomputing Center
th APAN Meetings, BUSAN, KOREA Overview ( KatH ) : A Software Platform for Distributed Computing using Volunteer Computer Resources Features Hybrid Peer-to-Peer architecture Secure PKI-based Security and Digital Signature Web Services with SOAP messaging
th APAN Meetings, BUSAN, KOREA Motivation To take advantage of and stimulate ‘Advanced Network Infrastructure in Korea’ To meet the growing needs of computing power for applications Funded by ‘Ministry of Information and Communication (MIC)’
th APAN Meetings, BUSAN, KOREA Infrastructure Over 10,000,000 Broadband network (xDSL, Cable) households Over 58.4% of PCs with Pentium3+
th APAN Meetings, BUSAN, KOREA Intention of Participation UnderstandingIntention of Providing Resources Well-Understanding (44.9%) Understanding roughly (21.8%) Don’t Know (33.3%) Keep up Providing (59.5%) Providing later (30.4%) Providing with Compensation (7.2%) Don’t (2.9%) 67.7% of Netizen already know what is Distributed Computing and 89.9% of Netizen have intention of providing their PC resources.
th APAN Meetings, BUSAN, KOREA Similar Projects Science climateprediction.net Life Science Cryptography Distributed.net, RSAttack576 Mathematics Mersenne Prime Search, Pi(x) Project Internet Grub, CycleTrader Financial MoneyBee Games ChessBrain Miscellaneous DALiWorld from
th APAN Meetings, BUSAN, KOREA Project History MIC launched the pilot project Application : Virtual Screening for New Drug Design First pilot production and test Client/Server Architecture Model Second pilot production and test Regularize the project “P2P Internet-based Distributed Computing Platform Development” Announcement of Hybrid P2P-style Distributed Computing Platform
th APAN Meetings, BUSAN, KOREA Architecture Cooperation Internet Large-scale Application Resource Management Subscription Distribution Agent Job Request
th APAN Meetings, BUSAN, KOREA Design Goals Application Developers (Job Requester) Convenience They should be able to use easily. Flexibility The platform is general enough to be used by various applications. Correctness The platform should return correct results.
th APAN Meetings, BUSAN, KOREA Design Goals PC agents (Resource Providers) Transparency PC users are not disturbed in their own works. Reliability It’s reliable not to make any security and privacy problem. Reward There is any kind of compensation, even just in mind.
th APAN Meetings, BUSAN, KOREA Design Goals Platform Scalability Performance should be improved according to the growth of agents. Adaptability It should put up with the dynamic features of Internet.
th APAN Meetings, BUSAN, KOREA Implementation Choices Web Services SOAP messaging over https Request-Response Model It is Agents’ duty to make most decisions. PKI-based certification and Digital Signature for files Make server as simple as possible, and Make PC agents more intelligent
th APAN Meetings, BUSAN, KOREA Functional Classification Resource Management Authentication, Certification Simple accounting Scheduling and Job distribution Job distribution with hybrid P2P style Result gathering Client Management Client authentication Application preparation to work Digital signing Result returning to client
th APAN Meetings, BUSAN, KOREA Job Distribution Internet KatH Server Agent Authentication Working Specification Request Is there any job for me to work? Who has the job data? Getting data, working, and returning result to server
th APAN Meetings, BUSAN, KOREA Networking Issues Network Security DoS, Spoofing, Sniffing, … Variable Connection Dial-up, cable, xDSL, T1/E1, … IPv6, WiFi, Bluetooth, … Network Unobtrusiveness Background/shared communication Transparency (Firewall/NAT)
th APAN Meetings, BUSAN, KOREA 2002 Pilot Projects Application : Virtual Screening for New Drug Design Client-Server Architecture An Agent Receives about 400KB-size file Executes during 20 minutes per working unit Returns about 190KB-size file Total number of working job units : 50,000
th APAN Meetings, BUSAN, KOREA Virtual Screening Structure Analysis Target Protein Share Analysis Chemical DB Disease Protein Ligand DB Server + + Client Client 2 Client 3 Virtual High-Throughput Screening Internet Calculating Ligand PMI Looking for Drug Candidate Clinical Demo New Drug Mass-produce Calculating Active Site PMI
th APAN Meetings, BUSAN, KOREA Experimental Results 1st Test 1,217 installed agents Max 53.5 Gflops / Avg Gflops 2nd Test 6,472 installed agents Max 644 Gflops / Avg Gflops ※ World Top Rank Gflops: IBM xSeries Cluster Xeon 2.8 GHz - Myrinet/ 192 (from )
th APAN Meetings, BUSAN, KOREA 2003 Progresses P2P data distribution New Applications Virtual Screening for New Drug Design Global Risk Management for financial investment 3D image rendering Supplementary plug-in Remote backup service
th APAN Meetings, BUSAN, KOREA Future works Make agents more intelligent with P2P MPI-like API for application developer Fault-tolerance : checkpointing & rollback recovery IPv6 enables Open-Source
th APAN Meetings, BUSAN, KOREA Thanks a lot! ^^