Web Service Clustering Building Homogenous Service Communities Wei Liu Wilson Wong.

Slides:



Advertisements
Similar presentations
Towards Data Mining Without Information on Knowledge Structure
Advertisements

Current design issues for digital archives Robert Munro (presented by David Nathan) Endangered Languages Archive (ELAR), School of Oriental and African.
Copyright © 2003 Pearson Education, Inc. Slide 6-1 Created by Cheryl M. Hughes, Harvard University Extension School Cambridge, MA The Web Wizards Guide.
Fatma Y. ELDRESI Fatma Y. ELDRESI ( MPhil ) Systems Analysis / Programming Specialist, AGOCO Part time lecturer in University of Garyounis,
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
(2)(2) APNOMS 2003 Introduction Web-Service –A software application identified by a URI –Its public interfaces and bindings are defined and described.
Cultural Heritage in REGional NETworks REGNET T1.4: Development of the system specification.
David Martin for DAML-S Coalition 05/08/2003 OWL-S: Bringing Services to the Semantic Web David Martin SRI International
Sensor Web Enablement and GEOSS Presented by: Terence van Zyl.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
Presented to: By: Date: Federal Aviation Administration Registry/Repository in a SOA Environment SOA Brown Bag #5 SWIM Team March 9, 2011.
Universität Innsbruck Leopold Franzens Copyright 2006 DERI Innsbruck LarCK Workshop, ISWC/ASWC Busan, Korea 16-Feb-14 Towards Scalable.
1 Term 2, 2004, Lecture 9, Distributed DatabasesMarian Ursu, Department of Computing, Goldsmiths College Distributed databases 3.
15 Oct Workshop on Web Service and Testing 1 Towards Service-Oriented Testing of Web Services Hong Zhu Department of Computing, Oxford Brookes University.
Copyright 2006 Digital Enterprise Research Institute. All rights reserved. MarcOnt Initiative Tools for collaborative ontology development.
Distributed Service Registries Workshop, July 2005 Slide 1 NISO Metasearch Initiative Registries Robert Sanderson Dept. of Computer Science University.
Overview Environment for Internet database connectivity
Server Access The REST of the Story David Cleary
Web Service Architecture
1 Mobile Applications and Web Services Part II Prof. Klaus Moessner, Dr Payam Barnaghi Centre for Communication Systems Research Electronic Engineering.
26/10/2008 SWESE'08 1 Enhanced Semantic Access to Software Artefacts Danica Damljanović and Kalina Bontcheva.
1 University of Namur, Belgium PReCISE Research Center Using context to improve data semantic mediation in web services composition Michaël Mrissa (spokesman)
:: DIAsDEM :: Seminar: Web Mining WS 2003/2004 Ingo Kampe Heiko Scharff.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
31242/32549 Advanced Internet Programming Advanced Java Programming
Executional Architecture
Getting Familiar with Web Pages 1 2 The Internet Worldwide collection of interconnected computer networks that enables businesses, organizations, governments,
Presentation 7 part 2: SOAP & WSDL.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 31 Slide 1 Service-centric Software Engineering 1.
Pronalaženje Skrivenog Znanja
Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search Date: 2014/03/25 Author: Taifeng Wang, Jiang Bian, Shusen.
Chapter 5: Introduction to Information Retrieval
Web Services Nasrullah. Motivation about web service There are number of programms over the internet that need to communicate with other programms over.
Interactive Systems Technical Design Seminar work: Web Services Janne Ojanaho.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Presentation 7 part 2: SOAP & WSDL. Ingeniørhøjskolen i Århus Slide 2 Outline Building blocks in Web Services SOA SOAP WSDL (UDDI)
Latest techniques and Applications in Interprocess Communication and Coordination Xiaoou Zhang.
A New Computing Paradigm. Overview of Web Services Over 66 percent of respondents to a 2001 InfoWorld magazine poll agreed that "Web services are likely.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Web Services Andrea Miller Ryan Armstrong Alex. Web services are an emerging technology that offer a solution for providing a common collaborative architecture.
Web Services Architecture1 - Deepti Agarwal. Web Services Architecture2 The Definition.. A Web service is a software system identified by a URI, whose.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
Web Service Discovery Mechanisms Looking for a Needle in a Haystack? Evangelos Sakkopoulos joint work with J. Garofalakis, Y. Panagis, A. Tsakalidis University.
A Snapshot of public Web Services Prof: Dr.Jainguo Lu Presenting Group: Aktar-uz-zaman Mohit Sud.
WSDL Tutorial Ching-Long Yeh 葉慶隆 Department of Computer Science and Engineering Tatung University
Dr. Bhavani Thuraisingham October 2006 Trustworthy Semantic Webs Lecture #16: Web Services and Security.
Web Services Based on SOA: Concepts, Technology, Design by Thomas Erl MIS 181.9: Service Oriented Architecture 2 nd Semester,
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
Chapter 6: Information Retrieval and Web Search
Web Services. Abstract  Web Services is a technology applicable for computationally distributed problems, including access to large databases What other.
Ranking of Web Services Eyhab Al-Masri. Outline Discovery of Web Services 1 Ranking of Web Services 2 Approaches 3 Conclusion 4 Q & A 5.
Kemal Baykal Rasim Ismayilov
Advanced Web Technologies Lecture #4 By: Faraz Ahmed.
1 G52IWS: Web Services Chris Greenhalgh. 2 Contents The World Wide Web Web Services example scenario Motivations Basic Operational Model Supporting standards.
Providing web services to mobile users: The architecture design of an m-service portal Minder Chen - Dongsong Zhang - Lina Zhou Presented by: Juan M. Cubillos.
Introduction to Web Services Presented by Sarath Chandra Dorbala.
Web Services. Web Service: Simple definition : “ Service Offered On the Web “ Technically : “ A Web Service is a programmable application component that.
Software Architecture Patterns (3) Service Oriented & Web Oriented Architecture source: microsoft.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Data mining in web applications
Java Web Services Orca Knowledge Center – Web Service key concepts.
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Clustering of Web pages
WEB SERVICES.
Unit – 5 JAVA Web Services
Web Ontology Language for Service (OWL-S)
Wsdl.
LOD reference architecture
Introduction to Web Services and SOA
Presentation transcript:

Web Service Clustering Building Homogenous Service Communities Wei Liu Wilson Wong

22-Jun-05 Outline A brief introduction on – Web services – Text mining Web Service Clustering – The motivation – The challenges – The process – The results

22-Jun-05 What are Web Services It is software designed to be used by other software via Internet protocols and formats (Forrester) Web services are self-describing components that can discover and engage other web services or applications to complete complex tasks over the Internet. (Sun Microsystems, Inc) Web Services are loosely coupled software components delivered over the Internet via standards-based technologies like XML, and SOAP. (Gartner) Self-describing, self-contained, modular unit of application logic that provides some business functionality to other applications through an Internet connection… (UDDI.org) Web services are Internet-based, modular applications that perform a specific business task and conform to a particular technical format. (IBM) A web service is application logic that is programmatically available, exposed using the Internet. (Microsoft)

22-Jun-054 Web services are applications accessible via the Web to be consumed by clients. Clients of a Web Service are usually refer as service requester. Technologies standardized by the W3C to support Web service applications are: Web Service Description Language (WSDL) Simple Object Access Protocol (SOAP) Universal Discovery, Description, and Integration (UDDI) The Web Service Triangle

Broadly defined as the act of locating a machine-processable description of a web service that may have been unknown and that meets certain functional criteria Originated from agent match-making paradigm (middle agents and brokers), later moved onto UDDI [2] The discovery mechanisms differ according what languages are used for describing the service (WSDL or OWL-S) What is Web Service Discovery [2] Garofalakis, J., Panagis, Y., Sakkopoulos, E., Tsakalidis, A.: Web service discovery mechanisms: Looking for a needle in a haystack? In: International Workshop on Web Engineering, Hypermedia Development and Web Engineering Principles and Techniques: Put them in use, in conjunction with ACM Hypertext, Santa Cruz (2004)

Static and Not scalable – The registry can become a bottle neck – New services have to be added through a laborious process to ensure correct categorisation, which deters people from using it Search is keyword based – Ontology supported semantic search are only available agent and semantic web services Ill-fated Registry Based Structure

Make use of the wsdl files collected by Google Automatically cluster these files into functionally similar groups using text mining methods – linguistic analysis, and statistical techniques combined The resulting clusters will help service discovery by reducing the size of the haystacks What we propose

Traditional Information Retrieval and Document Clustering techniques cannot be borrowed directly, because of the following observations – web service files do not usually contain sufficiently large number of words for use as index terms or features. – Moreover, the small number of words present in the web service files are erratic and unreliable. – Related web pages that describe the WSDL service are also considered. GoogleAPI for discovering web page referral or citation. However, most of the WSDL files do not have related web pages that provide hyperlinks to them. The few that have hyperlinks referring to them are typically examples teaching how to program in a service-oriented paradigm. Observations are concurred by [9] Challenges [9] Li, Y., Liu, Y., Zhang, L., Li, G., Xie, B., Sun, J.: An exploratory study of web services on the internet. In: 2007 IEEE International Conference on Web Services (ICWS). (2007)

System Architecture 22-Jun-05

Collected WSDL File 22-Jun-05

Content – Parse the WSDL file for service descriptions in natural language Context – Relate documents by looking at parent/grandparent directories – Tokenising, stemming, – Remove function words* – Remove programming terms* Obtaining Content and Context

One of the properties of content words is that they tend to clump or to re-occur whenever they have appeared once [10]. On the other hand, the occurrence of function words tend to be independent of one another. Very often, such contrasting property can be captured through the inability of the Poisson distribution to model word occurrences in documents [11]. In other words, unlike content words, function words tend to be Poisson distributed. Content Words vs. Function Words [10] Manning, C., Schutze, H.: Foundations of statistical natural language processing. MIT Press, MA, USA (1999) [11] Church, K., Gale, W.: Inverse document frequency (idf): A measure of deviations from poisson. In: Proceedings of the ACL 3rd Workshop on Very Large Corpora. (1995)

Remove Function Words 22-Jun-05 A segment of the output during content-word recognition performed on the word tokens in the web service context set for the service QuranService. (single parameter poisson distribution)

Remove Programming Terms 22-Jun-05 Using term clustering methods that based on Normalised Google Distance to identify programming term clusters using our Tree-Traversing Ants featureless term clustering [12] [NGD] Cilibrasi, R.L., Vitanyi, P.M.B.: The Google Similarity Distance, IEEE Trans. Knowledge and Data Engineering, 19:3(2007), [12] Wong, W., Liu, W., Bennamoun, M.: Tree-traversing ant algorithm for term clustering based on featureless similarities. Data Mining and Knowledge Discovery Vol15(3) (2007) 349–381

Clustering Results for QuranService 22-Jun-05 A small oracle: runtime, webservice, developer, module, data

The service host is the second and top-level portion of the domain name (i.e. a segment of the authority part of the URI) of the host containing the WSDL file, and The service name is the name of the WSDL file. As one may note, the four features are by no means the best or the only ones available for describing a web service. However they are the most accessible and feasible ones to use in this case. The service host and the service name

Combining the four features

Web Service Clusters 22-Jun-05

Conclusions The paper presented techniques for automatic discovery of web services of similar functionalities. We term such service clusters as homogeneous service communities. If the crawling and the clustering process are in continuous operation like a typical search engine does, the approach has the potential of enabling self-organisation of the Web as proposed in [3]. The proposed web service clustering approach assumes no registries, and can automatically reduce the search space of web services effectively. Therefore, it can be seen as a predecessor for Web Service Discovery. This paper gathers real service description files from the Web instead of working on hypothetical examples. The resulting clusters not only provide a useful glimpse on what services are out there, but also an insight into the types of technologies which have proliferated in this area. 22-Jun-05 [3] Liu, W.: Trustworthy service selection and composition reducing the entropy of service- oriented web. In: 3rd International IEEE Conference on Industrial Informatics, Perth, Australia (2005)

22-Jun-0520 Web service has become a new trend for doing business online. U.S. – 65% of companies will and have been working on Web service projects – $3 billions; 2008 – $15.8 billions Web services help in e-business and e-commerce development. The Web Service Hype Just as the Web revolutionized how users talk to applications, XML transforms how applications talk to each other. (Bill Gates) Web services are expected to revolutionize our life in much the same way as the Internet has during the past decade or so. (Gartner)

The UDDI Business Registry (UBR) was part of the UDDI Project announced in September The project goals were to define a set of specifications to enable description, discovery and integration and to prove interoperability through operational experience. The UBR ran for 5 years, demonstrating live, industrial strength UDDI implementations managing over 50,000 replicated entries. Why IBM, Microsoft and SAP stopped UBR

Is Popfly service-oriented?

Thank You