PANACEA - Y2 After the 2 nd Annual Review, 28 th February 2012, Barcelona 1.

Slides:



Advertisements
Similar presentations
WP 4: Integration of Language Technology Tools into ILIAS Learning Management System Alexander Killing Project review, Utrecht, 1 Feb 2007.
Advertisements

Web Services Copyright © Liferay, Inc. All Rights Reserved. No material may be reproduced electronically or in print without written permission.
0 DOD/DT/CEDCV – 20 th & 21 st January Paris meeting SAGEM RTD Activities C2-Sense project Paris – 20 & 21 January 2015.
© NCSR, Paris, December 5-6, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Enrich the lexicons for the 1 st domain based on partners remarks.
Web Services Darshan R. Kapadia Gregor von Laszewski 1http://grid.rit.edu.
Snejina Lazarova Senior QA Engineer, Team Lead CRMTeam Dimo Mitev Senior QA Engineer, Team Lead SystemIntegrationTeam Telerik QA Academy SOAP-based Web.
Overview Summary of the activities for the past two weeks Forthcoming deliverables Development plan for the following period.
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information ISO TC37 SC4 WG Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary,
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Introduction to Web services MSc on Bioinformatics for Health Sciences May 2006 Arnaud Kerhornou Iván Párraga García INB.
Information Retrieval in Practice
6/11/2015Page 1 Web Services-based Distributed System B. Ramamurthy.
ANLE1 CC 437: Advanced Natural Language Engineering ASSIGNMENT 2: Implementing a query expansion component for a Web Search Engine.
Aligning Business Processes to SOA B. Ramamurthy 6/16/2015Page 1.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
Jiten Bhagat University of myExperiment A Social VRE for Research Objects JISC Roadshow | February.
And so on CGI programming Web Services Java Programs for the Web.
The SMS project WP 4.2: Service Repository & Runtime Environment ICCS.
B. RAMAMURTHY Web services. Topics What is a web service? From OO to WS WS and the cloud WS code.
Overview of Search Engines
Web Services Michael Smith Alex Feldman. What is a Web Service? A Web service is a message-oriented software system designed to support inter-operable.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
Web service testing Group D5. What are Web Services? XML is the basis for Web services Web services are application components Web services communicate.
1 LOMGen: A Learning Object Metadata Generator Applied to Computer Science Terminology A. Singh, H. Boley, V.C. Bhavsar National Research Council and University.
CLARIN tools for workflows Overview. Objective of this document  Determine which are the responsibilities of the different components of CLARIN workflows.
Graph-RAT Overview By Daniel McEnnis. 2/32 What is Graph-RAT  Relational Analysis Toolkit  Database abstraction layer  Evaluation platform  Robustly.
CLARIN web services and workflow Marc Kemps-Snijders.
UAM CorpusTool: An Overview Debopam Das Discourse Research Group Department of Linguistics Simon Fraser University Feb 5, 2014.
Introducing Dreamweaver MX 2004
Tutorial 1 Getting Started with Adobe Dreamweaver CS3
Internet Concept and Terminology. The Internet The Internet is the largest computer system in the world. The Internet is often called the Net, the Information.
CS117 Introduction to Computer Science II Lecture 1 Introduction to WWW and HTML Instructor: Li Ma Office: NBC 126 Phone: (713)
Mihir Daptardar Software Engineering 577b Center for Systems and Software Engineering (CSSE) Viterbi School of Engineering 1.
PANACEA WP3 The Platform WP participants: UPF, ILC, ILSP, LG, DCU, ELDA Final Annual Review 19 th February 2013 Marc Poch, UPF
PANACEA WP3 The Platform WP participants: UPF, ILC, ILSP, LG, DCU, ELDA Final Annual Review 19 th February 2013 Marc Poch, UPF
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Metadata Interoperability Framework (MIF) ELAG 2014 Naeem Muhammad Sam Alloing.
A Web Application for Customized Corpus Delivery Nancy Ide, Keith Suderman, Brian Simms Department of Computer Science Vassar College USA.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
Web Services based e-Commerce System Sandy Liu Jodrey School of Computer Science Acadia University July, 2002.
Introduction to GATE Developer Ian Roberts. University of Sheffield NLP Overview The GATE component model (CREOLE) Documents, annotations and corpora.
(C) 2014 Logrus International Visualizing ITS 2.0 Categories for the localization process.
SOIS APP Working Group Overview. Presentation Overview Application Support Services Electronic Datasheets ESA Project History and Plans Standards Documentation.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
SSE3 Hypertext concepts 1. Agenda Pioneers and evolution Hypermedia – Modern hypermedia technology – Structure domains Architectural evolution The project.
©2012 LIESMARS Wuhan University Building Integrated Cyberinfrastructure for GIScience through Geospatial Service Web Jianya Gong, Tong Zhang, Huayi Wu.
© NCSR, Frascati, July 18-19, 2002 WP1: Plan for the remainder (1) Ontology Ontology  Use of PROTÉGÉ to generate ontology and lexicons for the 1 st domain.
SOAP-based Web Services Telerik Software Academy Software Quality Assurance.
NCSR “Demokritos” Institute of Informatics & Telecommunications CROSSMARC CROSS-lingual Multi Agent Retail Comparison Costas Spyropoulos & Vangelis Karkaletsis.
WSDL – Web Service Definition Language  WSDL is used to describe, locate and define Web services.  A web service is described by: message format simple.
Toward an Open Source Textual Entailment Platform (Excitement Project) Bernardo Magnini (on behalf of the Excitement consortium) 1 STS workshop, NYC March.
Introduction to Web Services Presented by Sarath Chandra Dorbala.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
ISMB Demo, 01 July 2009 Franck Tanoh University of Manchester, UK.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Institute of Informatics & Telecommunications NCSR “Demokritos” Spidering Tool, Corpus collection Vangelis Karkaletsis, Kostas Stamatakis, Dimitra Farmakiotou.
XML 1. Chapter 8 © 2013 Pearson Education, Inc. Publishing as Prentice Hall SAMPLE XML SCHEMA (XSD) 2 Schema is a record definition, analogous to the.
1/16 TectoMT Zdeněk Žabokrtský ÚFAL MFF UK Software framework for developing MT systems (and other NLP applications)
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
WP1: Plan for the remainder (1) Ontology –Finalise ontology and lexicons for the 2 nd domain (RTV) Changes agreed in Heraklion –Improvement to existing.
Java Web Services Orca Knowledge Center – Web Service key concepts.
Search Engine Architecture
 Corpus Formation [CFT]  Web Pages Annotation [Web Annotator]  Web sites detection [NEACrawler]  Web pages collection [NEAC]  IE Remote.
Professor Carole Goble University of Manchester, UK
Part of the Multilingual Web-LT Program
Re3gistry Software rc1.0 INSPIRE registry service rc 5
17th APAN Meetings & Joint Techs Workshop
Distributed System using Web Services
Presentation transcript:

PANACEA - Y2 After the 2 nd Annual Review, 28 th February 2012, Barcelona 1

Join together a number of advanced interoperable tools to build a platform/factory/production line that automates the stages involved in the –acquiring, processing and producing Language Resources required by MT and other Language Technologies Objectives

Partners WP1 – Management (UPF) WP3 – The Platform (UPF) WP4 – Corpus Acquisition & Annotation (ILSP) WP5 – Parallel corpus & derivatives (DCU) WP6 – Lexical Acquisition (UCAM) WP7 – Integration & resource evaluation (ILC) WP8 – Evaluation in industrial environment (LT) WP2 – Dissemination and Exploitation (ELDA)

Platform The PANACEA platform is an interoperability space based on tools, guidelines, a Common Interface definition, and a “Travelling Object” specification Tools: Taverna, BioCatalogue, myExperiment, Soaplab Common Interface: WS interoperability Travelling Object: XCES and GrAF Documentation (video tutorials, how-tos, deliverables, etc. at 4

Tools SOAPLAB 2 (SOAP) - Web application for deploying command line tools as WS - No coding needed! Metadata only - Services deployed by ILSP at Web application for deploying command line tools as WS - No coding needed! Metadata only - Services deployed by ILSP at TAVERNA - Open source desktop application - Imports Soaplab and other types of WS - Allows for combination of WS in workflows ( - Open source desktop application - Imports Soaplab and other types of WS - Allows for combination of WS in workflows ( BioCatalogue -Web application for registering and documenting WSs -Search function - Auto-checks web services status - Annotations: tags, categories, etc. -Web application for registering and documenting WSs -Search function - Auto-checks web services status - Annotations: tags, categories, etc. Web Services Workflow editor Registry Social network myExperiment - Share workflows, files, data, etc. - Share opinions and comments, create work groups, etc Share workflows, files, data, etc. - Share opinions and comments, create work groups, etc

Three levels of interoperability: –COMMUNICATION PROTOCOLS: Soap, Rest –DATA –PARAMETERS Format N Tool A Format M Tool B Format L Tool C Format N Tool A empty Tool B empty Tool C Interoperability Tool B does not “understand” format N! All tools understand the previous format Tool A Tool B ABCDABCD ABCDABCD Tool A Tool B YTQZYTQZ ABCDABCD 6

Travelling Object The Travelling Object (TO) is the common data and metadata format used in PANACEA to make components understand each other (syntactic interoperability) First TO for annotations up to tagging and lemmatization –Based on XCES (XML files with p, s, and t elements) –Tools: formatConverters and stylesheets Second TO for everything else (NER, DepParsing, etc.) –Based on GrAF (standoff annotation) –One file for primary data –One file for each annotation layer 7

Common Interface A Common Interface (CI) defines the mandatory parameters for every type of WS: 8

Soaplab Web Services 28 Corpus Acquisition and Annotation Web Services NLP WS’s focusing on sentence splitting, tokenization, tagging, lemmatization and parsing, e.g: –EN, FR: Berkeley tagger and parser (DCU) –ES: UPF tools, Freeling; IT: ILC’s DESR, Freeling –DE and EL: LT’s and ILSP’s in-house tools WS’s for conversion from and to PANACEA’s Travelling Object and ILC) WS’s for alignment of parallel data

10 Corpus Acquisition WS Focused Bilingual Crawler (FBC) –Documentation: –Test at –Sample topic definition for crawling EN-FR pages in the Environment domain xt xt –Seed URL for crawling EN-FR ENV data Focused Monolingual Crawler (FMC) –Documentation: –Test at –Topic definition for crawling EN ENV data txt txt –List of seed URLs for crawling EN ENV txt txt

11 Taverna Workflow Demo How can I align crawled data? Search for a DCU hosted alignment service at ry=alignhttp://myexperiment.elda.org/workflows?que ry=align

12 Corpus Annotation WS ILSP –Documentation: –Test at –Sample input: ILC DESR (dependency parser) –Workflow: