A Comparative Study of Two Natural Language Processing Frameworks Yixin Bian, Gunes Koru, Hongfang Liu Department of Information Systems, University of.

Slides:



Advertisements
Similar presentations
GenArch: Automatic Product Derivation in Multi-agent Systems Product Line Elder Cirilo.
Advertisements

Software Life Cycle and Models
5 August Eclipse Foundation - Mike Norman – CEO Scapa® Technologies Eclipse Test Project Lead Eclipse Test & Performance Tools Platform.
Extending Eclipse CDT for Remote Target Debugging Thomas Fletcher Director, Automotive Engineering Services QNX Software Systems.
Network Virtualization and Service Awareness Properties of FNs
26/10/2008 SWESE'08 1 Enhanced Semantic Access to Software Artefacts Danica Damljanović and Kalina Bontcheva.
Physical Aspects [Reflection Modelling] Hauptseminar: Augmented Reality for Driving Assistance in Cars.
Aligning Business and IT Models in Service-Oriented Architectures using BPMN and SoaML Brian Elvesæter, Dima Panfilenko, Sven Jacobi & Christian Hahn MDI2010.
ITEC800 – Group 1 Mobile G Calendar Ivan Alden [ ] Andrey Chislenko [ ] Kenfe-Mickaël Laventure [ ] Laurent Malvert [ ] José.
Software Tools Lecture 10: Software Tools Dr Valentina Plekhanova University of Sunderland, UK.
Data Mining and Text Analytics GATE, by Joel Bywater.
Chapter 11 Software Evolution
Usage of the memoQ web service API by LSP – a case study
Requirements Analysis 1. 1 Introduction b501.ppt © Copyright De Montfort University 2000 All Rights Reserved INFO2005 Requirements Analysis Introduction.
Understanding and Detecting Real-World Performance Bugs
© Coverity 2010 Coverity Analysis: Improving Quality in the Software Supply Chain Peter Henriksen, Development Manager for Analysis, Coverity October 1,
Code Smell Research: History and Future Directions Second PLOW Installment - March 5, Nikolaos Tsantalis Computer Science & Software Engineering.
FEATURES OF BUSINESS PROCESS TOOLS: AN OVERVIEW Vesna Bosilj-Vuksic University of Zagreb, Faculty of Economics and Business.
Unit 231 Software Engineering Introduction to SWE What is SDLC Phases of SDLC.
Software Quality Metrics
Small Tools and Interoperability Arend Rensink Formal Methods and Tools University of Twente.
Linear Clustering Algorithm BY Horne Ken & Khan Farhana & Padubidri Shweta.
TRAVEL RESERVATION SYSTEM USING WEB SERVICES COMPOSITION LANGUAGE
APPLICATION DEVELOPMENT BY SYED ADNAN ALI.
1 VRML As a Collaborative Design Tools For Architectural Students Chris Yeung, John Bradford, Guyver Cheng, Eric So.
#sqlsatPordenone #sqlsat367 February 28, 2015 Testing your databases Alessandro
Use Case Modelling Visual Annotator for studying ICU Notes Bacchus Beale.
3. Software product quality metrics The quality of a product: -the “totality of characteristics that bear on its ability to satisfy stated or implied needs”.
Systems Analysis and Design 9 th Edition Toolkit B CASE Tools By: Prof. Lili Saghafi.
Dirk van Schalkwyk Supervisor: Dr Greg Foster Co-Supervisor: Mrs Madeleine Wright Project Title: A Comparative Study of JME and Flash Lite for Mobile Data.
© 2006 by IBM 1 How to use Eclipse to Build Rich Internet Applications With PHP and AJAX Phil Berkland IBM Software Group Emerging.
BCS 2143 Introduction to Object Oriented and Software Development.
PHP TUTORIAL. HISTORY OF PHP  PHP as it's known today is actually the successor to a product named PHP/FI.  Created in 1994 by Rasmus Lerdorf, the very.
1 TRex Paul Baker 1, Dominic Evans 1, Jens Grabowski 2, Helmut Neukirchen 2, Benjamin Zeiss 2 The Refactoring and Metrics Tool for TTCN-3 Test Specifications.
CMS Security Justin Klein Keane CMS Working Group March 3, 2010.
ACS 560 – SOFTWARE ENGINEERING Course Accomplishment Summary Shilpashree K.S Fall 2010 Purdue University – Fort Wayne Instructor – Dr. John Tanik.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
Developing Contemporary Canonical Software Courses Summer Program Overview Rise Research Group at Drexel.
1 SEG4912 University of Ottawa by Jason Kealey Software Engineering Capstone Project Tools and Technologies.
Software Engineering Laboratory, Department of Computer Science, Graduate School of Information Science and Technology, Osaka University Code-Clone Analysis.
 Once the system has been installed it will be monitored to check whether it is working correctly. Sometimes problems with a system will not be found.
Hyper/J and Concern Manipulation Environment. The need for AOSD tools and development environment AOSD requires a variety of tools Life cycle – support.
Lecture 1 The University of Lahore Software Engineering Saeed Akhtar
Software Methods Mö/ slide 1 Methods and Techniques of Software Quality Management ICEL Quality Management Systems: Methods and Techniques of Software.
Web based Hydrology and Water Resources Information System for India
Quality Models in Software Engineering Literature: An Analytical and Comparative Study Rafa E. Al-Qutaish, PhD Al Ain University of Science and Technology.
Combining GATE and UIMA Ian Roberts. University of Sheffield NLP 2 Overview Introduction to UIMA Comparison with GATE Mapping annotations between GATE.
Software Engineering Emphasis for Engineering Computing Courses William Hankley Computing & Information Sciences Kansas State University.
Software Engineering for Business Information Systems (sebis) Department of Informatics Technische Universität München, Germany wwwmatthes.in.tum.de A.
Software Development in HPC environments: A SE perspective Rakan Alseghayer.
INFSOM-RI WP 4 : Testing Tools and Methodologies Status Report ETICS Review – 15 February 2008 Éva Takács (4D SOFT)
The Java Open Review Project Brian Chess Founder/Chief Scientist Fortify Software June 14, 2007.
"proper words in proper places“ translating for the localisation industry LRC Summer School Limerick 16 June 2005.
T EST T OOLS U NIT VI This unit contains the overview of the test tools. Also prerequisites for applying these tools, tools selection and implementation.
CPSC 372 John D. McGregor Module 1 Session 2 Process Measurement.
Combining GATE and UIMA Ian Roberts. 2 Overview Introduction to UIMA Comparison with GATE Mapping annotations between GATE and UIMA.
Objective ICT : Internet of Services, Software & Virtualisation FLOSSEvo some preliminary ideas.
MANAGEMENT INFORMATION SYSTEM
Constructing Deploying and Maintaining Enterprise Systems
The Software Development Cycle
Object Oriented Analysis and Design
An Introduction to Software Engineering
Architectural Engineering, The Pennsylvania State University
Presents: Rally To Java Conversion Suite
Project Change Control
Best Mobile App Development Practices by AppSquadz Technologies.
Combining GATE and UIMA
Title Introduction: Discussion & Conclusion: Methods & Results:
To Brihaspathi. Software Development Services
The Software Development Cycle
Presentation transcript:

A Comparative Study of Two Natural Language Processing Frameworks Yixin Bian, Gunes Koru, Hongfang Liu Department of Information Systems, University of Maryland, Baltimore County,MD,21250,USA June 11, 2012

Introduction UIMA (Unstructured Information Management Architecture) is a framework for natural language processing, originally developed by IBM but now maintained by the Apache Software Foundation. GATE (General Architecture for Text Engineering) is a Java suite of tools originally developed at the University of Sheffield and now used worldwide by a wide community of scientists, companies for all sorts of natural language processing tasks.

Introduction Both developed in Java. Although they share common goals, the two architectures are different in many aspects. Which one to adopt ?

Introduction In this paper, we compare them from three perspectives: Software design quality Code MetricsCode Metrics Software maintenance Code smellsCode smells Bugs Bug survival curves Bug survival curves User's manualUser's manual

The Comparison of Metrics UIMAGATE The number of classes 2,1872,822 MinMedianMaxTotalAverage Value MinMedianMaxTotalAverage Value Line of Code , , CBO NOC RFC DIT LCOM WMC

The Number of Code Smells Code SmellThe number of code smells in UIMA Average (UIMA/KLOC) The number of code smells in GATE Average (GATE/KLOC) Data Class Data Clumps Feature Envy Refused Bequest Long Message Chain Shortgun Surgery God Class Total

The Number of Bugs Detection ToolUIMAGATE FindBugs (2.0.0)6178 PMD (5.0) Lint4j (0.9.13)84494

The Comparison of Bug Survival Curves

The Comparison of User Manuals ContentsUIMAGATE Catalog Tutoral of manual Overview and characteristics of software product Installation and setup Introduction of product application Frequently Asked Questions (FAQ) × Known issues and problems with the software × Terms, concepts and their basic definitions in software ×

Conclusion Software design quality Software maintenance Users manual UIMA is better than GATE.

Thank you !