Data Mining and Text Analytics GATE, by Joel Bywater.

Slides:



Advertisements
Similar presentations
1 Copyright © 2002 Pearson Education, Inc.. 2 Chapter 1 Introduction to Perl and CGI.
Advertisements

August 6, 2009 Joint Ontolog-OOR Panel 1 Ontology Repository Research Issues Joint Ontolog-OOR Panel Discussion Ken Baclawski August 6, 2009.
1 BENT Educational Society  Software is collection of instructions which tells the computer that what to do and how to do.  There.
C++ Interface for Making Visualized Graphs By N.K. Bonsack and E.Harcourt Abstract Software engineers and computer scientists alike frequently come upon.
ClearTK: A Framework for Statistical Biomedical Natural Language Processing Philip Ogren Philipp Wetzler Department of Computer Science University of Colorado.
Khresmoi – Multilingual Semantic Search of Medical Text and Images Henning Müller Allan Hanbury.
UNDERSTANDING JAVA APIS FOR MOBILE DEVICES v0.01.
Essential Software Architecture Chapter Two - Introducing the Case Study Ian Gorton CS590 – Winter 2008.
Social Network Analysis: Tasks and Tools Steven Loscalzo and Lei Yu Department of Computer Science Watson School of Engineering and Applied Science State.
© Prentice Hall CHAPTER 3 Computer Software.
Outline of Presentation Introduction of digital video libraries Introduction of the CMU Informedia Project Informedia: user perspective Informedia:
Copyright © 2006 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Technology Education Copyright © 2006 by The McGraw-Hill Companies,
Words & Definitions By: Naftaly Garcia Birruete. Address Bar  The space provided on a web browser that shows the addresses of websites.
Passage Three Introduction to Microsoft SQL Server 2000.
Emerging Platform#4: Android Bina Ramamurthy.  Android is an Operating system.  Android is an emerging platform for mobile devices.  Initially developed.
COMPUTER SOFTWARE ALISA RAHMANI PUTRI / VIDIYA RACHMAWATI /
What is R By: Wase Siddiqui. Introduction R is a programming language which is used for statistical computing and graphics. “R is a language and environment.
Media Computing Instructor Byung Kim Olsen 231 Office hours – MWF 9:00-10:00 AM or by appointment.
Reference: An Overview of Business Intelligence Technology, Communications of The ACM, August VOL 54 NO.8
M1G Introduction to Database Development 6. Building Applications.
AGENDA Introduction to Virtual Mechanic Demo Architectural diagram and summary QA steps and user acceptance testing Bugs in the software Feedback from.
6 October Information Retrieval and Knowledge Management Within the VisIT Web Services Architecture.
WordFreak A Language Independent, Extensible Annotation Tool.
An overview of scripting languages Alexander Kanavin Teachers: Barbara Miraftabi, Jan Voracek.
ITGS Case Study Theatre Booking System Ayushi Pradhan.
1 3. Computing System Fundamentals 3.1 Language Translators.
Software – Applications software and programming languages.
University of Sheffield NLP Teamware: A Collaborative, Web-based Annotation Environment Kalina Bontcheva, Milan Agatonovic University of Sheffield.
Semantic Technologies & GATE NSWI Jan Dědek.
COMU114: Introduction to Database Development 1. Databases and Database Design.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
GUIDED BY DR. A. J. AGRAWAL Search Engine By Chetan R. Rathod.
Data mining. Data mining, at its core, is the transformation of large amounts of data into meaningful patterns and rules.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Computing Ontology Part II. So far, We have seen the history of the ACM computing classification system – What have you observed? – What topics from CS2013.
1 Software Requirements l Specifying system functionality and constraints l Chapters 5 and 6 ++
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
Combining GATE and UIMA Ian Roberts. University of Sheffield NLP 2 Overview Introduction to UIMA Comparison with GATE Mapping annotations between GATE.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Framework for Interactive Applications Matthew Korchinsky Advisor: Aaron Cass Senior Project – Computer Engineering – 2006 Abstract The Java language was.
HTML Concepts and Techniques Fourth Edition Project 1 Introduction to HTML.
GUI For Computer Architecture May01-05 Team Members: Neil HansenCprE Ben JonesCprE Jon MathewsCprE Sergey SannikovCprE Clients/Advisors: Manimaran Govindarasu.
KING SAUD UNIVERSITY – COLLAGE OF COMPUTER AND INFORMATION SCIENCES CSC 113 JAVA ONLINE DOCUMENTATION.
Clustering in R Xue li CS548 showcase. Source html project.org/web/packages/cluster/index.html.
Using Wikis in Education An introduction to the use of wikis as a collaborative content development tool for learning.
Learning Objectives Understand the concepts of Information systems.
New Paradigms for Broadband Data Building the Fact Base: The State of Broadband Adoption and Utilization Federal Communications Commission Kate Williams.
CIS 595 MATLAB First Impressions. MATLAB This introduction will give Some basic ideas Main advantages and drawbacks compared to other languages.
Web Designing & Development. Designswala.Com Offers Wide Range Of Services In Various Parameters Like Web Designing, Web Development, Software Development,
Using Human Language Technology for Automatic Annotation and Indexing of Digital Library Content Kalina Bontcheva, Diana Maynard, Hamish Cunningham, Horacio.
Introduction to Oracle Forms Developer and Oracle Forms Services
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Introduction to Oracle Forms Developer and Oracle Forms Services
Introduction to Visual Basic 2008 Programming
DMQ4:Instruments & Sensors for online remote access
An Introduction to the IVC Software Framework
Introduction to Oracle Forms Developer and Oracle Forms Services
Data Warehouse.
An Introduction to Visual Basic .NET and Program Design
Programming languages and software development
A BRIEF INTRODUCTION TO UNIX OPERATING SYSTEM
Silverlight Technology
Data Warehousing and Data Mining
Information Technology Ms. Abeer Helwa
Java Online documentation
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
McGraw-Hill Technology Education
Prism A Prism for research in software modularity through
Information Retrieval and Web Design
Software Re-engineering and Reverse Engineering
Presentation transcript:

Data Mining and Text Analytics GATE, by Joel Bywater

Introduction Developed at University of Sheffield in 1995 Now used world wide by many users, such as scientists, teachers and companies for natural processing language tasks Currently handles 12 languages Written in Java ture_for_Text_Engineering

What is it? “General Architecture for Text Engineering (GATE) is a development environment for writing software that can process human- language text. In particular, GATE is used for computational language processing and text mining.” al-Architecture-for-Text-Engineering-GATE

Types GATE Teamware: A web-based management system for semi-automatic and manual annotation of text collections GATE Developer: Development environment providing tools for processing human language GATE Mimir: Concerned with storage in for the form of an index, used for search

GATE Graphical User Interface

Uses Scalable – Supports a wide range of potential tasks Currently able to annotate 12 languages Open source software Includes plug-ins, supporting applications such as WEKA Collectiveness – A set of tools all within one package

Drawbacks? Visual interface may be deemed to be complex by new users Doesn’t support annotation of all languages Frequent updates – User having to become familiar with new version, may be tedious

Summary So, what have we learned? - Introduction to GATE - The various types that are available - Seen an interface example - Positives and negatives