U. S. National Library of Medicine The Current State of MetaMap and MMTx UMLS Webcast Alan (Lan) R. Aronson Lister Hill Center/NLM/NIH

Slides:



Advertisements
Similar presentations
Relevance Feedback Limitations –Must yield result within at most 3-4 iterations –Users will likely terminate the process sooner –User may get irritated.
Advertisements

The NLM Indexing Initiative Alan R. Aronson, PhD Lister Hill Center, National Library of Medicine American Society of Indexers Annual Meeting May 15, 2004.
Indexing the Biomedical Literature in a Time of Increased Demand and Limited Resources BioASQ Workshop September 27, 2013 Alan R. Aronson Lister Hill Center,
U. S. National Library of Medicine NLM Indexing Initiative Tools for NLP: MetaMap and the Medical Text Indexer Natural Language Processing: State of the.
1 Towards Automatic Discovery of Deviations in Binary Implementations with Applications to Error Detection and Fingerprint Generation David Brumley, Juan.
Codifying Semantic Information in Medical Questions Using Lexical Sources Paul E. Pancoast Arthur B. Smith Chi-Ren Shyu.
NLM Medical Text Indexer (MTI) BioASQ Challenge Workshop September 27, 2013 J.G. Mork, A. Jimeno Yepes, A. R. Aronson.
1 Towards Automatic Discovery of Deviations in Binary Implementations with Applications to Error Detection and Fingerprint Generation David Brumley, Juan.
1 Question Answering in Biomedicine Student: Andreea Tutos Id: Supervisor: Diego Molla.
U. S. National Library of Medicine Welcome to the first MMTx User’s Group Meeting AMIA 2003 November 11, 2003.
Chapter 2: Algorithm Discovery and Design
1 CS 502: Computing Methods for Digital Libraries Lecture 12 Information Retrieval II.
Automating Keyphrase Extraction with Multi-Objective Genetic Algorithms (MOGA) Jia-Long Wu Alice M. Agogino Berkeley Expert System Laboratory U.C. Berkeley.
Social Pharmacy and Pharmacoepidemiology Lister Hill National Center for Biomedical Communications Text-based Discovery in Biomedicine The Architecture.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
Threads. Processes and Threads  Two characteristics of “processes” as considered so far: Unit of resource allocation Unit of dispatch  Characteristics.
HIKM’2006AMTEx Automatic Document Indexing in Large Medical Collections Angelos Hliaoutakis, Kalliopi Zervanou, Euripides G.M. Petrakis Technical University.
HIKM’2006AMTEx Automatic Document Indexing in Large Medical Collections Angelos Hliaoutakis, Kalliopi Zervanou, Euripides G.M. Petrakis Technical University.
Chapter 2: Algorithm Discovery and Design
Chapter 2: Algorithm Discovery and Design
Medical Subject Headings (MeSH)
PART A Emac Lisp   Emac Lisp is a programming language  Emacs Lisp is a dialect.
A Hybrid Model to Detect Malicious Executables Mohammad M. Masud Latifur Khan Bhavani Thuraisingham Department of Computer Science The University of Texas.
Citation Biomedical Informatics Data ➜ Information ➜ Knowledge BMI Biomedical Named Entity Recognition Ramakanth Kavuluru NLP Seminar – 8/21/2012.
Mr. JOTL: A User Friendly Matching Software Stéphane Lhuillery, Julio Raffo & Fernando Lladós December nd "NameGame" APE-INV workshop.
These materials are prepared only for the students enrolled in the course Distributed Software Development (DSD) at the Department of Computer.
Tallinn, 13 December 2005 EC CHM portal toolkit Miruna Bădescu Finsiel Romania.
Update on the SEEM Simulation Program Larry Palmiter and Ben Larson August 4, 2008 Ecotope Inc. Presented at Regional Technical Forum Portland, Oregon,
Zheng Lu, Abdulhadi Shoufan, Guido Rößling 8th European Workshop on Microelectronics Education
A hybrid method for Mining Concepts from text CSCE 566 semester project.
Session II: Scientific Publishing and Semantic Web W3C Semantic Web for Life Sciences Workshop October 27, 2004 Moderator: Alan R. Aronson.
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Chapter 2: Algorithm Discovery and Design Invitation to Computer Science, C++ Version, Third Edition.
Semi-Automatic Indexing of Full Text Biomedical Articles Washington D.C. October 25, 2005 Clifford W. Gay Lister Hill National Center for Biomedical Communications.
Sadegh Aliakbary Sharif University of Technology Fall 2012.
1 st June 2006 St. George’s University of LondonSlide 1 Using UMLS to map from a Library to a Clinical Classification: Improving the Functionality of a.
Processing of large document collections Part 7 (Text summarization: multi- document summarization, knowledge- rich approaches, current topics) Helena.
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
Text Mining In InQuery Vasant Kumar, Peter Richards August 25th, 1999.
Using the UMLS MetaMap as a Cause of Death Analyzer Michael Hogarth, MD Michael Resendez, MS Univ. of California, Davis.
Cross Language Clone Analysis Team 2 April 7, 2011.
CTAKES The clinical Text Analysis and Knowledge Extraction System.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Noun-Phrase Analysis in Unrestricted Text for Information Retrieval David A. Evans, Chengxiang Zhai Laboratory for Computational Linguistics, CMU 34 th.
Threads G.Anuradha (Reference : William Stallings)
AMIA 2008 Monday, Nov :15-1:30 National Library of Medicine National Institutes of Health U.S. Dept. of Health & Human Services UMLS ® Users’ Meeting.
Comparing Frequency of Content- Bearing Words in Abstracts and Texts in Articles from Four Medical Journals: An Exploratory Study September 4, 2001 James.
Collocations and Terminology Vasileios Hatzivassiloglou University of Texas at Dallas.
Compiler Construction (CS-636)
A Bring together all regional Trade Unions in China with IPDPoD - Information Portal Development Platform on Demand Bruce ticilo.
Semi-automatic Product Attribute Extraction from Store Website
Introduction Why are virtual machines interesting?
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Concept Frequency Distribution in Biomedical Text Summarization.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Automatic Document Indexing in Large Medical Collections.
Chapter 11  Getting ready to program  Hardware Model  Software Model  Programming Languages  Facts about C++  Program Development Process  The Hello-world.
An Analysis of some Software Engineering Tools in the Market Neelesh Sahay CSC532 Dr. Box.
ICS312 Introduction to Compilers Set 23. What is a Compiler? A compiler is software (a program) that translates a high-level programming language to machine.
Chapter 2: Algorithm Discovery and Design Invitation to Computer Science.
MetaMap UMLS Concept Mapping Program Pawel Matykiewicz and Others.
Consumer Health Question Answering Systems Rohit Chandra Sourabh Singh
JAFER Toolkit Project Headley Lecture Theatre, Ashmolean Museum 1 Introduction To JAFER A Toolkit for Information Retrieval Antony Corfield; Matthew Dovey;
Zachary Starr Dept. of Computer Science, University of Missouri, Columbia, MO 65211, USA Digital Image Processing Final Project Dec 11 th /16 th, 2014.
MetaCoDe A GATE PLUGIN FOR TAGGING MEDICAL CORPORA IN FRENCH WITH CONTROLED TERMINOLOGIES Thierry Delbecque Pierre.
Medical Semantic Similarity with a Neural Language Model Dongfang Xu School of Information Using Skip-gram Model for word embedding.
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Jochen Ballof EN-STI-RBS
Tools Communication: Google Code Version Control: SVN UML: LucidCharts.
Lecture 8 Information Retrieval Introduction
Programming Languages, Preliminaries, History & Evolution
Presentation transcript:

U. S. National Library of Medicine The Current State of MetaMap and MMTx UMLS Webcast Alan (Lan) R. Aronson Lister Hill Center/NLM/NIH August 20, 2009 (updated December 10, 2009)

U. S. National Library of Medicine Outline Historical background Distribution modes MetaMap and MMTx* similarities MetaMap and MMTx differences Recent MetaMap development * MMTx – MetaMap Transfer

U. S. National Library of Medicine Historical Background Programs that map biomedical text to a thesaurus CLARIT (Evans et al., 1991) SAPHIRE (Hersh et al., 1990) MetaMap (Aronson et al., 1994) Metaphrase (Tuttle et al., 1998) MMTx (2001) KnowledgeMap (Denny et al., 2003) Mgrep (2009) Characteristics of MetaMap/MMTx Linguistic rigor Flexible partial matching Emphasis on thoroughness rather than speed

U. S. National Library of Medicine MetaMap/MMTx Example PMID – TI –Bile duct stricture due to caused by portal biliopathy: Treatment with one-stage portal-systemic shunt and biliary bypass. Stricture of bile ductCausingHepatic Administration procedureOnePhase Portasystemic shuntBiliaryBypass

U. S. National Library of Medicine MetaMap/MMTx Distribution Modes

U. S. National Library of Medicine MetaMap/MMTx Distribution Modes

U. S. National Library of Medicine MetaMap and MMTx Similarities Same purpose: mapping biomedical text to concepts in the UMLS Metathesaurus Same basic algorithm Tokenization and parsing into phrases Variant generation Candidate retrieval Candidate evaluation Final mapping construction

U. S. National Library of Medicine MetaMap and MMTx Differences (1/2) Algorithmic details Overall organization of the algorithm Tokenization Results Occasional differences, MetaMap’s generally preferred Programming language Prolog/C (MetaMap) Java (MMTx)

U. S. National Library of Medicine MetaMap and MMTx Differences (2/2) Platform availability MMTx: Solaris, Linux, Windows, OS X MetaMap: Solaris, Linux, Windows (soon), OS X (soon) Performance MetaMap is 2-5 times faster than MMTx (as of 2008)

U. S. National Library of Medicine Recent/Current MetaMap Development Technical algorithm enhancements resulting in at least 3x speedup in MetaMap execution MetaMap is now 3-10 times faster than MMTx (2009) Further technical development Migration from Sun/Solaris to Linux environment Update to current Berkeley DB to prepare for Migration from Quintus to SICStus Prolog MetaMap now detects negation (via NegEx) MetaMap 3D (colorized MetaMap output)

U. S. National Library of Medicine MetaMap 3D

U. S. National Library of Medicine Pointers: Website and Contributors Alan (Lan) R. Aronson James G. Mork Willie J. Rogers François M. Lang