Web Technologies for Bioinformatics Ken Baclawski.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

August 6, 2009 Joint Ontolog-OOR Panel 1 Ontology Repository Research Issues Joint Ontolog-OOR Panel Discussion Ken Baclawski August 6, 2009.
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
XML: Extensible Markup Language
Project 1 Introduction to HTML.
Oct 31, 2000Database Management -- Fall R. Larson Database Management: Introduction to Terms and Concepts University of California, Berkeley School.
XML Prashant Karmarkar Brendan Nolan Alexander Roda.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
“DOK 322 DBMS” Y.T. Database Design Hacettepe University Department of Information Management DOK 322: Database Management Systems.
1st Project Introduction to HTML.
4/20/2017.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
PHASE 3: SYSTEMS DESIGN Chapter 7 Data Design.
Chapter 11 Databases.
KINOLINA.COM XML Standards and Vocabulary Development Eric Gould December 2002.
Database Systems COMSATS INSTITUTE OF INFORMATION TECHNOLOGY, VEHARI.
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
 Introduction Introduction  Purpose of Database SystemsPurpose of Database Systems  Levels of Abstraction Levels of Abstraction  Instances and Schemas.
1 XML at a neighborhood university near you Innovation 2005 September 16, 2005 Kwok-Bun Yue University of Houston-Clear Lake.
Copyright © 2012 Accenture All Rights Reserved.Copyright © 2012 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are.
School of Computing and Management Sciences © Sheffield Hallam University To understand the Oracle XML notes you need to have an understanding of all these.
Introduction to XSLT By Ed Rosenthal And Dave Pion.
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
STORING ORGANIZATIONAL INFORMATION— DATABASES CIS 429—Chapter 7.
Information Systems: Databases Define the role of general information systems Describe the elements of a database management system (DBMS) Describe the.
Architecture for a Database System
Intro. to XML & XML DB Bun Yue Professor, CS/CIS UHCL.
Chapter 27 The World Wide Web and XML. Copyright © 2004 Pearson Addison-Wesley. All rights reserved.27-2 Topics in this Chapter The Web and the Internet.
Chapter 1 : Introduction §Purpose of Database Systems §View of Data §Data Models §Data Definition Language §Data Manipulation Language §Transaction Management.
File Systems and Databases Lecture 1. Files and Databases File: A collection of records or documents dealing with one organization, person, area or subject.
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
R. Addie & S. Dekeyser XML for M&C / USQ ? What ? Why ? How ? When ?
Data resource management
Mining the Biomedical Research Literature Ken Baclawski.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Dom and XSLT Dom – document object model DOM – collection of nodes in a tree.
Assoc. Prof. Dr. Ahmet Turan ÖZCERİT.  The concept of Data, Information and Knowledge  The fundamental terms:  Database and database system  Database.
IT Accessibility Committee XML as Content Management Presented by Michael B. Short May 11, 2006 Prepared by the NYS Forum IT Accessibility Committee
Mr.Prasad Sawant, MIT Pune India Introduction to DBMS.
7. Data Import Export Lingma Acheson Department of Computer and Information Science IUPUI CSCI N207 Data Analysis Using Spreadsheets 1.
INFSY 547: WEB-Based Technologies Gayle J Yaverbaum, PhD Professor of Information Systems Penn State Harrisburg.
Chapter 2 Database Environment.
1 Chapter 2 Database Environment Pearson Education © 2009.
PREPARED BY: PN. SITI HADIJAH BINTI NORSANI. LEARNING OUTCOMES: Upon completion of this course, students should be able to: 1. Understand the structure.
Postgraduate Module Enterprise Database Systems Technological Educational Institution of Larisa in collaboration with Staffordshire University Larisa
IIS 645 Database Management Systems DDr. Khorsheed Today’s Topics 1. Course Overview 22. Introduction to Database management 33. Components of Database.
Data Resource Management Data Concepts Database Management Types of Databases Chapter 5 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies,
XML Databases Presented By: Pardeep MT15042 Anurag Goel MT15006.
1 XML and XML in DLESE Katy Ginger November 2003.
Introduction to DBMS Purpose of Database Systems View of Data
XML Related Technologies
Relational Databases.
Database Processing with XML
Databases.
Prepared for Md. Zakir Hossain Lecturer, CSE, DUET Prepared by Miton Chandra Datta
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment.
Data Model.
Introduction to DBMS Purpose of Database Systems View of Data
Lecture 1 File Systems and Databases.
Database Design Hacettepe University
Chapter 2 Database Environment Pearson Education © 2014.
CSE591: Data Mining by H. Liu
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment Pearson Education © 2009.
Presentation transcript:

Web Technologies for Bioinformatics Ken Baclawski

Data Formats n Flat files n Spreadsheets n Relational databases n Web sites

XML Documents n Flexible very popular text format n Self-describing records

XML Documents (continued) n Hierarchical structure

Purpose of Data n Data is collected and stored for a purpose. n The format serves that purpose. n Using data for another purpose is common. n Data presentation (such as on a Web site) is one example of such a use. n It is important to anticipate that data will be used for many purposes. n Data is reused by transforming it.

Statistical Analysis as a Transformation Process n Transformation consists of a series of steps. n Specialized equipment and software is used for each step. n Separation into steps reduces the overall effort.

Web Site Construction n Web sites can be constructed using a Web site authoring tool (e.g., Front Page). n Alternatively, one could use a transformation process to separate concerns.

Advantages of Transformation n Reduces the overall effort. n Presentation style is independent of the source content. n Presentation style can be changed with immediate effect. n Uniform enforcement of presentation style. n Updates to content are immediate. n Content can be used for many other purposes: – Many reports in many formats – Proposals – Data sharing with other institutions – Data mining

Transformation Languages n Traditional programming languages such as Perl, Java, etc. n Rule-based (declarative) languages such as the XML Transformation language (XSLT). – Rule-based rather than procedural – Transform each kind of element with a template – Matching and processing of elements is analogous to the digestion of polymers with enzymes.

Transformation as Digestion n The blue enzyme attacks the polymer at two locations. n The resulting three polymers are then attacked by the green enzyme.

XSLT “Digestion” n An XSLT program consists of templates n Each template processes a set of matching elements n A template can break up the element to be processed by other templates

<xsl:transform version="1.0” xmlns:xsl="

5.67 <Concentration unit="nm">43 <interaction substrate="Sub89033"> <P id="Mtr245"> 0.65 <Concentration unit="um">0.53 <interaction substrate="Sub80933"> <S id="Sub89032"/>

Ontologies n The structure of data is its ontology. – Database schema – XML Document Type Definition (DTD) n An ontology defines the concepts and relationships between them in a domain. n Transformations are fundamental: – Queries – Organizing data (views) – Transformation for new purposes

Research Areas n Ontologies for bioinformatics n Ontology development in general – Constructing ontologies – Validation and testing of ontologies n New ontology languages to capture more meaning n Transformation languages

Research Areas n Inference and deduction – Logical inference – Probabilistic inference – Scientific inference – Other forms of inference n Integrating inference with – Data mining – Experimental processes