Introductory to database handling Endre Sebestyén.

Slides:



Advertisements
Similar presentations
Database Management Systems and Enterprise Software
Advertisements

INTRODUCTORY MICROSOFT ACCESS Lesson 1 – Access Basics
AS ICT Finding your way round MS-Access The Home Ribbon This ribbon is automatically displayed when MS-Access is started and when existing tables.
Lecture-7/ T. Nouf Almujally
The National Center for Biotechnology Information (NCBI) a primary resource for molecular biology information Database Resources.
Database Management An Introduction.
Using Objects and Properties
Chapter 4: Database Management. Databases Before the Use of Computers Data kept in books, ledgers, card files, folders, and file cabinets Long response.
Attribute databases. GIS Definition Diagram Output Query Results.
It refers to the software used to manage the database.
Microsoft Access Ervin Ha.
Introduction To Databases IDIA 618 Fall 2014 Bridget M. Blodgett.
Conceptual Architecture of PostgreSQL PopSQL Andrew Heard, Daniel Basilio, Eril Berkok, Julia Canella, Mark Fischer, Misiu Godfrey.
IST Databases and DBMSs Todd S. Bacastow January 2005.
Computing for Bioinformatics Introduction to databases What is a database? Database system components Data types DBMS architectures DBMS systems available.
Pasewark & Pasewark Microsoft Office 2003: Introductory 1 INTRODUCTORY MICROSOFT ACCESS Lesson 1 – Access Basics.
Databases C HAPTER Chapter 10: Databases2 Databases and Structured Fields  A database is a collection of information –Typically stored as computer.
Copyright © 2003 by Prentice Hall Module 4 Database Management Systems 1.What is a database? Data hierarchy and data organization Field, record, file,
ASP.NET Programming with C# and SQL Server First Edition
1 ITGS - introduction A computer may have: a direct connection to a net (cable); or remote access (modem). Connect network to other network through: cables.
 A databases is a collection of data organized to make it easy to search and easy to retrieve in a useful, usable form.
Gene Expression Omnibus (GEO)
DAY 14: ACCESS CHAPTER 1 Tazin Afrin October 03,
Simple Database.
MySQL. Dept. of Computing Science, University of Aberdeen2 In this lecture you will learn The main subsystems in MySQL architecture The different storage.
Chapter 7 Working with Databases and MySQL PHP Programming with MySQL 2 nd Edition.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
MET280: Computing for Bioinformatics Introduction to databases What is a database? Not a spreadsheet. Data types and uses DBMS (DataBase Management System)
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
Key Applications Module Lesson 21 — Access Essentials
More about Databases. Data Entry through Forms Table View (Data sheet view) is useful for data entry of new records But sometimes customization would.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
Introduction to CS520/CS596_026 Lecture Two Gordon Tian Fall 2015.
Data Resource Management
Chapter 5 Data Resource Management. 2 I. Why do organizations store data?  Data resources must be structured and organized in some logical manner so.
Copenhagen, 7 June 2006 Toolkit update and maintenance Anton Cupcea Finsiel Romania.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Maintaining a Database Access Project 3. 2 What is Database Maintenance ?  Maintaining a database means modifying the data to keep it up-to-date. This.
ABSTRACT The JDBC (Java Database Connectivity) API is the industry standard for database- independent connectivity between the Java programming language.
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
D. Heynderickx DH Consultancy, Leuven, Belgium 22 April 2010EuroPlanet, London, UK.
Database Indexing 1 After this lecture, you should be able to:  Understand why we need database indexing.  Define indexes for your tables in MySQL. 
ARGOS (A Replicable Genome InfOrmation System) for FlyBase and wFleaBase Don Gilbert, Hardik Sheth, Vasanth Singan { gilbertd, hsheth, vsingan
Copyright OpenHelix. No use or reproduction without express written consent1.
CIS 250 Advanced Computer Applications Database Management Systems.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
Protein sequence databases Petri Törönen Shamelessly copied from material done by Eija Korpelainen This also includes old material from my thesis
Introduction to Databases Angela Clark University of South Alabama.
Relational Database Systems Bartosz Zagorowicz. Flat Databases  Originally databases were flat.  All information was stored in a long text file, called.
Not Your Father’s Laserfiche AA101 Michael Allen.
Microsoft Power Query 101 Belinda Allen Smith & Allen Consulting, Inc.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
Data Resource Management Data Concepts Database Management Types of Databases Chapter 5 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies,
Storage and File Organization
Databases and DBMSs Todd S. Bacastow January
Indexing Goals: Store large files Support multiple search keys
Data Resource Management
Exploring Microsoft Office Access
Databases.
MANAGING DATA RESOURCES
Database.
More about Databases.
Conceptual Architecture of PostgreSQL
Database Management System
Lesson 3 Bioinformatics Laboratory
DATABASES WHAT IS A DATABASE?
Exploring Microsoft Office Access
Lecuter-1.
Presentation transcript:

Introductory to database handling Endre Sebestyén

What is a database? A database is a bunch of information It is a structured collection of information It contains basic objects, called records or entries The records contain fields, which contain defined types of data, somehow related to that record A nuclotid sequence database would contain for example all kinds of nucleotides as records, and nucleotide properties (length, name, origin, etc) as fields.

What is a database? A database is searchable It contains an index (table of content, catalog) It is updated regularly (releases) New data goes in Obsolete, old data goes out It is cross referenced To other databases

Why databases? The main purpose of databases is not only to collect and organize data, but to allow advanced data retrieval and analysis A database query is a method to retrieve information from the database The organization of records into fields allows us to use queries on fields Example : all mouse rna sequences between bp length

Databases on the internet USER WEBSERVERS DATABASE SERVER

Databases on the internet Information system Query system Storage system Data

Databases on the internet Information system Query system Storage system Data Book Book title Sequence Temperature Picture Video Log files of web servers etc

Databases on the internet Information system Query system Storage system Data Bookshelves Boxes Text files/directories Binary files MySQL database Oracle database

Types of databases Hierarchical model Tree-like structures Parent -> child One to many relations

Types of databases Network model More complex than the previous Parent -> child One to many Many to one

Types of databases Relational model Most widely used Fast and efficient (if the data structure is designed correctly)

Databases on the internet Lists Catalogues Librarian Index files SQL language grep command

Query systems for databases SQL query language Querying and modifying data Managing the database Optimize queries SELECT * FROM sequence_feature WHERE sequence_primary_id LIKE %$variable% SORT BY sequence_primary_id LIMIT 10; Multiple operating systems Different programming languages Different storage systems (MySQL, PostgreSQL, etc) Use SQL terminal Throught programming languages

Databases on the internet Library NCBI Entrez Google Lots of other general and specialized databases with search interfaces on the web

Case study: the DoOP database Tries to collect and analyze the promoter regions of different genes and orthologous gene clusters 2 main sections: plant and chordate Chordate: v1.4 Plant: v1.5, v1.6 Integrates different kinds of data Sequence data Sequence annotation Cross-references to external databases Multiple alignments Conserved sequence regions Goal: easily accessible and searchable interface on the web

Data processing

MySQL tables

MySQL table

MySQL tables

Data processing

API for the MySQL database Application Programming Interface We want to convert the MySQL data into nice webpages MySQL query to get data: SELECT * FROM sequence_feature WHERE sequence_primary_id LIKE %$variable% SORT BY sequence_primary_id LIMIT 10; And so on… Process the data OR with n API $data = $sequence_feature_object->get_data;

Bio::DOOP API (More or less) simple representations of the sequence and other data -> modules and objects The API hides the MySQL queries and other stuff from us, so we can concentrate on the web pages It works well only if we have good API design with all the necessary features Bio::DOOP API modules Clusters Subsets Sequences Sequence features Motifs Other modules for managing, sorting and filtering the data

Search page Search types Sequence ID Gene ID Keywords Species Sequence

Search results Cluster ID Description Conserved motifs Taxonomical groups Download sequences

Promoter cluster Sequences Gene annotation Sequence alignment Crossreferences Conserved regions

Promoter cluster UTR region Species, size Motifs

Further search in the motif collection Similar table as in the previous search results

Thank you for your attention!