ALFRED: the ALlele FREquency Database. Kenneth K. Kidd and the ALFRED Team, Department of Genetics and Center for Medical Informatics, Yale University.

Presentation transcript:

ALFRED: the ALlele FREquency Database
Kenneth K. Kidd and the ALFRED Team
Department of Genetics and Center for Medical Informatics, Yale University School of Medicine
Supported by the U.S. National Science Foundation

Introduction
What is ALFRED?
Why is it necessary?
What are we doing now?
What is in ALFRED?
How to access ALFRED

What is ALFRED?
ALFRED, the ALlele FREquency Database, is designed to integrate into a single source information on the frequencies of human DNA sequence variants.

ALFRED Home Page

ALFRED is designed to allow reference of frequencies to:
A specific typing protocol for a specific polymorphism at a specific locus.
A specific sampling of an ethnic group.
Cross reference to the literature for other publications of frequencies based on the same sample or ethnic group.

Why is it necessary?
ALFRED is designed to serve as a central repository of frequencies for variation in the human genome, curated and cross-referenced to molecular and ethnographic databases, by assembling in one place data that are dispersed very widely in the scientific literature.

Why is it necessary?
ALFRED is web-based and publicly available, with easy-to-download data, and so serves as a resource for many types of research projects.
With its graphic displays of data, ALFRED can also serve as an educational resource for physical anthropology and human population genetics.

What are we doing now? Data content - Quality control
Criteria: a minimal typed sample size of 20 individuals, minimization of missing data, and time-stamped frequency data (i.e., different versions of frequency data are available).
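The two mechanical criteria above (the 20-individual minimum and time-stamped versions) can be illustrated with a minimal Python sketch. The record layout, field names, and function names are hypothetical and chosen only for illustration; they are not ALFRED's actual schema or curation code.

```python
from dataclasses import dataclass
from datetime import date

# Threshold stated on the slide: at least 20 typed individuals per sample.
MIN_TYPED_INDIVIDUALS = 20


@dataclass(frozen=True)
class FrequencyRecord:
    """Hypothetical frequency entry; field names are illustrative only."""
    population: str
    site: str
    individuals_typed: int
    entered_on: date  # time stamp that distinguishes versions of the data


def passes_sample_size(record):
    """Return True if the record meets the minimum typed sample size."""
    return record.individuals_typed >= MIN_TYPED_INDIVIDUALS


def latest_versions(records):
    """Keep only the most recent record for each (population, site) pair."""
    latest = {}
    for rec in records:
        key = (rec.population, rec.site)
        if key not in latest or rec.entered_on > latest[key].entered_on:
            latest[key] = rec
    return list(latest.values())
```

Older versions are not discarded in this sketch; `latest_versions` only selects which one to show, so earlier time-stamped entries remain available in the input list.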

What are we doing now? Data integration and accumulation
ALFRED curators are currently uploading allele frequency data from the published literature in peer-reviewed physical anthropology and population genetics journals.

What are we doing now? Data management
ALFRED programmers are currently working on the migration of ALFRED from Access to Oracle in order to handle the rapidly growing database.

What is in ALFRED?
ALFRED stores allele frequencies and information on a wide range of loci, polymorphic sites, populations, and samples.
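For readers unfamiliar with the quantity being stored, the sketch below shows how an allele frequency is obtained from genotype counts at a single site by standard gene counting. The function name and input layout are assumptions made for this example. It does not describe how any particular ALFRED entry was derived, and it does not cover haplotype frequency estimation.

```python
def allele_frequencies(genotype_counts):
    """Compute allele frequencies by gene counting.

    genotype_counts maps a pair of alleles, e.g. ("A", "G"), to the number
    of diploid individuals carrying that genotype. Each individual
    contributes two alleles to the total.
    """
    allele_counts = {}
    for (allele_1, allele_2), n_individuals in genotype_counts.items():
        allele_counts[allele_1] = allele_counts.get(allele_1, 0) + n_individuals
        allele_counts[allele_2] = allele_counts.get(allele_2, 0) + n_individuals
    total_alleles = sum(allele_counts.values())
    if total_alleles == 0:
        raise ValueError("no typed individuals")
    return {allele: count / total_alleles for allele, count in allele_counts.items()}
```

With made-up counts of 10 A/A, 20 A/G, and 20 G/G individuals (50 people, 100 alleles), allele A is counted 2 x 10 + 20 = 40 times and G is counted 20 + 2 x 20 = 60 times, giving frequencies of 0.4 and 0.6.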

Table Summary Numbers As of April 9, 2002

Loci Example: Chromosome 22

Locus Example: Catechol-O-Methyl Transferase

Definition of the Polymorphism
A clear protocol
PCR primers and product sizes for In/Dels and STRPs
PCR primers and fragment sizes after enzyme digestion for RSPs
Unambiguous definition of varying nucleotides based on flanking sequence
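To illustrate why fragment sizes after digestion define a restriction site polymorphism, here is a small sketch that computes the expected fragment lengths for a PCR product cut at given positions. The function name, the position convention (number of bases to the left of each cut), and the numbers in the usage note are assumptions for illustration, not values from ALFRED.

```python
def digest_fragment_sizes(product_length, cut_positions):
    """Return fragment lengths (in bp) after cutting a linear PCR product.

    cut_positions lists how many bases lie to the left of each cut.
    An empty list models the allele that lacks the restriction site.
    """
    cuts = sorted(set(cut_positions))
    if any(cut <= 0 or cut >= product_length for cut in cuts):
        raise ValueError("cut positions must fall inside the product")
    boundaries = [0] + cuts + [product_length]
    return [right - left for left, right in zip(boundaries, boundaries[1:])]
```

For a hypothetical 200 bp product with a site 80 bp from one end, the site-present allele yields fragments of 80 and 120 bp, while the site-absent allele remains a single 200 bp fragment.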

Polymorphisms Example: COMT, 3-site haplotype

Allele Frequencies Example: COMT, 3-site haplotype

Populations Example: North America

Populations Example: Maya, Yucatan

Samples Example: Maya, Yucatan

Frequency data retrieval Search

Frequency Variation for Four SNPs in 33 Populations
[Figure: populations grouped by region: Africa, Europe/Middle East, East Asia, North America, South America, P.S.]

ALFRED System Implementation
Microsoft Access (migration to Oracle)
Microsoft NT Server with Internet Information Server (IIS)
Scripts written in server-side ASP (VBScript)

ALFRED System Overview
[Diagram components: Client Browser; Web Server (ASP); NT Server; ODBC; ALFRED; PhenoDB; External Data Resources. Input Data Sources: Kidd Lab Data, Collaborators, HAPLO Program, Others (e.g. literature)]
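The browser-to-database path in this overview can be approximated with a small, self-contained sketch. The system described in the slides used ASP (VBScript) talking to Microsoft Access over ODBC. Here Python's standard `sqlite3` module is swapped in purely as a stand-in, and the table names, column names, and rows are all hypothetical placeholders rather than ALFRED's schema or data.

```python
import sqlite3

# Hypothetical, simplified schema: populations, polymorphic sites, frequencies.
SCHEMA = """
CREATE TABLE population (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE site (id INTEGER PRIMARY KEY, locus TEXT NOT NULL, name TEXT NOT NULL);
CREATE TABLE frequency (
    site_id INTEGER NOT NULL REFERENCES site(id),
    population_id INTEGER NOT NULL REFERENCES population(id),
    allele TEXT NOT NULL,
    freq REAL NOT NULL CHECK (freq BETWEEN 0 AND 1)
);
"""


def build_demo_db():
    """Create an in-memory database filled with placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO population VALUES (?, ?)",
                     [(1, "Population A"), (2, "Population B")])
    conn.execute("INSERT INTO site VALUES (1, 'LOCUS1', 'site1')")
    conn.executemany("INSERT INTO frequency VALUES (?, ?, ?, ?)",
                     [(1, 1, "A", 0.4), (1, 1, "G", 0.6),
                      (1, 2, "A", 0.75), (1, 2, "G", 0.25)])
    conn.commit()
    return conn


def frequencies_for_site(conn, locus, site_name):
    """Return (population, allele, frequency) rows for one polymorphic site."""
    query = """
        SELECT p.name, f.allele, f.freq
        FROM frequency AS f
        JOIN site AS s ON s.id = f.site_id
        JOIN population AS p ON p.id = f.population_id
        WHERE s.locus = ? AND s.name = ?
        ORDER BY p.name, f.allele
    """
    # Parameterized query: user-supplied search terms are never spliced into SQL.
    return conn.execute(query, (locus, site_name)).fetchall()
```

A server-side script in this design would call something like `frequencies_for_site(conn, "LOCUS1", "site1")` in response to a search form and render the returned rows as an HTML table for the client browser.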

ALFRED: the ALlele FREquency Database from the Kidd Lab
Suggestions and comments are welcome.

The ALFRED Team
Senior Faculty
Kenneth K. Kidd, Ph.D., Professor of Genetics and Psychiatry (ALFRED P.I.)
Perry Miller, M.D., Ph.D., Director of Center for Medical Informatics
Curators
Chen-Chen Yeh, M.S., Research Associate
Rebekah Heinzen, B.A., Research Assistant
Programmers
Michael V. Osier, Ph.D. Candidate, Graduate Student
Haseena Rajeevan, Ph.D., Systems Programmer
Nicholas P. Tosches, M.D., Associate Research Scientist
Lyudmila Druskin, M.D., Postdoctoral Fellow and Associate
Consultants
Andrew J. Pakstis, Ph.D., Research Scientist
Judith R. Kidd, Ph.D., Research Scientist
Kei-Hoi Cheung, Ph.D., Assistant Professor