Knowledge Integration for Gene Target Selection Graciela Gonzalez, PhD Juan C. Uribe Contact:


GeneRanker in a Nutshell
Integration of knowledge from
– biomedical literature
– curated PPI databases, and
– protein network topology
Seeks to prioritize lists of genes by their association with specific diseases and phenotypes [1].
Such associations may or may not have been published (thus, not text mining).
[1] Gonzalez G, Uribe JC, Tari L, Brophy C, Baral C. Mining Gene-Disease Relationships from Biomedical Literature: Incorporating Interactions, Connectivity, Confidence, and Context Measures. Pacific Symposium on Biocomputing; 2007; Maui, Hawaii.

GeneRanker Interface
1. The user types a disease or biological process to be searched.
2. Genes found to be associated with the disease are extracted from the literature.
3. Protein-protein interactions involving those genes are then pulled from the literature and curated sources.
4. The protein network is built and each gene is ranked.
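The four steps above can be sketched in Python. This is only an illustration under stated assumptions, not GeneRanker's actual method: the function name `rank_genes`, the toy gene list, and the toy interactions are hypothetical, interactions are kept only when both partners are in the gene list, and simple connectivity (degree in the network) stands in for the real scoring, which the slides do not specify.

```python
from collections import defaultdict

def rank_genes(seed_genes, interactions):
    """Rank seed genes by how many other seed genes they interact with.

    seed_genes: gene symbols associated with the query (step 2).
    interactions: (gene_a, gene_b) pairs from literature/curated sources (step 3).
    Degree in the seed network is an illustrative stand-in for the real score.
    """
    seeds = set(seed_genes)
    neighbors = defaultdict(set)
    for a, b in interactions:
        # Assumption: keep only interactions between two seed genes.
        if a in seeds and b in seeds and a != b:
            neighbors[a].add(b)
            neighbors[b].add(a)
    # Step 4: sort by descending degree, then alphabetically for a stable order.
    return sorted(((g, len(neighbors[g])) for g in seeds),
                  key=lambda item: (-item[1], item[0]))

# Hypothetical toy input; not data from the presentation.
toy_genes = ["EGFR", "TP53", "PTEN", "MDM2"]
toy_ppis = [("EGFR", "PTEN"), ("TP53", "MDM2"), ("TP53", "PTEN"), ("TP53", "EGFR")]
ranking = rank_genes(toy_genes, toy_ppis)
print(ranking)
```

In this toy network TP53 touches every other gene, so it ranks first; a production system would presumably weight edges by source and confidence rather than count them equally.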

GeneRanker Interface
Each gene is scored and can be annotated (count of co-occurrences and statistical representation).
Collaboration: application of GeneRanker to a biological context, with Dr. Michael Berens, Director of the Brain Tumor Unit at the Translational Genomics Research Institute (TGen).
GeneRanker is available as an online application at
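As a rough illustration of the co-occurrence count mentioned on this slide, the sketch below counts abstracts that mention both a gene and the disease term. Everything here is an assumption for illustration: the function name `cooccurrence_counts`, the toy abstracts, and the naive case-insensitive token matching. A real system would rely on entity recognition and gene normalization, and the slide's "statistical representation" is not specified, so it is not modeled.

```python
def cooccurrence_counts(abstracts, genes, disease):
    """Count abstracts that mention both a gene symbol and the disease term.

    Matching is deliberately naive (lowercased tokens); this is a stand-in
    for proper entity recognition and gene normalization.
    """
    disease_l = disease.lower()
    counts = {g: 0 for g in genes}
    for text in abstracts:
        lowered = text.lower()
        if disease_l not in lowered:
            continue  # the disease is not mentioned in this abstract
        tokens = set(lowered.replace(",", " ").replace(".", " ").split())
        for g in genes:
            if g.lower() in tokens:
                counts[g] += 1
    return counts

# Hypothetical toy abstracts; not data from the presentation.
toy_abstracts = [
    "EGFR amplification is frequent in glioma.",
    "PTEN loss and EGFR signaling in glioma cells.",
    "TP53 mutations in breast cancer.",
]
counts = cooccurrence_counts(toy_abstracts, ["EGFR", "PTEN", "TP53"], "glioma")
print(counts)
```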

Evaluation of GeneRanker
Contextual (PubMed search) based extraction shows a > 20% jump in precision over NLP-based extraction.
Synthetic network results show AUC >
Empirical validation against a glioma dataset shows consistent results (118 vs. 22 differentially expressed probes from the top vs. bottom of the list).
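For readers unfamiliar with the two metrics named above, here is a minimal sketch of how precision and AUC are commonly computed. The function names and toy inputs are hypothetical, and this is not the evaluation code behind the reported results; AUC is computed as the probability that a randomly chosen positive item outscores a randomly chosen negative one, with ties counted as half.

```python
def precision(predicted, relevant):
    """Fraction of predicted items that are truly relevant."""
    predicted = set(predicted)
    if not predicted:
        return 0.0
    return len(predicted & set(relevant)) / len(predicted)

def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical toy values; not results from the presentation.
toy_precision = precision(["A", "B", "C", "D"], ["A", "C", "E"])
toy_auc = auc([0.9, 0.8, 0.4, 0.3], [1, 0, 1, 0])
print(toy_precision, toy_auc)
```

The pairwise form is quadratic in the number of items, which is fine for small gene lists; a rank-based formulation would scale better for large ones.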

Complementary Work
CBioC: shows PPIs, gene-disease, and gene-bioprocess associations extracted from abstracts. www.cbioc.org
BANNER: sourceforge.banner.org (presenting a poster on this one). An open source entity recognizer, available now.
Gene normalization: a similar open source system, soon to be available.