Hansheng Xue School of Computer Science and Technology

Slides:



Advertisements
Similar presentations
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
Advertisements

Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
Comparison of Several Meta-modeling Tools 2 Yi Lu Computer Science Department McGill University
Britain Southwick Nicole Anguiano March 29, 2014
Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
Presented by Marlene Shehadeh Advanced Topics in Computer Vision ( ) Winter
Whole Genome Alignment using Multithreaded Parallel Implementation Hyma S Murthy CMSC 838 Presentation.
Linear Clustering Algorithm BY Horne Ken & Khan Farhana & Padubidri Shweta.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
CHAMELEON : A Hierarchical Clustering Algorithm Using Dynamic Modeling
Social Network Analysis via Factor Graph Model
 C. C. Hung, H. Ijaz, E. Jung, and B.-C. Kuo # School of Computing and Software Engineering Southern Polytechnic State University, Marietta, Georgia USA.
Motif Discovery in Protein Sequences using Messy De Bruijn Graph Mehmet Dalkilic and Rupali Patwardhan.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
WAES 3308 Numerical Methods for AI
Automated Social Hierarchy Detection through Network Analysis (SNAKDD07) Ryan Rowe, Germ´an Creamer, Shlomo Hershkop, Salvatore J Stolfo 1 Advisor:
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
An Automated Approach for Acceptance Web Test Case Modeling and Executing XP, 蔡侑儒 Felipe M. Besson, Delano M. Beder, and Marcos L. Chaim.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
ENHANCED MONITORING TOOL PROJECT Project Presentation By: David Nasi & Amitay Svetlit Supervisor: Oved Itzhak Software Systems Lab Department of Electrical.
On Finding Fine-Granularity User Communities by Profile Decomposition Seulki Lee, Minsam Ko, Keejun Han, Jae-Gil Lee Department of Knowledge Service Engineering.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
GRNmap and GRNsight June 24, Systems Biology Workflow DNA microarray data: wet lab-generated or published Generate gene regulatory network Modeling.
Semantic Wordfication of Document Collections Presenter: Yingyu Wu.
 Retinal images were acquired on normal and pathological subjects, affected by hypertensive retinopathy of various levels.  The tool has been tested.
Object-Oriented Modeling: Static Models. Object-Oriented Modeling Model the system as interacting objects Model the system as interacting objects Match.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Optimal Dimensionality of Metric Space for kNN Classification Wei Zhang, Xiangyang Xue, Zichen Sun Yuefei Guo, and Hong Lu Dept. of Computer Science &
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology O( ㏒ 2 M) Self-Organizing Map Algorithm Without Learning.
Algorithms For Solving History Sensitive Cascade in Diffusion Networks Research Proposal Georgi Smilyanov, Maksim Tsikhanovich Advisor Dr Yu Zhang Trinity.
1 Circuitscape Capstone Presentation Team Circuitscape Katie Rankin Mike Schulte Carl Reniker Sean Collins.
1 An Efficient Optimal Leaf Ordering for Hierarchical Clustering in Microarray Gene Expression Data Analysis Jianting Zhang Le Gruenwald School of Computer.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Design and Analysis of Algorithms Introduction to graphs, representations of a graph Haidong Xue Summer 2012, at GSU.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
Working with Cascading Style Sheets
Objective % Select and utilize tools to design and develop websites.
DELLSOFT Technologies Pvt. Ltd.
Course Outcomes of Object Oriented Modeling Design (17630,C604)
Chapter 2 Database System Concepts and Architecture
Database System Concepts and Architecture
Bag-of-Visual-Words Based Feature Extraction
Edinburgh Napier University
Parallel Density-based Hybrid Clustering
Objective % Select and utilize tools to design and develop websites.
Chap 7. Building Java Graphical User Interfaces
Video Summarization by Spatial-Temporal Graph Optimization
Graphical User Interfaces -- Introduction
Graph Implementations
Systems Analysis and Design in a Changing World, 6th Edition
Project Title This is a sample slide layout
Disambiguation Algorithm for People Search on the Web
Digital Image Processing
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
Magnet & /facet Zheng Liang
Mashup Service Recommendation based on User Interest and Service Network Buqing Cao ICWS2013, IJWSR.
Project Title: (Your project title here)
Charts A chart is a graphic or visual representation of data
Assignment 1: Classification by K Nearest Neighbors (KNN) technique
Inductive Clustering: A technique for clustering search results Hieu Khac Le Department of Computer Science - University of Illinois at Urbana-Champaign.
Project Title This is a sample poster layout -
Edinburgh Napier University
Presentation transcript:

PhenoSimWeb: a web tool for measuring and visualizing phenotype similarities using HPO Hansheng Xue School of Computer Science and Technology Harbin Institute of Technology, shenzhen

Background Human Phenotype Ontology(HPO): a. The HPO is one of the most popular bioinformatics resources, which was constructed by Robinson et al. in 2008. b. The HPO currently contains over 11,000 terms, each of which describes an individual phenotypic anomaly. c. The terms are arranged in a directed acyclic graph and(DAG) are connected by is-a (subclass-of) edges, such that a term represents a more specific or limited instance of its parent term(s).  In recent study, HPO-based phenotype semantic similarity has been effectively applied to model patient phenotype data .

Background Main researches and Drawbacks: a. None of existing methods take into account the unique features of HPO. And we proposed a novel path-constrained Information Content to fill this gap. b. None of existing tools allow text that describes phenotype features as input, neglecting that symptoms of patients are always described as text not HPO terms. c. Most existing tools ignore the effect of visualization and simply list the experimental results as the final output. In this article, we present a novel web application named PhenoSimWeb to measure phenotype similarities based on HPO and to visualize the similarity using an intuitive graphical interface.

Background Advantages of PhenoSimWeb: a. PhenoSimWeb supplies researchers with a measurement based on the design optimized for unique features of HPO. b. PhenoSimWeb allows text that describes phenotype features as input. c. PhenoSimWeb contains an intuitive and easy-to-use visualization interface to visualize phenotype association network. The main webpage of PhenoSimWeb

PhenoSimWeb Main introduction: a. The back-end of PhenoSimWeb is implemented using Java SDK 7, Python 2.7 and web framework Web.py. b. PhenoSimWeb uses MySQL to manage dataset. c. JavaScript Object Notation(JSON) and Asynchronous JavaScript and XML(AJAX) are used for data transmission. d. PhenoSimWeb uses Cytoscape.js and HTML5 Canvas as the graphics engine for the association network visualization. PhenoSimWeb is a Browser/Server architecture-based web application which can be used to calculate the phenotype similarities based on HPO, visualize the association between phenotypes, and predict the associated genes/diseses given a set of phenotypes.

PhenoSimWeb Main functional modules: a. Given a list of phenotypes, calculate the pairwise similarities among the input phenotypes. b. Given a list of genes or diseases, calculate the pairwise similarities by aggregating the similarities of phenotypes associated to given genes or diseases. c. Given a list of phenotypes, identify the most associated genes or diseases with the given phenotypes based on their HPO-based similarity. Two operations to execute: a. To type in a set of phenotypes and specify the corresponding parameters b. To visualize and download the phenotype similarities 7

User Interface of PhenoSimWeb The whole process can be divided into three parts, including: inputing phenotype, gene or disease dataset, choosing phenotype similarity measurement, typing in experimental user information optionally. Key to the refinement operation is a novel way to project flows such that the sanctity of the clustering algorithm is maintained. Figure. The main input webpage of PhenoSimWeb 8 8

Input interface for gene (or disease) similarity calculation to simply normalize the columns of the adjacency matrix to sum to 1 Figure. The input webpage of calculating genes similarity. This part provides two types of input, including inputting gene set directly and selecting gene from database.

Input interface for choosing phenotype similarity measurement This is what allows MCL to “overfit” the graph by outputting too many clusters. Figure. PhenoSimWeb supplies five state-of-art phenotype semantic similarity measurements for all the users, including PhenoSim, Information Content based (Resnik), Enhanced Information Content based (Lin), Normalized Information Content based (Schlicker) and Jiang-Conrath Measure (JC).

Input interface for phenotype associated gene or disease prediction This is what allows MCL to “overfit” the graph by outputting too many clusters. Figure. The input webpage of predicting similar genes or diseases. Users input phenotypes set in the left text box, gene or disease set in the right text box and select the type of predict.

The webpage of displaying experimental results This is what allows MCL to “overfit” the graph by outputting too many clusters. Figure. The calculation results of the phenotype list. And PhenoSim calculated the corresponding P-Value in addition to the semantic similarity.

Visualization Interface of PhenoSimWeb This is what allows MCL to “overfit” the graph by outputting too many clusters. Figure. The visualization interface of PhenoSimWeb to explore phenotype, gene or disease functional similarities based on HPO.

Visualization Interface of PhenoSimWeb Figure. The node operation panel of visualization interface.

Visualization Interface of PhenoSimWeb Figure. The comparison diagram of two contrasting phenotype association network with different phenotype-to-phenotype similarity thresholds. The edge threshold of left one is 0 and right is 0.1.

Visualization Interface of PhenoSimWeb Figure. The comparison diagram of visualizing phenotype association network with two different graph layouts. The type of cola and grid are used in the left and right figure respectively.

Visualization Interface of PhenoSimWeb Figure. The comparison diagram of constructing subnetworks by selecting interested phenotypes. The right one displays that four interested phenotypes (HP:0000080, HP:0000069, HP:0030037 and HP:0000025) are chosen. The left one displays all the chosen nodes and their direct connected neighbors.

Conclusion a. PhenoSimWeb is a novel web application, which allows researchers to compute phenotype similarity with five different measurements conveniently and visualize the resulting phenotype association networks with an easy-to-use visualization interface. b. PhenoSimWeb contains three main functional modules: measure phenotype similarity, calculate gene or disease similarity, and identify the most associated genes or diseases with the given phenotype set. c. PhenoSimWeb allows text that describes phenotype features as input.

Acknowledgment

Thank you