CellExpress Tutorial A Comprehensive Microarray-Based Cancer Cell Line and Clinical Sample Gene Expression Analysis Online System 172.16.0.66:8080 NTU.

Slides:



Advertisements
Similar presentations
Using the Georgia Online Assessment System(OAS) We will lead the nation in improving student achievement. Kathy Cox, State Superintendent of Schools.
Advertisements

The Maize Inflorescence Project Website Tutorial Nov 7, 2014.
The Rice Functional Genomics Program of China cDNA microarray database (RIFGP-CDMD) consists of complete datasets, including the probe sequences, microarray.
Welcome to the Turnitin.com Instructor Quickstart Tutorial ! This brief tour will take you through the basic steps teachers and students new to Turnitin.com.
Cluster Analysis Hierarchical and k-means. Expression data Expression data are typically analyzed in matrix form with each row representing a gene and.
Customer Portal – Customer User. You will receive an indicating that your Customer Portal registration is complete. A link to the Customer Portal,
Differential Analysis & FDR Correction
Seta Management System. Introduction 1.Offline Versions 2.Capturing of Learners and Agreements 3.Capturing of Achievements.
Gene Expression Omnibus (GEO)
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
Drinking Water Infrastructure Needs Survey and Assessment 2007 Training.
CellFateScout step- by-step tutorial for a case study Version 0.94.
Drinking Water Infrastructure Needs Survey and Assessment 2007 Website.
New Look New Tools Easier Access Accessing and Using Census Data The New American FactFinder (AFF2) Northwest Government Information Network Tumwater,
National Levee Database Interactive Reports Instructions NLD Point of Contact 1 US Army Corps of Engineers.
Introduction to Morpho BEAM Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
SP5 - Neuroinformatics SynapsesSA Tutorial Computational Intelligence Group Technical University of Madrid.
Introduction to Morpho RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
SUPPLEMENTAL FIGURES AND TABLES. Supplementary Table 1: List of new and improved features in GSEA-P version 2 Java software. Examples and screenshots.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
Expression profiling & functional genomics Exercises.
How to Set-up Your Business Alliance Listing. Welcome  This tutorial will take you through the steps to set-up (or edit) your listing to ensure you get.
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
1 Berger Jean-Baptiste
Bioinformatics Shared Resource Introduction to Gene Expression Omnibus (GEO) bsrweb.sanfordburnham.org
Reporter Training for High School RIO TM
Windows Vista Configuration MCTS : Internet Explorer 7.0.
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
County Website Content Management System
Getting GO annotation for your dataset
Web-based Information Science Education
Database Essentials Access Lesson 1.
Imaging and Design for Online Environment
Weebly Elements, Continued
SAGExplore web server tutorial for Module III:
Tutorial 6 : RNA - Sequencing Analysis and GO enrichment
Single Sample Registration
Using ArrayExpress.
Database Essentials Access Lesson 1.
CellExpress Examples A Comprehensive Microarray-Based Cancer Cell Line and Clinical Sample Gene Expression Analysis Online System :8080 NTU.
Microsoft Excel 101.
Introduction With TimeCard users can tag SharePoint events with information that converts them into time sheets. This way they can report.
IBGP705 GEO—Gene Expression Omnibus
Azure Machine Learning & ML Studio
How Can I Download My Transactions Directly Into Quicken
New Features of Security Management 11
Statistical Analysis with Excel
Strategy Description Discovery Validation Application
Statistical Analysis with Excel
© 2016 Blackboard Inc. All rights reserved..
Gene Expression Omnibus (GEO)
ID Mapping tools: Converting Accessions between Databases
Gene expression profiling of pediatric acute myelogenous leukemia
Gene expression analysis
SCViSiON Salumatics Coding Viewer Users Guide
An overview of the online edition
Navya Thum January 30, 2013 Day 5: MICROSOFT EXCEL Navya Thum January 30, 2013.
Using the Online Compare Tool
Overview of Contract Association Batch Upload
Tutorial 7 – Integrating Access With the Web and With Other Programs
GRANT APPLICATION INSTRUCTIONS
Knowledge-Guided Sample Clustering
Cancer Cell Line Encyclopedia
Community-Engaged Partnership Database: VCU’s Commitment to Community Engagement
Stephen Bridgett, James Campbell, Christopher J. Lord, Colm J. Ryan 
Slide Set I: PARS Overview
PD-L1 expression correlates with T-cell markers and an IFN response signature in human melanomas. PD-L1 expression correlates with T-cell markers and an.
Reporter Training for High School RIOTM
Presentation transcript:

CellExpress Tutorial A Comprehensive Microarray-Based Cancer Cell Line and Clinical Sample Gene Expression Analysis Online System 172.16.0.66:8080 NTU CGM Bioinformatics & Biostatistics Core Lab.

Characteristics Feature : Provides comprehensive gene expression profiles analysis both for cell lines and clinical samples User friendly interface and visualized results User data upload supported All data and figures can be downloaded A Python Django and SQLite based website with jQuery and javascripts front end Four functions: Gene expression search Gene signature explorer Similarity assessment Profiling analysis NTU CGM Bioinformatics & Biostatistics Core Lab.

Total(without duplication) Dataset Information Dataset Name Type Sample Number Cell Line number Tissue Origin Number Platform Sanger Cell Line Project (GSE68950) cancer cell line 798 732 32 Affymetrix U133A CCLE (GSE36133) 917 24 Affymetrix U133 Plus2.0 NCI-60 (GSE32474) 174 60 9 expO(GSE2109) clinical sample 2152 N/A 74 Roth Normal Dataset(GSE7307) normal tissue 353 62 Total(without duplication) 4394 1319 128 - 

Workflow Structure

Home Page Link to CellExpress: 172.16.0.66:8080 List of all available cell lines in CellExpress Link to four analysis functions’ pages Tutorial and example for usage Feel free to contact us when you found bugs or having any questions :) Link to CellExpress: 172.16.0.66:8080

Avaliable Cell Lines List of all available cell lines in CellExpress

Function 1— Gene Expression Search Search for gene expression data with probe ID, Entrez Gene ID or official gene symbol Normalization will be done based on: -House keeping gene: GAPDH, ACTB -Gene with minimum coefficient of variance: RPL41 Cell Line Microarray Data -Search for cell line gene expression data Sample Microarray Data -Search for clinical sample gene expression data (Only the Step 3 is different in this two modes ) Click here!

Step 1 & 2 Choose the keyword type Choose the keyword type in Step 1, the keywords input in Step 2 should be based on this choice. Choose the keyword type Input the keywords here. Separate each keyword with space(s) or new line.

Cell Line Microarray Data—Step3 Step 3 of “cell line microarray” has two different choices If you choose to select the cell line by dataset and primary sites, select the dataset first. The selection block for each dataset will appear individually below (the red block in this plot.) Input keyword to search cell line name or primary site. Select the cell line you want. Input the cell line names directly. Separate each with space(s) or new line.

Cell Line Microarray Data—Step3 After you select the cell line, you will see the following Notice: Input cell line names directly will not see this Click this, return to selection part of p.8 Dataset name The cell line you selected Click this, hide or show the red block table below The selected cell lines’ information

Sample Microarray Data—Step3 Step 3 of “cell line microarray” has two different choices If you choose to by dataset and primary sites, select the dataset first. The selection block for each dataset will appear individually below (the red block in this plot.) Input keyword to search primary site or histology. Select the primary histology. If you choose to select by primary sites and histology, the search will be done on all clinical datasets. Nan: Normal tissues do not have primary histology . Same samples do not have the information about primary histology.

Step 4 Normalization will be done with the gene you selected. Choose the gene you want, then click submit. You will see the result page.

Cell Line Microarray Data—Result Basic information of cell lines and genes/probes Value: the quantiled expression value. Ranking: the rank of the expression value in the array platform of the dataset. Normalized: value normalized based on the gene you selected in Step 4. Click here to download the table as excel file

Sample Microarray Data—Result Value: the quantiled expression value. Ranking: the rank of the expression value in the array platform of the dataset. Normalized: value normalized based on the gene you selected in Step 4. Basic information and expression value of the samples and genes/probes Detail information of the samples

Function 2— Gene Signature Explorer Cluster heatmap with statistical significant genes/probes filtered by the p-value evaluated real time User define groups -Two groups: Student t-test -More than two groups: one-way-ANOVA P-value table provided P-value evaluated Expression data -Mean Clustering Heatmap display

Step 1 & 2 OR Choose the keyword type, the keywords input should be based on this choice. OR Filter from specific genes or probes you input here in the platform you selected in Step 1. Won’t filter with any p-value threshold. If you input more than 600 probes or genes, show the top significant 600 ones. Filter from all the genes or probes in the platform you selected in Step 1 with the p-value threshold you decided.

Click here to add or delete the last group. 2<=group number<=5 Step 3 You need to have at least two groups for the Student’s t-test. Select the dataset first, then select the cell lines or primary sites in the selection block for the dataset. Select at least 3 cell lines in each group to prevent statistical error.(no matter from what datasets) Click here to add or delete the last group. 2<=group number<=5

Result Show at most 600 probes or genes 1. Statistically significant probes or genes that passed the p-value threshold in “all genes” or “all probes” mode 2. All the specific genes or probes you input Genes or probes not found will be showed here in input mode P-value table Cell line names or sample names and the group they belong to.

Function 3— Similarity Assessment Compare the similarity between cell lines or clinical samples Display the PCA plot and Euclid distance table Two different display method: -one dot represents one sample -one dot represents the centroid of the cell line PC1 will be removed to reduce the batch effect when more than two datasets are selected If there are too many samples, the distance table will be provided as download link centroid 3 samples of the same cell line Only one dot to represent the cell line

Click here to add or delete the last group. 1<=group number<=5 Important notice here. Read before you select. The dots in the same group will have the same color. Select the dataset first, then the selection block (green) for the set will appear individually. Select one of the display method as P. 14 described Choose the array platform of datasets in Step 3 Selection block: Select the cell lines. Need to select at least four cell lines totally (no matter in what group) .

Result Tools for screen capture est.. Display method 3D PCA plot. Rotation, zoom in/out are supported. Click the circle to decide which group to display or hide Distance table for each group Information about the PCA plot

Function 4— Profiling Analysis Upload your own csv files and analyze the similarity with database of CellExpress Gene level comparison -At most 1 user group(file) -NGS or microarray data -All zero rows will be removed -Rank invariant normalization Probe level comparison -At most 2 user groups(files) -Only support Affymetrix U133A and U133Plus 2.0 platform -Quantile normalization

Step 1 & 2 OR Select one of the display method as P. 14 described Probe level: Select the array platform of dataset to compare in Step 3. Gene level: Select the array platform of dataset to compare in Step 3. Select the type of your data.

Step 3 & Upload File Format Gene level comparison csv file format: Official gene symbol Sample name Probe level comparison csv file format: Sample name Only for probe level comparison. Can have at most 2 groups . Probe ID

Step 4 The dots in the same group will have the same color. Selection block: Select the cell lines. Need to select at least four cell lines totally (no matter in what group) . Select the dataset first, then the selection block for the set will appear individually. Click this to add or remove the last group. 1<=group number<=3 The dots in the same group will have the same color.

Result Tools for screen capture est.. Display method 3D PCA plot. Rotation, zoom in/out are supported. Click the circle to decide which group to display or hide Distance table for each group Information about the PCA plot