Download presentation
Presentation is loading. Please wait.
Published byJuan Houston Modified over 10 years ago
1
DBBM CESMG G. Paolella CEINGE
2
CSI INTERNET CEINGE University Campus
3
CAPRI Image restoration and analysis Comparative Genomics Francesco Salvatore 0503 Research and Services in Bioinformatics
4
- Comparative genomics - DG-CST - KinWeb - Non Coding RNAs - Bacterial - Eukaryotic - Cell motility Research subjects
5
Conserved Sequence Tags (CST)
6
DG-CST
7
DG-CST DB
8
Genome browser
9
KinWeb
10
(a) (b) (c) (d) (e) KinWeb DB
11
Three genes a) b) Ig-IIg-IIIg-IIITMTyr Kinase // CSTs Ser-Thr Kinase CST // Ser-Thr Kinase c) // acb CST IIIIII
12
Selection of homologous chromosome regions from human and mouse genomes. Comparison of selected regions using BLASTZ, a program based on a local similarity algorhitm. Further analysis on the dataset looking for subpopulations sharing specific characteristics, using different programs, such as: - Blast of CSTs vs EST, human and other species genomes - Program for calculation of CPS score (Coding Potential Score) - RNA structure prediction programs Selection of the definitive set of CSTs based on specified thresholds (identity >= 70%; length >= 100 bp) using StrongHits. Insertion of selected CSTs into DB and extensively annotation for: - type (i.e. intergenic, exonic etc.) according to Ensembl - Coding capability according to Ensembl - Distances from other genes and coding regions - Calculation of Log Score according to UCSC comparison of human and mouse genomes Masking sequences of repetitive elements to reduce the noise fatally introduced by repeated sequences through RepeatMasker. Pipeline
13
Pipeline units
14
Non coding RNAs ncRNA DNA transcription reverse transcription Proteins translation mRNA tRNA rRNA Antisense miRNA transcription/maturation snoRNA maturation Self-splicing intron snRNA Imprinting H19, AIR X inactivation XIST Chromatin structure dynamics small RNAs DNA demethylation KHPS1a
15
Bacterial SLSs
16
SLS Families
17
Position in the genome Position
18
Alignment
19
RNAz P = 0.99 PFOLD Secondary structures
20
Processing time
21
4x14x2=112 procs 2.8 GHz 4x14x2=112 GB RAM 2 GB/s per scheda - 4 GB/s aggregata Cluster
22
Bioinfo portal
23
Servizi bioinformatici per la ricerca gia attivi Francesco Salvatore 0503 Circa 100 banche dati di interesse biologico accessibili mediante SRS (sequenze nucleotidiche, genomi, mutazioni, malattie ereditarie, enzimi, etc.) Sistema integrato per analisi di dati biologici con oltre 150 programmi per analisi di sequenze, modelli evolutivi, studio di mutazioni, proteine etc. Banche dati realizzate nellambito di progetti di ricerca (DG-CST, KinWEB, etc.) Sistemi per la gestione di dati sperimentali (campioni biologici, sequenze, immagini da microscopia etc.)
24
Research and services Research and Services in Bioinformatics CAPRI Image restoration and analysis Comparative Genomics
25
CEINGE DBBM IIGB BIOGEM Facolta di Medicina Facolta di Biotecnologie Altre Facolta Pubblico (accesso limitato) Francesco Salvatore 0503 Servizi: chi ha accesso ?
26
WEB SERVER CAPRI SRS PISE Other Emboss Fasta Blast User Data DB Primary remote databases ENSEMBL Services organization
27
Graphic interface to programs
28
CAPRI
29
Various operations in a row: Complement ->Translation -> Isoelectric point of the resulting protein. DNA Complement Translation Isoelectric point CAPRI workflow
30
CGI Plugin Object Pise Plugin Object CLI Simple Programs Plugin Object CURL Base Obj. Plugin Object SOAP Plugin Object JEMBOSS Program Object Tasks Obj. Menu Table Disk Buffering BLAST FASTA EMBOSS HMMer Genscan ClustalW Programmi Dischi del Server Phylip CLIENTSERVER CAPRI Program Object Program Object Legenda Relazione tra oggetti: Uso Eredità Esecuzione programmi Trasferimento dati Relazione temporale CAPRI architecture
31
ClusterCluster Cluster Nodes Access Server Access Server Access Server For each user request, a process is launched on a different node Distributed execution
32
Cluster Broker Web applicatio n server DB server Cluster Manager Cluster Manager 3 – Request the status of the cluster 5 - launch the command on the node 1 – Run a command 2 – Request a node IP 4 – Search for the best resource and return the corresponding node IP Relational DB 6 – Return the result Cluster activity http
34
Broker virtual node virtual node DB Grid node
35
PROGETTO DI RICERCA -------------- *Cell line *Colture conditions *Fixation and inclusion methods, stainings, ecc *Objective *Focus Position *Stage position x/y *Project title *Experiment name, *Author, group, group leader, ecc. WEB INTERFACE *Exposure time *Resolution, ecc. DB Image archival and management
36
Image-DB interface
37
timelapse at 6 positions timelapse actin wound healing timelapse 2 adhesion actin staining IPROC
38
HPC on Cluster nodes GatewayGateway iPage image area data + images page iPane proc- steps IPROC architecture
39
ClusterCluster Cluster Nodes Access Server Access Server Access Server A tool can require the execution of multiple, simultaneous processes Distributed execution of parallel requests
40
-PHP internal routines (basic drawing, processing) -ImageMagick (more advanced processing) -Image converters -Special tools (PDL, deconvolution) -Tools developed in-house (cell tracking) -...... What software may be linked
41
-Convenient graphic interface -Access to a vast library of image processing steps -No specific interface requirements -Remote processing on parallel hardware -Support for a large number of concurrent users -System independent (works on Mac, PC, Linux etc.) -No need to install. A browser is enough. Advantages
Similar presentations
© 2024 SlidePlayer.com Inc.
All rights reserved.