Presentation is loading. Please wait.

Presentation is loading. Please wait.

Show & Tell Limsoon Wong KRDL Datamining: Turning Biological Data into Gold.

Similar presentations


Presentation on theme: "Show & Tell Limsoon Wong KRDL Datamining: Turning Biological Data into Gold."— Presentation transcript:

1

2 Show & Tell Limsoon Wong KRDL Datamining: Turning Biological Data into Gold

3 Show & Tell Jonathan’s rules: Blue or Circle Jessica’s rules: All the rest What is Datamining? Whose block is this? Jonathan’s blocks Jessica’s blocks

4 Show & Tell What is Datamining? Question: Can you explain how?

5 Show & Tell What are the Benefits?  To the patient:  Better drug, better treatment  To the pharma:  Save time, save cost, make more $  To the scientist:  Better science

6 Show & Tell The Datamining Process

7 Show & Tell Epitope Prediction TRAP-559AA MNHLGNVKYLVIVFLIFFDLFLVNGRDVQNNIVDEIKYSE EVCNDQVDLYLLMDCSGSIRRHNWVNHAVPLAMKLIQQLN LNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIRS LLSTNLPYGRTNLTDALLQVRKHLNDRINRENANQLVVIL TDGIPDSIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNR FLVGCHPSDGKCNLYADSAWENVKNVIGPFMKAVCVEVEK TASCGVWDEWSPCSVTCGKGTRSRKREILHEGCTSEIQEQ CEEERCPPKWEPLDVPDEPEDDQPRPRGDNSSVQKPEENI IDNNPQEPSPNPEEGKDENPNGFDLDENPENPPNPDIPEQ KPNIPEDSEKEVPSDVPKNPEDDREENFDIPKKPENKHDN QNNLPNDKSDRNIPYSPLPPKVLDNERKQSDPQSQDNNGN RHVPNSEDRETRPHGRNNENRSYNRKYNDTPKHPEREEHE KPDNNKKKGESDNKYKIAGGIAGGLALLACAGLAYKFVVP GAATPYAGEPAPFDETLGEEDKDLDEPEQFRLPEENEWN

8 Show & Tell Epitope Prediction Results  Prediction by our ANN model for HLA-A11  29 predictions  22 epitopes  76% specificity 1 66 100 Rank by BIMAS Number of experimental binders 19 (52.8%) 5 (13.9%) 12 (33.3%)  Prediction by BIMAS matrix for HLA-A*1101

9 Show & Tell Gene Expression Analysis  Clustering gene expression profiles  Classifying gene expression profiles  find stable differentially expressed genes

10 Show & Tell Gene Expression Analysis Results The Discovery System Correlation test Voter selection Class prediction

11 Show & Tell Protein Interaction Extraction “What are the protein-protein interaction pathways from the latest reported discoveries?”

12 Show & Tell Protein Interaction Extraction Results  Rule-based system for processing free texts in scientific abstracts  Specialized in  extracting protein names  extracting protein-protein interactions

13 Show & Tell Transcription Start Prediction

14 Show & Tell Transcription Start Prediction Results

15 Show & Tell Medical Record Analysis  Looking for patterns that are  valid  novel  useful  understandable

16 Show & Tell Medical Record Analysis Results  DeEPs, a novel “emerging pattern’’ method  Beats C4.5, CBA, LB, NB, TAN in 21 out of 32 UCI benchmarks  Works for gene expressions

17 Show & Tell Under the Hood  Artificial neural network  Neighbourhood analysis  Non-linear analysis  Template matching  Emerging pattern  Hidden markov models  Bayesian inference  Decision tree induction ...

18 Show & Tell Behind the Scene  Epitope Prediction  Vladimir Brusic  Judice Koh  Seah Seng Hong  Zhang Guanglan  Yu Kun  Transcription Start Prediction  Vladimir Bajic  Seah Seng Hong  Gene Expression Analysis  Zhang Louxin  Zhang Zhuo  Zhu Song  Medical Records  Li Jinyan  Protein Interaction Extraction  Ng See Kiong  Zhang Zhuo


Download ppt "Show & Tell Limsoon Wong KRDL Datamining: Turning Biological Data into Gold."

Similar presentations


Ads by Google