Presentation is loading. Please wait.

Presentation is loading. Please wait.

Eamonn Keogh Li Wei Xiaopeng Xi Stefano Lonardi Jin Shieh Scott Sirowy

Similar presentations


Presentation on theme: "Eamonn Keogh Li Wei Xiaopeng Xi Stefano Lonardi Jin Shieh Scott Sirowy"— Presentation transcript:

1 Eamonn Keogh Li Wei Xiaopeng Xi Stefano Lonardi Jin Shieh Scott Sirowy
Intelligent Icons: Integrating Lite-Weight Data Mining and Visualization into GUI Operating Systems Eamonn Keogh Li Wei Xiaopeng Xi Stefano Lonardi Jin Shieh Scott Sirowy Computer Science & Engineering Dept. University of California – Riverside

2 Eamonn, patent this idea!
Outline Overview An Example: DNA to Intelligent Icon Icon Generation Algorithm Experimental Evaluation Conclusion Eamonn, patent this idea! Christos Faloutsos

3 Dataset Kalpakis_ECG Icons in a traditional browser

4 Dataset Kalpakis_ECG Suppose I magically..
Color the icons to somehow reflect the contents of the file. Position the icons based on their colors/patterns normal1.txt normal10.txt normal11.txt normal12.txt normal13.txt normal2.txt normal3.txt normal4.txt normal5.txt normal6.txt normal7.txt normal8.txt normal9.txt normal14.txt normal15.txt normal16.txt normal17.txt normal18.txt

5 Let us start with visualizing a special data type, DNA.
TGGCCGTGCTAGGCCCCACCCCTACCTTGCAGTCCCCGCAAGCTCATCTGCGCGAACCAGAACGCCCACCACCCTTGGGTTGAAATTAAGGAGGCGGTTGGCAGCTTCCCAGGCGCACGTACCTGCGAATAAATAACTGTCCGCACAAGGAGCCCGACGATAGTCGACCCTCTCTAGTCACGACCTACACACAGAACCTGTGCTAGACGCCATGAGATAAGCTAACACAAAAACATTTCCCACTACTGCTGCCCGCGGGCTACCGGCCACCCCTGGCTCAGCCTGGCGAAGCCGCCCTTCA Let us start with visualizing a special data type, DNA. The DNA of two species… Are they similar? CCGTGCTAGGGCCACCTACCTTGGTCCGCCGCAAGCTCATCTGCGCGAACCAGAACGCCACCACCTTGGGTTGAAATTAAGGAGGCGGTTGGCAGCTTCCAGGCGCACGTACCTGCGAATAAATAACTGTCCGCACAAGGAGCCGACGATAAAGAAGAGAGTCGACCTCTCTAGTCACGACCTACACACAGAACCTGTGCTAGACGCCATGAGATAAGCTAACA

6 C T A G C C C C C T T T T T A A A A A G G G G G 0.20 0.24 0.26 0.30
CCGTGCTAGGGCCACCTACCTTGGTCCGCCGCAAGCTCATCTGCGCGAACCAGAACGCCACCACCTTGGGTTGAAATTAAGGAGGCGGTTGGCAGCTTCCAGGCGCACGTACCTGCGAATAAATAACTGTCCGCACAAGGAGCCGACGATAAAGAAGAGAGTCGACCTCTCTAGTCACGACCTACACACAGAACCTGTGCTAGACGCCATGAGATAAGCTAACA 0.26 0.30

7 C T A G C C C C C C T T T T T T A A A A A A G G G G G G CC CC CT TC TT
CA CG TA AC AT GC GT AA AG GA GG CC CC CC CC CC CC CC CC CT CT CT CT CT CT CT CT TC TC TC TC TC TC TC TC TT TT TT TT TT TT TT TT CCC CCC CCC CCC CCT CCT CCT CCT CTC CTC CTC CTC C C C C C C T T T T T T CCA CCA CCA CCA CCG CCG CCG CCG CTA CTA CTA CTA CA CG TA TC CA CG TA TC CA CA CA CA CA CA CA CA CG CG CG CG CG CG CG CG TA TA TA TA TA TA TA TA TC TC TG TC TG TC TC TC CAC CAC CAC CAC CAT CAT CAT CAT CAA CAA CAA CAA AC AT GC GT AC AT GC GT AC AC AC AC AC AC AC AC AT AT AT AT AT AT AT AT GC GC GC GC GC GC GC GC GT GT GT GT GT GT GT GT A A A A A A G G G G G G AA AG GA GG AA AG GA GG AA AA AA AA AA AA AA AA AG AG AG AG AG AG AG AG GA GA GA GA GA GA GA GA GG GG GG GG GG GG GG GG CCGTGCTAGGGCCACCTACCTTGGTCCGCCGCAAGCTCATCTGCGCGAACCAGAACGCCACCACCTTGGGTTGAAATTAAGGAGGCGGTTGGCAGCTTCCAGGCGCACGTACCTGCGAATAAATAACTGTCCGCACAAGGAGCCGACGATAAAGAAGAGAGTCGACCTCTCTAGTCACGACCTACACACAGAACCTGTGCTAGACGCCATGAGATAAGCTAACA

8 CA CA CA CA CA CA CA CA CA CA AC AC AC AC AC AC AC AC AC AC AT AT AT
1 0.02 0.04 0.09 0.04 0.03 0.07 0.02 CA CA CA CA CA CA CA CA CA CA 0.11 0.03 AC AC AC AC AC AC AC AC AC AC AT AT AT AT AT AT AT AT AT AT AA AA AA AA AA AA AA AA AA AA AG AG AG AG AG AG AG AG AG AG CCGTGCTAGGCCCCACCCCTACCTTGCAGTCCCCGCAAGCTCATCTGCGCGAACCAGAACGCCCACCACCCTTGGGTTGAAATTAAGGAGGCGGTTGGCAGCTTCCCAGGCGCACGTACCTGCGAATAAATAACTGTCCGCACAAGGAGCCCGACGATAGTCGACCCTCTCTAGTCACGACCTACACACAGAACCTGTGCTAGACGCCATGAGATAAGCTAACA

9 OK. Given any DNA string I can make a colored bitmap, so what?
CCGTGCTAGGCCCCACCCCTACCTTGCAGTCCCCGCAAGCTCATCTGCGCGAACCAGAACGCCCACCACCCTTGGGTTGAAATTAAGGAGGCGGTTGGCAGCTTCCCAGGCGCACGTACCTGCGAATAAATAACTGTCCGCACAAGGAGCCCGACGATAGTCGACCCTCTCTAGTCACGACCTACACACAGAACCTGTGCTAGACGCCATGAGATAAGCTAACA

10 African elephant.dna Indian chimpanzee.dna hippopotamus.dna Human.dna orangutan.dna pygmy sperm whale.dna rhesus monkey.dna sperm whale.dna white rhinoceros.dna Indian Indian rhinoceros.dna rhinoceros.dna white white rhinoceros.dna rhesus rhesus monkey.dna monkey.dna pygmy pygmy chimpanzee.dna chimpanzee.dna sperm sperm whale.dna whale.dna Indian Indian hippopotamus.dna hippopotamus.dna chimpanzee.dna chimpanzee.dna elephant.dna elephant.dna Human.dna Human.dna African African orangutan.dna orangutan.dna elephant.dna elephant.dna pygmy pygmy sperm whale.dna sperm whale.dna

11 Note Elephas maximus is the Indian Elephant, Loxodonta africana is the African elephant and Pan troglodytes is the chimpanzee.

12 a b c d Can we make Intelligent Icons for time series? Yes, with SAX!
accbabcdbcabdbcadbacbdbdcadbaacb… c c c b b b aa ab ba bb ac ad bc bd ca cb da db cc cd dc dd a b c d aaa aab aba aac aad abc aca acb acc a a Time Series Bitmap

13 While they are all example of EEGs, example_a
While they are all example of EEGs, example_a.dat is from a normal trace, whereas the others contain examples of spike-wave discharges.

14 We can achieve this with MDS.
We can further enhance the time series bitmaps by arranging the thumbnails by “cluster”, instead of arranging by date, size, name etc We can achieve this with MDS. We can further enhance the time series bitmaps by arranging the thumbnails by “cluster”, instead of arranging by date, size, name etc We can achieve this with MDS. August.txt July.txt June.txt April.txt May.txt Sept.txt Oct.txt Feb.txt Dec.txt March.txt Nov.txt Jan.txt January 100 200 300 December August One Year of Italian Power Demand

15 Text Example Here are some papers that reference Eamonn Keoghs work…

16 Text Example Cluster of “warping” papers Cluster of
classification papers Paper on using “warping” to classify Classification paper in Italian “Warping” paper in Portuguese “classification” papers

17 Intelligent Icon Search

18 Paper Summary We show how to map DNA, time series and natural language into intelligent icons. We give a generic framework for mapping any kind of data into intelligent icons. We show the utility of intelligent icons for finding patterns (clusters, outliers etc)

19 Questions?


Download ppt "Eamonn Keogh Li Wei Xiaopeng Xi Stefano Lonardi Jin Shieh Scott Sirowy"

Similar presentations


Ads by Google