Chapter 2 Getting to Know Your Data Yubao (Robert) Wu Georgia State University
Chapter 2 Getting to Know Your Data Data Objects and Attribute Types Basic Statistical Descriptions of Data Data Visualization Measuring Data Similarity and Dissimilarity
Printer Forensics ? Printer Forensics Based on Page Document's Geometric Distortion Yubao Wu, Xiangwei Kong, Xingang You, and Yiping Guo IEEE International Conference on Image Processing (ICIP), 2009
Printer Forensics scan Digital Image Feature extraction Darkness Dots around the letters SVM
print scan PDF FileScanned Image Printer Forensics Based on Page Document's Geometric Distortion Yubao Wu, Xiangwei Kong, Xingang You, and Yiping Guo IEEE International Conference on Image Processing (ICIP), 2009
Least Squares – Slope of Each Line X-coordinate of “e” in one line Y-coordinate Printer Forensics Based on Page Document's Geometric Distortion Yubao Wu, Xiangwei Kong, Xingang You, and Yiping Guo IEEE International Conference on Image Processing (ICIP), 2009
Slope of Each Line Line Number Matlab - plot Why are the slopes monotonically increasing? Printer Forensics Based on Page Document's Geometric Distortion Yubao Wu, Xiangwei Kong, Xingang You, and Yiping Guo IEEE International Conference on Image Processing (ICIP), 2009
IdealIn practice Age of the Printer !
Experimental Results Each data point = 1 Page = 1 Scanned Image 100% Accuracy Scatterpl ot
Feature Extraction == Data Reduction Original Scan Image 2550 x 3508 pixels 41 lines; 72 “e” in each line 41 x 72 = 2911 data points 41 slopes for the 41 lines 2550 pixels 3508 pixels 41 double values
Matlab Plot Function plot(vX,vY,'kx'); hold on; Matlab default settingsModify the Settings
Matlab Plot Function plot(X,Y,'kx'); hold on; Matlab Default Settings Modify the Settings hFigHandle = figure(1); set(gca,'FontSize',40); plot(X,Y,'kx','LineWidth',4,'MarkerE dgeColor','k','MarkerFaceColor','w', 'MarkerSize',20); hold on; xlim([1 30]); ylim([ ]); set(hFigHandle, 'Position', [ ]);
Text Cloud
Fisher’s Iris Data Scatterplo t
Fisher’s Iris Data
Microsoft Bing MapsGoogle Maps
ColorBrewer 2.0 How to Choose Colors? Matlab LineSpec (Line Specification) SpecifierColor rRed gGreen bBlue cCyan mMagenta yYellow kBlack wWhite Microsoft PowerPoint
How to Choose Qualitative Colors?
Qualitative Colors
Sequential Colors
Data-driven documents
The Open Graph Viz Platform