Presentation is loading. Please wait.

Presentation is loading. Please wait.

In the name of GOD. Zeinab Mokhtari 1-Mar-2010 In data analysis, many situations arise where plotting and visualization are helpful or an absolute requirement.

Similar presentations


Presentation on theme: "In the name of GOD. Zeinab Mokhtari 1-Mar-2010 In data analysis, many situations arise where plotting and visualization are helpful or an absolute requirement."— Presentation transcript:

1 In the name of GOD

2 Zeinab Mokhtari 1-Mar-2010

3 In data analysis, many situations arise where plotting and visualization are helpful or an absolute requirement for understanding. scatter plots Plotting = visualization = graphing plotting some measured result against some parameter in a Cartesian co-ordinate system the entries from two vectors of the same size are plotted pairwise in the Cartesian co-ordinate system. Geographical maps satellite images Cartesian plotting Contour plots Scatter plots Line plots Images Bar plots Loading plots Score plots Biplots Joint plots

4

5 Latent variable methods unit-free plots The items plotted against each other (scores, loadings) are based on the same measured data but projected differently. principal component analysis orthogonal scores orthonormal loadings the co-ordinate systems used for the score plots PCA PLS regression factor analysis PARAFAC … scores and loadings

6

7 PLOTTING IN COMPONENT MODELS Sign inversion PCA scores and loadings are mirrored together.

8 Figure 1. The mean-centered data Figure 2. The score plot after PCA Figure 3.The normalized scoreplot

9

10 PLOTTING IN PARTIAL LEAST SQUARES REGRESSION a linear relationship with slope one Outliers non-linearity Grouping of data …

11 The singular value is distributed equally among the u and v parts (scores and loadings) for the purpose of forming new variables h and g to be plotted. The special cases of c=0 and 1 c=1 row metric-preserving version Euclidean distances between objects and Mahalanobis distances between variables c=0 column metric-preserving version Euclidean distances between variables and Mahalanobis distances between objects For almost equal number of objects and variables Biplots

12 A compensation for number of objects (I) and variables (K) is made by introducing a fudge or zoom factor z: Biplots can be expanded to the use of three- way loadings, especially for Tucker3 models. Then they get the name joint plots.

13 suitable for the elucidation of the similarities and dissimilarities among the columns and rows of two-dimensional data matrices cannot be employed for the evaluation of arrays of higher dimensions Principal component analysis (PCA), a versatile and easy-to-use multivariate mathematical–statistical method

14

15 Three-way analysis by PARAFAC not orthogonal loadings N-WAY TOOLBOX aa b b c c

16 Tucker3 model or three-way PCA analysis of the similarities and dissimilarities among N-dimensional data arrays The Tucker3 model computes three orthogonal matrices with lower dimensions than the original data arrays such a manner that the variance explained by the reduced matrices being as high as possible. N-WAY TOOLBOX A B C G X

17 Cluster analysis The reduction of the dimensionality of multidimensional arrays Projection of the points scattered in the multidimensional space on a plane, such a manner that the distances among the points in the multidimensional space on the plane are as similar as possible The objectives of the study were the measurement of the microbiological effect of benzimidazolium salts containing various anions, the application of the combination of Tucker3 model and cluster analysis for the evaluation of the dependence of the microbiological effect on the type of test organism, chemical structure of the free benzimidazolium base and the type of cation.

18 The free base and the salts formed with Cl −, SO 4 2−, PO 4 3− and NO 3 −

19 Species tested for the microbiological activity (altogether 15 species)

20 The Tucker3 model has been employed for the three dimensional data matrix consisting of the inhibitory activity of seven benzimidazole derivatives (factor I), the presence and type of anion (factor II) and the 15 test organisms (factor III) (3-way array with dimensions 7, 5, 15). Arrays of the largest possible dimensions (6, 4, 14) The arrays explaining more than 0.28% of the total variance (in this case 3, 2, 3)

21

22 the total variance explained : 99.75%

23 the total variance explained : 98.05%

24 Fig. 1. Plot of the first two elements of component matrix I Fig. 2. Cluster dendogram of component matrix I The distribution of benzimidazole derivatives is highly similar on both figures. Similarity and dissimilarity of microbiological activity

25 Fig. 3. Plot of component matrix II The presence of sulfate anion may have a considerable impact on the biological efficacy of benzimidazole derivatives.

26 Fig. 4. Plot of the first two elements of component matrix III Fig. 5. Cluster dendogram of component matrix III

27 It can be concluded from the results that a Tucker3 model combined with cluster analysis can be successfully used for the study of the microbiological activity of benzimidazolium salts and separates the effect of the type of benzimidazole derivatives and saltforming anions.

28 Five different breads were baked in replicates giving a total of ten samples. Eight different judges assessed the breads with respect to eleven different attributes. The data can be regarded as a three-way array (10 × 11 × 8) or alternatively as an ordinary two-way matrix (10 × 88).

29

30 Always enjoy life, no matter how hard it seems! When life gives you a thousand reasons to cry, show the world that you have million reasons to SMILE!


Download ppt "In the name of GOD. Zeinab Mokhtari 1-Mar-2010 In data analysis, many situations arise where plotting and visualization are helpful or an absolute requirement."

Similar presentations


Ads by Google