Presentation is loading. Please wait.

Presentation is loading. Please wait.

R Basics Xudong Zou Prof. Yundong Wu Dr. Zhiqiang Ye 18 th Dec. 2013 1.

Similar presentations


Presentation on theme: "R Basics Xudong Zou Prof. Yundong Wu Dr. Zhiqiang Ye 18 th Dec. 2013 1."— Presentation transcript:

1 R Basics Xudong Zou Prof. Yundong Wu Dr. Zhiqiang Ye 18 th Dec

2 R Basics 2  History of R language  How to use R  Data type and Data Structure  Data input  R programming  Summary  Case study

3 3 History of R language

4 4

5 5 Robert Gentleman Ross Ihaka

6 6 History of R language

7 7

8 8

9 9

10 10 History of R language

11 11 History of R language

12 12 History of R language

13 13 History of R language

14 14 History of R language

15 15 History of R language

16 16 History of R language

17 : Version: R-3.0.2

18 18 History of R language

19 19 History of R language

20 20 History of R language

21 21 History of R language

22 22 History of R language

23 23 History of R language 5088

24 What is R? R is a programming language, and also a environment for statistics analysis and graphics Why use R R is open and free. Currently contains 5088 packages that makes R a powerful tool for financial analysis, bioinformatics, social network analysis and natural language process and so on. More and more people in science tend to learn and use R # BioConduct: bioinformatics analysis(microarray) # survival: Survival analysis

25 控制台 从这里输 入命令 How to use R

26 ?用来获 取帮助 新建或打 开 R 脚本 点这里添 加 R 包 How to use R

27 Data type and Data structure numeric : integer, single float, double float character complex logical Data structure in R: Data type in R : ObjectsClass Mixed-class permitted ? Vectornumeric, char, complex, logicalno Factornumeric, charno Arraynumeric, char, complex, logicalno Matrixnumeric, char, complex, logicalno Data framenumeric, char, complex, logicalyes listnumeric, char, complex, logical, func, exp…yes

28 28 Vector and vector operation Vector is the simplest data structure in R, which is a single entity containing a collection of numbers, characters, complexes or logical. # Create two vectors: # Check the attributes: # basic operation on vector: 注意这个向 左的箭头

29 29 Vector and vector operation # basic operation on vector: > max( vec1) > min (vec1) > mean( vec1) > median(vec1) > sum(vec1) > summary(vec1) > vec1 > vec1[1] > x <- vec1[-1] ; x [1] > vec1[7] <- 15;vec1

30 30 array and matrix > x <- 1:24 > dim( x ) <- c( 4,6) # create a 2D array with 4 rows and 6 columns > dim( x ) <- c(2,3,4) # create a 3D array An array can be considered as a multiply subscripted collection of data entries.

31 31 array and matrix > x <- 1:24 > array( data=x, dim=c(4,6)) > array( x, dim= c(2,3,4) ) array() array indexing > x <- 1:24 > y <- array( data=x, dim=c(2,3,4)) > y[1,1,1] > y[,,2] > y[,,1:2]

32 32 array and matrix > class(potentials) # “matrix” > dim(potentials) # > rownames(potentials) # GLY ALA SER … > colnames(potentials) # GLY ALA SER … > min(potentials) # -4.4 Matrix is a specific array that its dimension is 2

33 33 list List is an object that containing other objects as its component which can be a numeric vector, a logical value, a character or another list, and so on. And the components of a list do not need to be one type, they can be mixed type. >Lst <- list(drugName="warfarin",no.target=3,price=500, + symb.target=c("geneA","geneB","geneC") >length(Lst) # 4 >attributes(Lst) >names(Lst) >Lst[[1]] >Lst[[“drugName”]] >Lst$drugName

34 34 Data Frame A data frame is a list with some restricts: ① the components must be vectors, factors, numeric matrices, lists or other data frame ② Numeric vectors, logicals and factors are included as is, and by default character vectors are coerced to be factors, whose levels are the unique values appearing in the vector ③ Vector structures appearing as variables of the data frame must all have the same length, and matrix structures must all have the same row size Names of components

35 35 Data Frame > names(cars) [1] "Plant" "Type" "Treatment" "conc" "uptake“ > length(cars) # 2 > cars[[1]] > cars$speed # recommended > attach(cars) # ?what’s this > detach(cars) > summary(cars$conc) # do what we can do for a vector

36 36 Data Input scan(file, what=double(), sep=“”, …) # scan will return a vector with data type the same as the what give. read.table(file, header=FALSE, sep= “ ”, row.names, col.names, …) # read.table will return a data.frame object # my_data.frame <- read.table("MULTIPOT_lu.txt",row.names=1,header=TRUE) # from SPSS and SAS library(Hmisc) mydata <- spss.get(“test.file”,use.value.labels=TRUE) mydata <- sasxport.get(“test.file”) #from Stata and systat library(foreign) mydata<- read.dta(“test.file”) mydata<-read.systat(“test.file”) # from excel library(RODBC) channel <- odbcConnectExcel(“D:/myexcel.xls”) mydata <- sqlFetch(channel, “mysheet”) odbcclose(channel) From other software load package

37 37 Operators

38 38 Control Statements R Programming # switch( statement, list) # repeat {…}

39 39 Function R Programming Definition : Example : matrix.axes <- function(data) { x <- (1:dim(data)[1] - 1) / (dim(data)[1] - 1); axis(side=1, at=x, labels=rownames(data), las=2); x <- (1:dim(data)[2] - 1) / (dim(data)[2] - 1); axis(side=2, at=x, labels=colnames(data), las=2); }

40 40 Summary Data type and Data Structure numeric, character, complex, logical vector, array/matrix, list, data frame Data Input scan, read.table load from other software: SPSS, SAS, excel Operators : <- R Programming:

41 41 Case study Residue based Protein-Protein Interaction potential analysis : Lu et al. (2003) Development of Unified Statistical Potentials Describing Protein-Protein Interactions, Biophysical Journal 84(3), p

42 42 Reference CRAN-Manual : Quick-R : R tutorial : MOAC : our/ our/

43 43


Download ppt "R Basics Xudong Zou Prof. Yundong Wu Dr. Zhiqiang Ye 18 th Dec. 2013 1."

Similar presentations


Ads by Google