Professional Documents
Culture Documents
Introduction
Presentation Title Goes Here to R: mydata[3,4]
Data Manipulation and Statistical Analysis
…presentation subtitle.
Data Manipulation
Violeta I. Bartolome
Senior Associate Scientist-Biometrics
Crop Research Informatics Laboratory
International Rice Research Institute
o mydata[,3] o mydata[3:6]
o mydata[3] o mydata[-c(1,2)]
To create a data frame containing Y1 To create a data frame containing Y1, Y2, Y3, Y4
Dataset
myA<- mydata[“Y1”] myB<- mydata[c(3,4,5,6)]
:: color, composition, and layout :: color, composition, and layout
Selecting Variables
Selecting Observations
• Select variables Y1, Y2, Y3, Y4
• Select observation numbers 3 to 8
o myB<-data.frame(mydata$Y1, mydata$Y2,
o mydata[3:8, ]
mydata$Y3, mydata$Y4)
o mydata[-c(1,2), ]
this is equivalent to
attach(mydata) • Select observations of Site B
o mydata[mydata$Site==“B”, ]
myB<-data.frame(Y1,Y2,Y3,Y4)
o subset(mydata,subset=Site==“B”)
detach(mydata)
o mydata[which(mydata$Site==“B”),]
o myB<-subset(mydata, select=Y1:Y4)
back
forward
Hands-on :: color, composition, and layout Hands-on :: color, composition, and layout
Parallel to Serial
Sorting Data Frames
• Sort by Trt and Site
mydataSorted <-
mydata[order(mydata$Trt,
mydata$Site), ]
idvar used as
row names
row.names(data.parallel) <-
1:NROW(data.parallel)
data.parallel
meanY na.rm=TRUE)
na.rm=TRUE na.rm=FALSE
meanYs