Summer camp: R Day3
Data analysis Warm up! We can use “describe” in psych package to see the number of participant, mean, std of a variable. library(psych) describe(penguins$body_mass_g) result vars n mean sd median trimmed mad min max range X1 1 342 4201.75 801.95 4050 4154.01 889.56 2700 6300 3600 skew kurtosis se X1 0.47 -0.74 43.36 What is TIDY DATA Every column is a variable Every row is an observation Every cell has one value It will benefit a lot if we deal with tidy data, for example, easy for data sharing, reproducible, easy to automate… Data cleaning Remove data hierachically!...