当前位置:文档之家› 描述性统计

描述性统计

专题二 描述性统计

通过图表、数值的描述

单变量、多变量

分类表量、数值变量

1、分类变量:频数

2、数值变量:集中趋势(取决于分布形态)、离散程度(全距、四分位距(利用位置信息),方差、标准差、变异系数)、分布形态(偏度、峰度)

更多关注分布的研究(histogram、pie chart)

作业2:打开mtcars,保存excel格式,选cyl(gear)做条形图,饼图,(颜色,主标题,颜色)

mpg分布(直方图等),语言描述图

提取一个表格,drat mpr wt均值,最大,最小,四分位数,标准差,偏度峰度,小数点3位。

data<-data.frame(mtcars)

data

write.table(data,"D:/data.csv",sep=",")

attach(data)

barplot(cyl,border = "red",main = "bar",axes=T)

table(gear)

pie(gear,border="blue",main = "bingtu")

hist(mpg,border = "red",axes=T)

mean(mpg)

mean(drat)

mean(wt)

summary(wt)

summary(drat)

summary(mpg)

mydata<-function(x)c(mina=min(x),maxa=max(x),meana=mean(x),sda=sd(x))

sapply(data.frame(mpg,drat,wt),mydata)

多变量

数值描述:相关系数、以定性数据为分组依据、图表描述(散点图矩阵(点颜色,形状),气泡图(气泡大小),)

data<-data.frame(mtcars)

data

write.table(data,"C:/data.csv",sep=",")

attach(data)

barplot(cyl,border = "red",main = "bar",axes=T)

table(gear)

pie(gear,border="blue",main = "bingtu")

hist(mpg,border = "red",axes=T)

mean(mpg)

mean(drat)

mean(wt)

summary(wt) summary(drat)

summary(mpg)

mydata<-function(x)c(mina=min(x),maxa=max(x),meana=mean(x),sda=sd(x))

sapply(data.frame(mpg,drat,wt),mydata)

library(graphics)

library(car)

library(scatterplot3d)

library(symbols)

plot(wt,mpg,col=cyl)

pchisq(wt,2)

?hist

install.pages("vcd")

library(vcd)

library(grid)

mosaicplot(~cyl+vs+am,data=mtcars,color=TRUE,border="red")

Data assumption:interval or ratio level;linear related;bivariate normally distributed

Hypothesis Testing

P-value and the method of judgement:p

相关主题