SAS Module 4 Regression Analysis


Simple Linear Regression:
one independent variable
Multiple Linear Regression:
two or more independent variables
Regression Goal:
find the line that most closely matches the observed relationship between X and Y, where “most closely” means the line with the minimum RSS (residual sum of squares).
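
As a rough illustration of the goal above, the sketch below fits a simple linear regression with the closed-form least-squares formulas and checks that nudging the coefficients only increases the RSS. The data values and the function names (`fit_simple_ols`, `rss`) are made up for this example and are not part of SAS.

```python
def fit_simple_ols(xs, ys):
    """Return (intercept, slope) that minimize the residual sum of squares."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return intercept, slope


def rss(xs, ys, intercept, slope):
    """Residual sum of squares of the line y = intercept + slope * x."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))


# Hypothetical data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

b0, b1 = fit_simple_ols(xs, ys)
best_rss = rss(xs, ys, b0, b1)

# Any other line should have an RSS at least as large as the fitted one.
print(b0, b1, best_rss)
print(rss(xs, ys, b0 + 0.1, b1), rss(xs, ys, b0, b1 - 0.1))
```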
Standard Error:
measures how close the coefficient estimates are likely to be to the true values. For the regression as a whole we use the RSE (residual standard error), a measure of how much the dependent variable deviates from its estimated value. The main reason to calculate the SE is to answer two questions:

  • What is the likely range for the true values of the coefficients? (95% confidence interval)
  • Do the independent variables influence the value of the dependent variable in a statistically significant way?
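
The two questions above can be sketched for the slope of a simple linear regression as follows. This is a minimal example on made-up data. The function name `slope_inference` is hypothetical, and the interval uses the common "estimate ± 2 × SE" approximation rather than the exact t quantile with n − 2 degrees of freedom that a statistics package would use.

```python
import math


def slope_inference(xs, ys):
    """Slope estimate, its standard error, t-statistic and approximate 95% CI."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean

    residual_ss = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    rse = math.sqrt(residual_ss / (n - 2))  # residual standard error
    se_slope = rse / math.sqrt(sxx)         # standard error of the slope
    t_stat = slope / se_slope               # tests H0: slope = 0
    # Approximate 95% interval (assumption: multiplier 2 instead of the t quantile).
    ci = (slope - 2 * se_slope, slope + 2 * se_slope)
    return {"slope": slope, "se": se_slope, "t": t_stat, "ci": ci}


# Hypothetical data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
result = slope_inference(xs, ys)
print(result)
```

If the interval excludes 0, or equivalently the t-statistic is large in absolute value, the data suggest that X influences Y.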

Hypothesis Testing: by p-value.
For a simple linear regression model, if the p-value is small enough (typically less than 5% or 1%), we reject the null hypothesis (no relationship between X and Y).

If we reject the null hypothesis, we need the RSE and R^2 to see how well the model fits the data.
R^2 is usually the easier of the two to interpret because it is unitless and lies in the range [0, 1]: 1 means a perfect model, and 0 means the model explains none of the observed variation.
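
To make the two fit measures concrete, here is a small sketch that computes R^2 and RSE from observed and predicted values. The helper names and the numbers are illustrative assumptions. The `fitted` values come from a hypothetical line y = 0.05 + 1.99x.

```python
import math


def r_squared(ys, preds):
    """R^2 = 1 - RSS / TSS."""
    y_mean = sum(ys) / len(ys)
    tss = sum((y - y_mean) ** 2 for y in ys)
    rss = sum((y - p) ** 2 for y, p in zip(ys, preds))
    return 1 - rss / tss


def residual_standard_error(ys, preds, n_predictors):
    """RSE = sqrt(RSS / (n - p - 1)), where p is the number of predictors."""
    rss = sum((y - p) ** 2 for y, p in zip(ys, preds))
    return math.sqrt(rss / (len(ys) - n_predictors - 1))


# Hypothetical observations and three sets of predictions.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
perfect = list(ys)                           # predicts every point exactly
mean_only = [sum(ys) / len(ys)] * len(ys)    # ignores X entirely
fitted = [0.05 + 1.99 * x for x in xs]       # a fitted straight line

print(r_squared(ys, perfect))    # perfect model
print(r_squared(ys, mean_only))  # explains none of the variation
print(r_squared(ys, fitted), residual_standard_error(ys, fitted, 1))
```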
For a multiple linear regression model, the null hypothesis H0 is that all the regression coefficients (other than the intercept) are zero, and the alternative H1 is that at least one of them is nonzero.
We decide whether to reject H0 using the F-statistic. In SAS, we use the p-value associated with the F-statistic (Pr > F); if it is small enough, we reject H0.
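
For reference, the standard textbook form of this test can be written as below. The notation is an assumption of this note, not taken from the SAS output: n is the number of observations, p the number of predictors, TSS = Σ(y_i − ȳ)^2 the total sum of squares, and RSS = Σ(y_i − ŷ_i)^2 the residual sum of squares.

```latex
H_0:\ \beta_1 = \beta_2 = \cdots = \beta_p = 0
\qquad
H_1:\ \text{at least one } \beta_j \neq 0

F = \frac{(\mathrm{TSS} - \mathrm{RSS}) / p}{\mathrm{RSS} / (n - p - 1)}
```

When H0 is true, F is expected to be close to 1. A value much larger than 1 gives a small Pr > F and is evidence against H0.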
Independent variables can be qualitative or quantitative.
If a categorical predictor has more than two levels, we create one fewer dummy variable than the number of levels; the level without its own dummy variable serves as the baseline. For example, weight status can be represented by dummy variables such as “Normal” vs. “Not normal” and “Overweight” vs. “Not overweight”, each coded as 1 or 0.
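
A minimal sketch of this encoding is shown below. The third level, "Underweight", and the function name `dummy_encode` are assumptions added for illustration; the notes do not say which levels the original variable has.

```python
def dummy_encode(values, baseline):
    """Encode a categorical column as k - 1 dummy (0/1) columns.

    The baseline level gets no column; it is represented by all zeros.
    """
    levels = sorted(set(values))
    if baseline not in levels:
        raise ValueError("baseline must be one of the observed levels")
    dummy_levels = [level for level in levels if level != baseline]
    rows = [[1 if v == level else 0 for level in dummy_levels] for v in values]
    return dummy_levels, rows


# Hypothetical weight-status column with three levels.
status = ["Normal", "Overweight", "Underweight", "Normal"]
columns, encoded = dummy_encode(status, baseline="Underweight")

print(columns)  # one column per non-baseline level
for row in encoded:
    print(row)
```

Three levels give two dummy columns, and a row of all zeros identifies the baseline level.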

Model Selection in SAS: based on variable importance

  • optimize the subset of variables with Backward selection

Model Extension: Interactions

  • Sometimes the interaction between several predictors has a larger influence on the response than the predictors have on their own.
  • In SAS, interaction effects can be detected with an interaction plot: group the data by the possible values of the hypothesized interaction variable and plot each group separately against the dependent variable. If the resulting lines or scatter plots are “parallel” or have roughly the same shape (even at different levels), there is likely no significant interaction. Otherwise, we should consider adding an interaction regressor to the model.
  • In SAS, just use “+ New Data Item” and select “interaction effect…”
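
The "parallel lines" check described above can be approximated numerically by fitting a separate slope per group and comparing them. The sketch below uses made-up data in which the slope differs between two hypothetical groups "A" and "B", then builds the corresponding interaction regressor as the product of X and a 0/1 group indicator. All names are illustrative and are not SAS functions.

```python
from collections import defaultdict


def slope(xs, ys):
    """Least-squares slope of y on x."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    return sxy / sxx


def slopes_by_group(xs, ys, groups):
    """Fit one slope per group; clearly different slopes hint at an interaction."""
    buckets = defaultdict(lambda: ([], []))
    for x, y, g in zip(xs, ys, groups):
        buckets[g][0].append(x)
        buckets[g][1].append(y)
    return {g: slope(gx, gy) for g, (gx, gy) in buckets.items()}


# Hypothetical data: the effect of x on y depends on the group.
xs = [1, 2, 3, 4, 1, 2, 3, 4]
groups = ["A"] * 4 + ["B"] * 4
ys = [1 + 2 * x if g == "A" else 1 + 5 * x for x, g in zip(xs, groups)]

group_slopes = slopes_by_group(xs, ys, groups)
print(group_slopes)  # non-parallel lines: the slopes differ by group

# Interaction regressor: x multiplied by a 0/1 indicator for group "B".
interaction = [x * (1 if g == "B" else 0) for x, g in zip(xs, groups)]
print(interaction)
```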

Polynomial Regression: add squared, cubic, or quartic terms to the model

  • The model should be neither too complex (over-fit) nor too simple (under-fit).
  • Adding more polynomial terms to a model may lead to overfitting, so we partition the available data into a training set and a validation set.
  • Rate model performance on the validation data, and select the simplest model with the best validation assessment.
  • In SAS, just use “+ New Data Item” and select “Partition…”
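
The partition-and-validate idea can be sketched outside SAS as follows. This is a minimal example under stated assumptions: the data are simulated from a quadratic curve, the split is 70/30, the validation assessment is the mean squared error, and the polynomial fit solves the normal equations directly, which is adequate only for low degrees. None of these choices are claimed to match what SAS does internally.

```python
import random


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_poly(xs, ys, degree):
    """Least-squares coefficients [b0, b1, ..., b_degree] via the normal equations."""
    k = degree + 1
    xtx = [[sum(x ** (i + j) for x in xs) for j in range(k)] for i in range(k)]
    xty = [sum((x ** i) * y for x, y in zip(xs, ys)) for i in range(k)]
    return solve(xtx, xty)


def mse(coefs, xs, ys):
    """Mean squared error of the polynomial with the given coefficients."""
    preds = [sum(c * x ** i for i, c in enumerate(coefs)) for x in xs]
    return sum((y - p) ** 2 for y, p in zip(ys, preds)) / len(ys)


# Simulated data (assumption): a quadratic trend plus noise.
rng = random.Random(0)
xs = [rng.uniform(-3, 3) for _ in range(60)]
ys = [1 + 2 * x - 0.5 * x ** 2 + rng.gauss(0, 0.5) for x in xs]

# Partition: 70% training, 30% validation.
idx = list(range(len(xs)))
rng.shuffle(idx)
train, valid = idx[:42], idx[42:]
x_train, y_train = [xs[i] for i in train], [ys[i] for i in train]
x_valid, y_valid = [xs[i] for i in valid], [ys[i] for i in valid]

results = {}
for degree in range(1, 5):
    coefs = fit_poly(x_train, y_train, degree)
    results[degree] = (mse(coefs, x_train, y_train), mse(coefs, x_valid, y_valid))
    print(degree, results[degree])

# min() returns the first minimum, so ties go to the lower (simpler) degree.
best_degree = min(results, key=lambda d: results[d][1])
print("selected degree:", best_degree)
```

Training error can only go down as terms are added, which is why the choice is made on the validation error instead.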

Model Comparison:

  • Always start with an initial model that includes all meaningful variables
  • If a large amount of data is missing, consider using the “informative missingness” function
  • Remove variables that are not significant (variable selection)
  • Remove variables that are highly correlated with one another
  • Consider adding interaction effects
  • Consider adding nonlinear effects (polynomial terms)
  • Use a partition to avoid overfitting
  • Consider using “group by” to fit separate models for separate values of a categorical variable
  • Finally, in SAS, use “model comparison” to compare the candidate models and select the best one