Statistical modeling in R is a critical skill for data analysis, allowing for predictive insights. This chapter covers key topics like linear and logistic regression, time series analysis, ARIMA models, and model validation.
lm()logit()glm()fit()ARMAITauto.arima()arima()tslm()lm()vif()vif.lm()vif.stats()calc_vif()| Qno | Answer |
|---|---|
| 1 | b) To predict continuous outcomes |
| 2 | a) lm() |
| 3 | b) The residuals are normally distributed |
| 4 | b) It is used for predicting categorical variables |
| 5 | a) Coefficients and odds ratios |
| 6 | c) Identity |
| 7 | c) They represent the odds ratio for a one-unit change in the predictor |
| 8 | a) The deviance of a model with only the intercept |
| 9 | b) Using deviance or AIC |
| 10 | b) It causes biased estimates of coefficients |
| 11 | a) To predict future values based on historical data |
| 12 | a) Autoregressive Integrated Moving Average |
| 13 | a) AR |
| 14 | b) Integration |
| 15 | a) To transform the data into a stationary series |
| 16 | d) Logistic |
| 17 | a) auto.arima() |
| 18 | a) Using the Augmented Dickey-Fuller (ADF) test |
| 19 | c) Autocorrelation Function (ACF) plot |
| 20 | b) The correlation between observations at different time lags |
| 21 | a) R-squared |
| 22 | a) To evaluate model performance using a subset of data |
| 23 | b) Confusion matrix |
| 24 | a) To estimate the error term |
| 25 | a) The variance of the errors is constant across all levels of the independent variable |
| 26 | a) vif() |
| 27 | b) Q-Q plot |
| 28 | a) To indicate the significance of the model coefficients |
| 29 | a) VIF (Variance Inflation Factor) |
| 30 | a) By using the cross-validation technique |