Background The dataset includes 9 baseline numeric variables…
Background The dataset includes 9 baseline numeric variables: age, body mass index, average blood pressure, and six blood serum measurements for each of n = 442 diabetes patients. The response of interest is a quantitative measure of diabetes disease progression one year after baseline. The dataset is obtained from sklearn.datasets. We will be fitting multiple linear regression models to the train dataset and making predictions on the test dataset. Attribute Information: age: age in years bmi: body mass index bp: average blood pressure s1: tc, total serum cholesterol s2: ldl, low-density lipoproteins s3: hdl, high-density lipoproteins s4: tch, total cholesterol / HDL s5: ltg, possibly log of serum triglycerides level s6: glu, blood sugar level Target: quantitative measure of disease progression one year after baseline (Response variable) Note: All features have NOT been standardized.