Let’s now work with both numerical and categorical features….

Let’s now work with both numerical and categorical features.   Create a predictive model where: The numerical data must be scaled. The categorical data must be one-hot encoded. Use the “infrequent_if_exist” strategy to handle unknown categories. For such purpose, set  `min_frequency=0.05` to group categories concerning less than 5% of the total samples. The predictor is a `LogisticRegressionCV` with same parameters as before,  except that you may need to increase the number of `max_iter`, which is 100 by default. Use the same 10-fold cross-validation strategy with `return_estimator=True` as above to evaluate the accuracy of this new pipeline. By comparing the cross-validation test scores of both models fold-to-fold, count the number of times the model using both numerical and categorical features has a better test score than the model using only numerical features.