Hi class, here I have two doubts about standardization and selection of the hyperparameters.
First, according to previous posts, we are sustrong textpposed to do the scaling after train and test split, fitting the scaler to training set and applying it to both training and test sets. Then I wonder should I develop different scaler for X and Y, or they could be fit and transformed together, or should Y even be standardized? If only X variables scaled, the final testing errors computation would be impacted?
Second, I was trying to test several parameters (for GBR) by grid search, but the runtime is still long even after I’ve reduced number of parameters and values. I wonder are there widely accepted “the most important parameters”? Or it depends, and we can select the ones which are significant in our view?
Thank you in advance!