Backward Looking Transformations

Yingyingz · December 4, 2023, 4:54pm

I’m wondering: if I use backward-looking transformations, what should I do about the missing values they produce? Is it better to drop the column with N/A, or to fill it with zero or the mean?

d.snow · December 6, 2023, 3:46pm

It is generally better to fill it with the mean. If you are using LightGBM models, you can even leave the N/A values in place, since LightGBM handles missing values automatically.
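
A minimal sketch of the two options, assuming a pandas DataFrame and a 3-row rolling mean as the backward-looking transformation. The `price` column, its values, and the window size are made up for illustration:

```python
import pandas as pd

# Hypothetical price series; the values are made up for illustration.
df = pd.DataFrame({"price": [10.0, 11.0, 12.0, 13.0, 14.0, 15.0]})

# Backward-looking feature: mean of the current row and the two before it.
# The first two rows have no full window, so they come out as NaN.
df["roll_mean_3"] = df["price"].rolling(window=3).mean()

# Option 1: fill the gaps with the column mean (.mean() skips NaN by default).
filled = df["roll_mean_3"].fillna(df["roll_mean_3"].mean())

# Option 2: keep the NaN values and pass the column as-is to a model
# that handles missing values itself, such as LightGBM.
untouched = df["roll_mean_3"]

print(filled.tolist())
print(untouched.isna().sum())
```

Here the mean is taken over the whole column for brevity; with a train/test split it would normally be computed on the training rows only.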

saakshimore · December 16, 2023, 7:21am

How should we deal with NaN values when working with deep learning models? We cannot fill them with the mean because of data leakage issues.

d.snow · December 16, 2023, 2:16pm

You should be fine filling them with the mean computed on the training data. Another option is forward filling with df.ffill(), which uses only past information to fill each gap.
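
A rough sketch of both approaches, assuming a pandas DataFrame sorted by time. The `feature` column, its values, and the split point are hypothetical:

```python
import numpy as np
import pandas as pd

# Hypothetical feature with gaps, ordered by time.
df = pd.DataFrame(
    {"feature": [1.0, np.nan, 3.0, 4.0, np.nan, 6.0, np.nan, 8.0]}
)

# Chronological split: the first 5 rows are training data, the rest is test.
split = 5
train = df.iloc[:split].copy()
test = df.iloc[split:].copy()

# Approach 1: compute the mean on the training rows only, then reuse
# that same value for both sets so no test information leaks in.
train_mean = train["feature"].mean()
train_filled = train["feature"].fillna(train_mean)
test_filled = test["feature"].fillna(train_mean)

# Approach 2: forward fill, which copies the last observed value
# forward in time and never looks at future rows.
ffilled = df["feature"].ffill()

print(train_mean)
print(ffilled.tolist())
```

One caveat with forward filling: a NaN at the very start of the series has no earlier value to copy, so it stays NaN and still needs a fallback such as the training mean.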