You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I try to evaluate measures for Feature Importance for a regression. I have data set with highly correlated features. So the betas have a high variance. This makes it difficult to estimate the "true betas" i.e. the true Feature Importance. An Idea was to simulate the data and compare some measures like Permutation Importance or Shapely Value. Therefore, I need the "true Importance" i.e. the population data. For this task, I used your package. However, I only can simulate the training data or the test data but I can´t generate the population data. Is there a solution for this?
The text was updated successfully, but these errors were encountered:
Hi Hadsga, Sorry for the late response. It will definitely be very useful to have variable importance for the population. I figured out that the variable importance for a certain variable is just the change in model error if you remove that variable. So, Here is a function you can use to calculate it. I have tested with variable importance you can get with caret package. With a large number of observation, the values are quite similar.
I try to evaluate measures for Feature Importance for a regression. I have data set with highly correlated features. So the betas have a high variance. This makes it difficult to estimate the "true betas" i.e. the true Feature Importance. An Idea was to simulate the data and compare some measures like Permutation Importance or Shapely Value. Therefore, I need the "true Importance" i.e. the population data. For this task, I used your package. However, I only can simulate the training data or the test data but I can´t generate the population data. Is there a solution for this?
The text was updated successfully, but these errors were encountered: