Can I pool parameter estimates of multiple imputed data obtained through10-fold cross-validation? #398
Replies: 3 comments 1 reply
-
@paulinavonstackelberg Could you shine your light on this, perhaps with a reference to code? |
Beta Was this translation helpful? Give feedback.
-
Hi Liu, |
Beta Was this translation helpful? Give feedback.
-
Sorry for the confusion in the message.
Subsequently, I perform multiple linear regression in the imputed training dataset and testing dataset (i.e, with the same variables but different observations), respectively.
Here, I am just wondering that could I direct use the Rubin's Rules to combine the effect estimates (i.e., beta coefficients) and derive the p values in the 5 training datasets, 5 testing datasets or 10 datasets (including all training and testing datasets)? In other words, if the observations of two datasets (here, training and testing) are different, could I use Rubin's Rules to combine the effect estimates?
Thank you |
Beta Was this translation helpful? Give feedback.
-
I performed 10-fold cross-validation on mice,using
'ignore'
parameter inmice
package, 5 datasets were obtained each time, and a total of 50 imputed datasets were obtained. I want to do linear regression using these 50 datasets .Can I use Rubin's Rules to directly combine the effect estimates(β) of 50 datasets and then derive confidence intervals and p-values? or only every 5 datasets ( imputed by the same training set and test set ) can be combined?
Thank you,
Liu.
Beta Was this translation helpful? Give feedback.
All reactions