combine_boot.Rd
combine_boot
function combines several imputed bootstrapped dataframes
into the final imputed dataframe and provide the variance for each imputed
value.
A list of imputed bootstrapped dataframes.
Continous columns index.
Discret columns index.
Categorical columns index.
Number of rows in the original incomplete dataframe before bootstrapping.
The encoded method of categorical columns in the imputed
dataframes.
This function is coded for both "onehot" and "factor" encoded situations.
When method
= 'onehot', combine_boot
averages the probability
vectors over the $B$ imputed datasets for the same observation,
then choose the position of maximum probability as the predicted category in
the final result.
When method
= 'factor', for each observation, combine_boot
choose the mode value over the imputed dataframes as the predicted category.
The dictionary of categorical columns names if "onehot" method is applied. For example, it could be list("Y7"=c("Y7_1","Y7_2"), "Y8"=c("Y8_1","Y8_2","Y8_3")).
The method of variance calculation for the categorical columns. "unalike" will lead to the calculation of unalikeability, while "wilcox_va" will lead to the calculation of Wilcox index: VarNC.
df_result_disj
The final imputed dataframe with the
categorical columns in onehot form.
df_result_var_disj
The variance matrix for the final
imputation dataframe with the categorical columns in onehot form.
df_result
The final imputed dataframe with the categorical
columns in factor form.
df_result_var
The variance matrix for the final imputation
dataframe with the categorical columns in factor form.
Statistical Analysis with Missing Data, by Little and Rubin, 2002