pandora.imputation module
- pandora.imputation.impute_data(input_data: ndarray[Any, dtype[_ScalarType_co]], imputation: str | None, missing_value: float | int = nan) ndarray[Any, dtype[_ScalarType_co]][source]
Imputes missing values in the given input data using the given imputation strategy.
- Parameters:
- input_datanpt.NDArray
Numpy array containing the input data to impute. Missing values are expected to be np.NaN.
- imputationOptional[str]
Imputation method to use. Available options are:
"mean": Imputes missing values with the average of the respective column."remove": Removes all columns with at least one missing value.None: Does not impute the given data.
- missing_valueUnion[float, int], default=np.nan
Value to treat as missing value.
- Returns:
- imputed_datanpt.NDArray
Imputed input data with imputation according to the specified method.
- Raises:
- PandoraException
If no data is left in case of
"remove"imputation strategy. That means that all columns in the input data contained at least one missing value.If the imputation method is not supported.