Tags: theislab/ehrapy
Tags
Datatype Support in Quality Control and Impute (#865) * Enhancement: Add Dask support for explicit imputation * Enhancement: Add Dask support for quality control metrics and imputation tests * Fix test for imputation to handle Dask arrays without raising errors * Refactor quality control metrics functions to streamline computation and improve readability * added expected error * Remove unused Dask import from quality control module * simplify missing value computation * Rename parameter 'arr' to 'mtx' in _compute_obs_metrics no longer creates copy * daskify qc_metrics * Add fixture for array types and update imputation tests for dask arrays * Refactor _compute_var_metrics to prevent modification of the original data matrix and add a test for encoding mode integrity * Add parameterized tests for array types in miceforest imputation * Update missing values handling to include array type in error message and refine parameterized tests for miceforest imputation * Fix array type handling in missing values computation and update test for miceforest imputation * Implement array type handling in load_dataframe function and update tests for miceforest imputation * Remove parameterization for array types in miceforest numerical data imputation test * Update tests/preprocessing/test_quality_control.py Co-authored-by: Eljas Roellin <65244425+eroell@users.noreply.github.com> * Update tests/preprocessing/test_quality_control.py Co-authored-by: Eljas Roellin <65244425+eroell@users.noreply.github.com> * revert deepcopy changes * Fix test to ensure original matrix is not modified after encoding * Remove unused parameters from observation and variable metrics computation functions * Add sparse.csr_matrix to explicit impute array types test case * Parameterize quality control metrics tests to support multiple array types * Remove unused imports from test_quality_control.py * encode blocks dask function * Add pytest fixtures for observation and variable data in tests * Update tests/preprocessing/test_quality_control.py Co-authored-by: Eljas Roellin <65244425+eroell@users.noreply.github.com> * Update tests/preprocessing/test_quality_control.py Co-authored-by: Eljas Roellin <65244425+eroell@users.noreply.github.com> * support dask explicit impute all object types --------- Co-authored-by: eroell <eljas.roellin@ikmail.com> Co-authored-by: Lukas Heumos <lukas.heumos@posteo.net> Co-authored-by: Eljas Roellin <65244425+eroell@users.noreply.github.com>
PreviousNext