Changelog#
All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
[0.2.1]#
Fixed#
Compatibility with anndata>=0.12.13 (#240) @eroell
[0.2.0]#
Fixed#
Modified#
infer_feature_types() considers integers from 0, …, n as numeric. It also provides a new argument, binary_as, to control whether 0/1 columns are considered numeric or categorical. (#231) @eroell
[0.1.2]#
Added#
from_pandas() with format='long' provides a new keyword argument, fill_time_gaps, that fills missing time gaps in the common case of integer time steps from 0 to n_timesteps (#229) @eroell
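To illustrate what filling time gaps means for long-format data, here is a minimal sketch in plain Python. It is not the library's implementation: the helper densify_long_records, the (obs_id, time, value) record layout, and the reading of the time range as steps 0 to n_timesteps - 1 are all assumptions made for illustration.

```python
import math


def densify_long_records(records, n_timesteps):
    """Hypothetical helper, not part of ehrdata.

    Turns long-format (obs_id, time, value) records into one dense row per
    observation, with NaN at every integer time step that has no record.
    Assumes time steps run from 0 to n_timesteps - 1.
    """
    dense = {}
    for obs_id, t, value in records:
        # Create an all-NaN row the first time an observation is seen.
        row = dense.setdefault(obs_id, [math.nan] * n_timesteps)
        row[t] = value
    return dense


records = [
    ("patient_a", 0, 36.6),
    ("patient_a", 2, 37.1),  # time step 1 is missing
    ("patient_b", 1, 38.0),  # time steps 0 and 2 are missing
]
dense = densify_long_records(records, n_timesteps=3)
```

After this step every observation has the same number of time steps, so the rows can be stacked into a regular array.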
Modified#
Fixed#
from_pandas() with format='long' misordered entries in .X/.layers relative to .obs if the input DataFrame was not sorted by the obs id keys; this is now fixed. (#228) @eroell
Documentation#
Documentation style polishing (#223) @zethson
[0.1.1]#
Added#
setup_connection() can read .parquet files. (#217) @eroell
Fixed#
Slicing of EHRData objects fixed when the backing object is an AnnData. (#218) @eroell
Maintenance#
More concise messages in infer_feature_types(). (#215) @zethson
[0.1.0]#
Added#
move_to_obs() and move_to_x() are new helpers for conveniently moving variables from central 2D arrays to the .obs field, and vice versa. (#199) @eroell
physionet2019() as another out-of-the-box, conveniently available dataset with 40’000 ICU stays from the Physionet 2019 challenge. (#204) @eroell
time_precision parameter ("date" or "datetime") to setup_variables() and setup_interval_variables() for finer temporal granularity control. (#210) @eroell
Fixed#
read_h5ad() fixed issues when backed=True. (#199) @eroell
read_h5ad() fixed bug when .X is None and harmonize_missing_features is True. (#206) @eroell
setup_obs() with observation_table="person_visit_occurrence" now supports multiple visits per patient, creating one row per visit with unique observation IDs, instead of failing with xarray conversion errors with non-unique indices. (#210) @eroell
OMOP time interval boundaries now use half-open intervals [start, end) to prevent duplicate measurements at interval boundaries. (#210) @eroell
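The effect of half-open intervals can be shown with a small, self-contained sketch. The function assign_interval and the example boundaries are hypothetical and only illustrate the general idea; they are not taken from the OMOP extraction code.

```python
def assign_interval(t, boundaries):
    """Return the index i such that boundaries[i] <= t < boundaries[i + 1].

    Returns None if t falls outside every interval.
    """
    for i in range(len(boundaries) - 1):
        if boundaries[i] <= t < boundaries[i + 1]:
            return i
    return None


# Two consecutive intervals: [0, 24) and [24, 48).
boundaries = [0, 24, 48]

# With closed intervals [start, end], a measurement at t=24 matches both.
closed_matches = [
    i
    for i in range(len(boundaries) - 1)
    if boundaries[i] <= 24 <= boundaries[i + 1]
]

# With half-open intervals [start, end), it matches only the second one.
half_open_match = assign_interval(24, boundaries)
```

Here closed_matches contains both interval indices, while half_open_match is a single index, so a measurement taken exactly on a boundary is counted once.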
Maintenance#
Modified#
Dataset generator function ed.dt.ehrdata_blobs now takes n_cat_var and n_categories arguments to generate categorical (integer encoded) time series data (#207) @sueoglu
If enrich_var_with_feature_info=True in setup_variables() and setup_interval_variables(), data_table_concept_ids not included within the concept table are now mapped from their respective alternate concept_id included in the concept_relationship table to retrieve the available feature information. (#205) @KilianDahm
setup_variables() and setup_interval_variables() with use of "person" now check birth_datetime for meaningful behaviour and error messages. (#210) @eroell
gen_default_config() provides convenience to generate a config directly from an EHRData object, and should be used instead of the previous ehrdata.integrations.vitessce.gen_config. (#211) @eroell
[0.0.10]#
EHRData drops the .R field, and now supports 3D data storage in any slot of .layers. See the Getting started with EHRData tutorial for an introduction to this behaviour. In a future release, .X will support 3D data storage as well.
Maintenance#
Enhanced Getting started with EHRData (#184) @eroell
Move from zarr<3 to zarr>=3 (#185) @eroell
Fixed#
Modified#
EHRData drops the .R field in favor of using .layers for any 3D data arrays (#184) @eroell
EHRData’s shape property will always return a 3 dimensional shape. If an EHRData object has flat arrays only, the third dimension will be 1. (#184) @eroell
The following functions now take a layer argument: read_csv(), from_pandas(), to_pandas(), setup_variables(), setup_interval_variables(), ehrdata_blobs(), physionet2012(). If it is left at its default, None, the .X field of EHRData is used. Since .X is 2D in this release, in cases with 3D data, the layer argument needs to be used. (#184) @eroell
write_zarr() now writes an EHRData specific store encoding, with anndata as a substore. This change allows using AnnData with its change to consolidated Zarr metadata, and better isolates AnnData’s I/O. (#185) @eroell
read_zarr() is adapted to read the new store encoding, and can also deal with AnnData stores. (#185) @eroell
[0.0.9]#
Maintenance#
Fixed#
Fix order of var created in ed.io.omop.setup_variables and ed.io.omop.setup_interval_variables (#179) @eroell
Modified#
[0.0.8]#
Fixed#
Update duckdb imports for future (#157) @eroell
Maintenance#
[0.0.7]#
Fixed#
Fix tests and Getting Started Notebook (#155) @eroell
Maintenance#
Update duckdb imports for future (#155) @eroell
[0.0.6]#
Fixed#
Cleaned up and updated tutorial notebooks (#140) @agerardy
Added#
read_csv(): Reads a csv file (#136) @eroell
read_h5ad(): Reads an h5ad file (#136) @eroell
read_zarr(): Reads a zarr file (#136) @eroell
write_h5ad(): Writes an h5ad file (#136) @eroell
write_zarr(): Writes a zarr file (#136) @eroell
from_pandas(): Transform a given DataFrame into an EHRData object (#136) @eroell
to_pandas(): Transform an EHRData object into a DataFrame (#136) @eroell
mimic_2_preprocessed(): Loads the preprocessed MIMIC-II dataset (#136) @eroell
diabetes_130_raw(): Loads the raw diabetes-130 dataset (#136) @eroell
diabetes_130_fairlearn(): Loads the preprocessed diabetes-130 dataset by fairlearn (#136) @eroell
infer_feature_types(): Infer feature types in an EHRData object (#136) @eroell
feature_type_overview(): Overview of inferred feature types (#136) @eroell
replace_feature_types(): Replacing inferred feature types (#136) @eroell
harmonize_missing_values(): Harmonize missing values in an EHRData object (#136) @eroell
[0.0.5]#
Fixed#
Initialize EHRData with X and layers (#132) @eroell
Added#
Modified#
Rename .t attribute to .tem
[0.0.4]#
Fixed#
Constrained the zarr version to less than 3
[0.0.3]#
Fixed#
Added missing zarr dependency
[0.0.2]#
Added#
Expanded documentation
Improved OMOP Extraction
Support for COO sparse matrices for R
An ed.dt.ehrdata_blobs test data generator function
Replace -1 encoded missing values with nans in physionet2012 challenge data
Breaking changes#
Renamed r to R
[0.0.1] - 2024-11-04#
Added#
Initial release
[Unreleased]#
Added#
Basic tool, preprocessing and plotting functions
Fixed#
Tutorial notebooks updated to align with breaking changes