scglue.data.aggregate_var

scglue.data.aggregate_var(adata, by, X_agg='sum', var_agg=None, varm_agg=None, layers_agg=None, separator=',', nan_sparse=False)[source]

Aggregate var in a given dataset by certain categories

Parameters:
  • adata (AnnData) – Dataset to be aggregated

  • by (str) – Specify a column in adata.var used for aggregation, must be discrete.

  • X_agg (typing.Optional[str]) – Aggregation function for adata.X, must be one of {"sum", "mean", ``None}``. Setting to None discards the adata.X matrix.

  • var_agg (typing.Optional[typing.Mapping[str, str]]) – Aggregation methods for adata.var, indexed by var columns, must be one of {"sum", "mean", "majority"}, where "sum" and "mean" are for continuous data, and "majority" is for discrete data. Fields not specified will be discarded.

  • varm_agg (typing.Optional[typing.Mapping[str, str]]) – Aggregation methods for adata.varm, indexed by varm keys, must be one of {"sum", "mean"}. Fields not specified will be discarded.

  • layers_agg (typing.Optional[typing.Mapping[str, str]]) – Aggregation methods for adata.layers, indexed by layer keys, must be one of {"sum", "mean"}. Fields not specified will be discarded.

  • separator (str) – Separator between multiple values in the groupby column

  • nan_sparse (bool) – Whether missing entries in sparse matrix indicate nan

Return type:

AnnData

Returns:

aggregated – Aggregated dataset