tomas.infer.get_dbl_mg_bc

tomas.infer.get_dbl_mg_bc(adata, groupby, dblgroup, output=None, num_mg=100, kl_cutoff=None, merging_threshold=5, skip_threshold=2, alphaMin=1)

Extract metagenes from raw UMI counts of a certain type of heterotypic doublets. :param adata: The (annotated) UMI count matrix of shape n_obs × n_vars.

Rows correspond to droplets and columns to genes.

Parameters:
  • groupby (str) – The key of the droplet categories stored in adata.obs.

  • groups (list of strings) – Specify two droplet categories, e.g. [‘Homo-ct1’, ‘Homo-ct2’] annotated in adata.obs[groupby], which form the hetero-dbl of interest.

  • output (path) – Path to save the results.

  • num_mg (int, optional) – Number of exclusive meta-genes. The default is 100.

  • kl_cutoff (float, optional) – DESCRIPTION. The default is 1.

  • merging_threshold (float, optional) – Stop merging when alpha sum of metagenes exceeds the threshold in conterpart celltype. The default is 5.

  • skip_threshold (float, optional) – If adding a gene into current metegene leads to ‘skip_threshold’-fold change of the std of alpha values, this gene is skipped. The default is 2.

  • alphaMin (float, optional) – Individual genes with alpha greater than the threshold are skipped in mergging step and considerted as a specifial metagene with one member gene. The default is 1.

Returns:

adata_dbl_mg_top – The UMI count matrix of shape n_obs × n_metangenes. Rows correspond to droplets and columns to metagenes.

Return type:

AnnData