Convert to an AnnData object

Convert other objects to an AnnData object. See the sections below for details on how slots are mapped between objects. For more information on the functionality of an AnnData object, see AnnData-usage.

Usage

as_AnnData(
  x,
  x_mapping = NULL,
  layers_mapping = TRUE,
  obs_mapping = TRUE,
  var_mapping = TRUE,
  obsm_mapping = TRUE,
  varm_mapping = TRUE,
  obsp_mapping = TRUE,
  varp_mapping = TRUE,
  uns_mapping = TRUE,
  assay_name = NULL,
  output_class = c("InMemory", "HDF5AnnData"),
  ...
)

# S3 method for class 'SingleCellExperiment'
as_AnnData(
  x,
  x_mapping = NULL,
  layers_mapping = TRUE,
  obs_mapping = TRUE,
  var_mapping = TRUE,
  obsm_mapping = TRUE,
  varm_mapping = TRUE,
  obsp_mapping = TRUE,
  varp_mapping = TRUE,
  uns_mapping = TRUE,
  assay_name = TRUE,
  output_class = c("InMemory", "HDF5AnnData"),
  ...
)

# S3 method for class 'Seurat'
as_AnnData(
  x,
  x_mapping = NULL,
  layers_mapping = TRUE,
  obs_mapping = TRUE,
  var_mapping = TRUE,
  obsm_mapping = TRUE,
  varm_mapping = TRUE,
  obsp_mapping = TRUE,
  varp_mapping = TRUE,
  uns_mapping = TRUE,
  assay_name = NULL,
  output_class = c("InMemory", "HDF5AnnData"),
  ...
)

Arguments

x: The object to convert
x_mapping: A string specifying the data to map to the X slot. If NULL, no data will be copied to the X slot.
layers_mapping: A named character vector where the names are keys of layers in the new AnnData object and values are the names of items in the corresponding slot of x. If TRUE (the default), items are chosen as described in the conversion tables below for the class of x.
obs_mapping: A named character vector where the names are names of obs columns in the new AnnData object and values are the names of columns in the corresponding slot of x. If TRUE (the default), columns are chosen as described in the conversion tables below for the class of x.
var_mapping: A named character vector where the names are names of var columns in the new AnnData object and values are the names of columns in the corresponding slot of x. If TRUE (the default), columns are chosen as described in the conversion tables below for the class of x.
obsm_mapping: A named character vector where the names are keys of obsm in the new AnnData object and values are the names of items in the corresponding slot of x. If TRUE (the default), items are chosen as described in the conversion tables below for the class of x.
varm_mapping: A named character vector where the names are keys of varm in the new AnnData object and values are the names of items in the corresponding slot of x. If TRUE (the default), items are chosen as described in the conversion tables below for the class of x.
obsp_mapping: A named character vector where the names are keys of obsp in the new AnnData object and values are the names of items in the corresponding slot of x. If TRUE (the default), items are chosen as described in the conversion tables below for the class of x.
varp_mapping: A named character vector where the names are keys of varp in the new AnnData object and values are the names of items in the corresponding slot of x. If TRUE (the default), items are chosen as described in the conversion tables below for the class of x.
uns_mapping: A named character vector where the names are keys of uns in the new AnnData object and values are the names of items in the corresponding slot of x. If TRUE (the default), items are chosen as described in the conversion tables below for the class of x.
assay_name: For SeuratObject::Seurat objects, the name of the assay to be converted. If NULL, the default assay will be used (SeuratObject::DefaultAssay()). This is ignored for other objects.
output_class: The AnnData class to convert to. Must be one of "HDF5AnnData" or "InMemoryAnnData".
...: Additional arguments passed to the generator function for output_class

Value

An AnnData object of the class requested by output_class containing the data specified in the mapping arguments.

Details of mapping arguments

All mapping arguments except for x_mapping expect a named character vector where names are the keys of the slot in the AnnData object and values are the names of items in the corresponding slot of x. If TRUE, the conversion function will guess which items to copy as described in the conversion tables for each object type. In most cases, the default is to copy all items using the same names except where the correspondence between objects is unclear. To avoid copying anything to a slot, set the mapping argument to FALSE. Empty mapping arguments (NULL, c(), list()) will be treated as FALSE with a warning. If an unnamed vector is provided, the values will be used as names.

TRUE will guess which items to copy as described in the conversion tables for each object type
c(adata_item = "x_item") will copy x_item from the slot in x to adata_item in the corresponding slot of new AnnData object
FALSE will avoid copying anything to the slot
c("x_item") is equivalent to c(x_item = "x_item")
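
The naming rule for mapping vectors can be illustrated in base R. This is only a sketch: `fill_mapping_names()` is a hypothetical helper written for this example, not a function from the package, and it covers only the fully unnamed case described above.

```r
# Hypothetical helper illustrating the rule "if an unnamed vector is
# provided, the values will be used as names". Not the package's own code.
fill_mapping_names <- function(mapping) {
  if (is.null(names(mapping))) {
    # Fully unnamed vector: each value doubles as its own name
    names(mapping) <- mapping
  }
  mapping
}

unnamed <- fill_mapping_names(c("x_item"))
named <- c(x_item = "x_item")

# Both forms describe the same mapping
identical(unnamed, named)

# For a renaming mapping, the names are the keys in the new AnnData
# object and the values are the items looked up in x
renaming <- c(adata_item = "x_item")
names(renaming)
unname(renaming)
```

Reading a mapping as "name = destination, value = source" is the convention used by every mapping argument except x_mapping, which takes a single string.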

Converting from a `SingleCellExperiment` object

This table describes how slots in a SingleCellExperiment::SingleCellExperiment object are mapped to the new AnnData object.

From `SingleCellExperiment`	To `AnnData`	Example mapping argument	Default
`assays(x)`	`adata$X`	`x_mapping = "counts"`	Nothing is copied to `X`
`assays(x)`	`adata$layers`	`layers_mapping = c(counts = "counts")`	All items are copied by name
`colData(x)`	`adata$obs`	`obs_mapping = c(n_counts = "n_counts", cell_type = "CellType")`	All columns are copied by name
`rowData(x)`	`adata$var`	`var_mapping = c(n_cells = "n_cells", pct_zero = "PctZero")`	All columns are copied by name
`reducedDims(x)`	`adata$obsm`	`obsm_mapping = c(X_pca = "pca")`	All items are copied by name
`featureLoadings(reducedDims(x))`	`adata$varm`	`varm_mapping = c(PCs = "pca")`	Feature loadings from all `SingleCellExperiment::LinearEmbeddingMatrix` objects in `reducedDims(x)`
`colPairs(x)`	`adata$obsp`	`obsp_mapping = c(connectivities = "RNA_nn")`	All items are copied by name
`rowPairs(x)`	`adata$varp`	`varp_mapping = c(similarities = "gene_overlaps")`	All items are copied by name
`metadata(x)`	`adata$uns`	`uns_mapping = c(metadata = "project_metadata")`	All items are copied by name
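
As an illustration of the table above, the following sketch converts a small SingleCellExperiment object with explicit mappings. It assumes that `as_AnnData()` is provided by the anndataR package and that both anndataR and SingleCellExperiment are installed; the target names `cell_id` and `gene_id` are made up for the example.

```r
library(anndataR) # assumed to be the package that provides as_AnnData()
library(SingleCellExperiment)

# A toy object with 5 genes and 3 cells
sce <- SingleCellExperiment(
  assays = list(counts = matrix(1:5, 5L, 3L)),
  colData = DataFrame(cell = 1:3, row.names = paste0("Cell", 1:3)),
  rowData = DataFrame(gene = 1:5, row.names = paste0("Gene", 1:5))
)

adata <- as_AnnData(
  sce,
  x_mapping = "counts",              # copy the "counts" assay to X
  layers_mapping = FALSE,            # copy nothing to layers
  obs_mapping = c(cell_id = "cell"), # rename colData column "cell"
  var_mapping = c(gene_id = "gene")  # rename rowData column "gene"
)

adata
```

Setting layers_mapping to FALSE here avoids storing the same assay in both X and layers; leaving it at TRUE would copy all assays by name.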

Converting from a `Seurat` object

Only one assay can be converted from a SeuratObject::Seurat object to an AnnData object at a time. This can be controlled using the assay_name argument. By default, the current default assay will be used.

This table describes how slots in a SeuratObject::Seurat object are mapped to the new AnnData object.

From `Seurat`	To `AnnData`	Example mapping argument	Default
`Layers(x)`	`adata$X`	`x_mapping = "counts"`	Nothing is copied to `X`
`Layers(x)`	`adata$layers`	`layers_mapping = c(counts = "counts")`	All items are copied by name
`x[[]]`	`adata$obs`	`obs_mapping = c(n_counts = "n_counts", cell_type = "CellType")`	All columns are copied by name
`x[[assay_name]][[]]`	`adata$var`	`var_mapping = c(n_cells = "n_cells", pct_zero = "PctZero")`	All columns are copied by name
`Embeddings(x)`	`adata$obsm`	`obsm_mapping = c(X_pca = "pca")`	All embeddings matching `assay_name` are copied by name
`Loadings(x)`	`adata$varm`	`varm_mapping = c(PCs = "pca")`	All valid loadings are copied by name
`Graphs(x)`	`adata$obsp`	`obsp_mapping = c(connectivities = "RNA_nn")`	All graphs matching `assay_name` are copied by name
`Misc(x)`	`adata$varp`	`varp_mapping = c(similarities = "gene_overlaps")`	No data is copied to `varp`
`Misc(x)`	`adata$uns`	`uns_mapping = c(metadata = "project_metadata")`	All items are copied by name
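
A minimal sketch of a Seurat conversion with explicit mappings follows. It assumes that `as_AnnData()` comes from the anndataR package and that Seurat is installed; the target names `normalized` and `n_counts` are chosen for illustration only.

```r
library(anndataR) # assumed to be the package that provides as_AnnData()
library(Seurat)

# A toy object with 100 features and 20 cells in the default "RNA" assay
counts <- matrix(rbinom(2000, 1000, .001), nrow = 100)
obj <- CreateSeuratObject(counts = counts)
obj <- NormalizeData(obj)

adata <- as_AnnData(
  obj,
  assay_name = "RNA",                      # the assay to convert
  x_mapping = "counts",                    # copy the "counts" layer to X
  layers_mapping = c(normalized = "data"), # copy "data" as "normalized"
  obs_mapping = c(n_counts = "nCount_RNA") # rename one metadata column
)

adata
```

Because only one assay is converted per call, an object with several assays would need one call per assay, each with its own assay_name.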

Examples

# Convert a Seurat object to an AnnData object
library(Seurat)
#> Loading required package: SeuratObject
#> Loading required package: sp
#> ‘SeuratObject’ was built under R 4.5.0 but the current version is
#> 4.5.1; it is recomended that you reinstall ‘SeuratObject’ as the ABI
#> for R may have changed
#> 
#> Attaching package: ‘SeuratObject’
#> The following objects are masked from ‘package:base’:
#> 
#>     intersect, t

counts <- matrix(rbinom(20000, 1000, .001), nrow = 100)
obj <- CreateSeuratObject(counts = counts)
#> Warning: Data is of class matrix. Coercing to dgCMatrix.
obj <- NormalizeData(obj)
#> Normalizing layer: counts
obj <- FindVariableFeatures(obj)
#> Finding variable features for layer counts
obj <- ScaleData(obj)
#> Centering and scaling data matrix
obj <- RunPCA(obj, npcs = 10L)
#> PC_ 1 
#> Positive:  Feature55, Feature29, Feature66, Feature72, Feature44, Feature46, Feature2, Feature8, Feature60, Feature64 
#> 	   Feature26, Feature50, Feature81, Feature91, Feature11, Feature13, Feature1, Feature80, Feature30, Feature61 
#> 	   Feature47, Feature37, Feature53, Feature96, Feature34, Feature27, Feature82, Feature42, Feature48, Feature75 
#> Negative:  Feature3, Feature23, Feature21, Feature33, Feature43, Feature9, Feature40, Feature56, Feature4, Feature12 
#> 	   Feature16, Feature62, Feature28, Feature68, Feature17, Feature31, Feature89, Feature94, Feature6, Feature58 
#> 	   Feature41, Feature24, Feature84, Feature87, Feature100, Feature86, Feature49, Feature54, Feature97, Feature99 
#> PC_ 2 
#> Positive:  Feature35, Feature39, Feature83, Feature92, Feature27, Feature85, Feature12, Feature62, Feature38, Feature53 
#> 	   Feature15, Feature26, Feature56, Feature6, Feature16, Feature45, Feature61, Feature55, Feature99, Feature2 
#> 	   Feature48, Feature47, Feature49, Feature80, Feature5, Feature70, Feature25, Feature50, Feature22, Feature57 
#> Negative:  Feature91, Feature7, Feature63, Feature42, Feature1, Feature75, Feature76, Feature17, Feature88, Feature20 
#> 	   Feature40, Feature43, Feature72, Feature82, Feature73, Feature87, Feature78, Feature46, Feature66, Feature34 
#> 	   Feature65, Feature21, Feature44, Feature4, Feature23, Feature14, Feature13, Feature19, Feature29, Feature86 
#> PC_ 3 
#> Positive:  Feature38, Feature97, Feature84, Feature50, Feature13, Feature63, Feature8, Feature75, Feature52, Feature80 
#> 	   Feature96, Feature24, Feature86, Feature62, Feature17, Feature2, Feature61, Feature36, Feature35, Feature76 
#> 	   Feature40, Feature64, Feature58, Feature25, Feature90, Feature57, Feature28, Feature26, Feature70, Feature42 
#> Negative:  Feature87, Feature44, Feature34, Feature72, Feature6, Feature30, Feature73, Feature92, Feature60, Feature15 
#> 	   Feature14, Feature83, Feature95, Feature18, Feature3, Feature32, Feature5, Feature59, Feature4, Feature43 
#> 	   Feature29, Feature7, Feature85, Feature100, Feature46, Feature68, Feature33, Feature98, Feature67, Feature79 
#> PC_ 4 
#> Positive:  Feature14, Feature57, Feature77, Feature68, Feature27, Feature12, Feature75, Feature47, Feature87, Feature46 
#> 	   Feature94, Feature42, Feature9, Feature16, Feature37, Feature66, Feature81, Feature33, Feature70, Feature1 
#> 	   Feature36, Feature48, Feature10, Feature62, Feature45, Feature39, Feature52, Feature67, Feature30, Feature97 
#> Negative:  Feature49, Feature98, Feature90, Feature79, Feature99, Feature91, Feature18, Feature78, Feature5, Feature23 
#> 	   Feature74, Feature35, Feature95, Feature72, Feature19, Feature100, Feature51, Feature53, Feature17, Feature24 
#> 	   Feature44, Feature58, Feature64, Feature65, Feature20, Feature29, Feature60, Feature54, Feature71, Feature96 
#> PC_ 5 
#> Positive:  Feature71, Feature3, Feature34, Feature74, Feature32, Feature16, Feature96, Feature15, Feature59, Feature66 
#> 	   Feature76, Feature100, Feature61, Feature65, Feature13, Feature84, Feature89, Feature77, Feature44, Feature12 
#> 	   Feature57, Feature50, Feature81, Feature19, Feature69, Feature7, Feature88, Feature70, Feature5, Feature68 
#> Negative:  Feature45, Feature93, Feature39, Feature80, Feature24, Feature20, Feature30, Feature9, Feature8, Feature18 
#> 	   Feature95, Feature54, Feature51, Feature60, Feature23, Feature73, Feature79, Feature62, Feature92, Feature64 
#> 	   Feature4, Feature63, Feature14, Feature72, Feature36, Feature10, Feature17, Feature27, Feature85, Feature11 
obj <- FindNeighbors(obj)
#> Computing nearest neighbor graph
#> Computing SNN
obj <- RunUMAP(obj, dims = 1:10)
#> Warning: The default method for RunUMAP has changed from calling Python UMAP via reticulate to the R-native UWOT using the cosine metric
#> To use Python UMAP via reticulate, set umap.method to 'umap-learn' and metric to 'correlation'
#> This message will be shown once per session
#> 09:45:32 UMAP embedding parameters a = 0.9922 b = 1.112
#> 09:45:32 Read 200 rows and found 10 numeric columns
#> 09:45:32 Using Annoy for neighbor search, n_neighbors = 30
#> 09:45:32 Building Annoy index with metric = cosine, n_trees = 50
#> 0%   10   20   30   40   50   60   70   80   90   100%
#> [----|----|----|----|----|----|----|----|----|----|
#> **************************************************|
#> 09:45:32 Writing NN index file to temp file /tmp/Rtmp7QZ13X/file663d54b1b2fc
#> 09:45:32 Searching Annoy index using 1 thread, search_k = 3000
#> 09:45:32 Annoy recall = 100%
#> 09:45:33 Commencing smooth kNN distance calibration using 1 thread
#>  with target n_neighbors = 30
#> 09:45:34 Initializing from normalized Laplacian + noise (using RSpectra)
#> 09:45:34 Commencing optimization for 500 epochs, with 6130 positive edges
#> 09:45:34 Using rng type: pcg
#> 09:45:35 Optimization finished

as_AnnData(obj)
#> AnnData object with n_obs × n_vars = 200 × 100
#>     obs: 'orig.ident', 'nCount_RNA', 'nFeature_RNA'
#>     var: 'vf_vst_counts_mean', 'vf_vst_counts_variance', 'vf_vst_counts_variance.expected', 'vf_vst_counts_variance.standardized', 'vf_vst_counts_variable', 'vf_vst_counts_rank', 'var.features', 'var.features.rank'
#>     obsm: 'pca', 'umap'
#>     varm: 'pca'
#>     layers: 'counts', 'data', 'scale.data'
#>     obsp: 'nn', 'snn'

# Convert a SingleCellExperiment object to an AnnData object
library(SingleCellExperiment)
#> Loading required package: SummarizedExperiment
#> Loading required package: MatrixGenerics
#> Loading required package: matrixStats
#> 
#> Attaching package: ‘MatrixGenerics’
#> The following objects are masked from ‘package:matrixStats’:
#> 
#>     colAlls, colAnyNAs, colAnys, colAvgsPerRowSet, colCollapse,
#>     colCounts, colCummaxs, colCummins, colCumprods, colCumsums,
#>     colDiffs, colIQRDiffs, colIQRs, colLogSumExps, colMadDiffs,
#>     colMads, colMaxs, colMeans2, colMedians, colMins, colOrderStats,
#>     colProds, colQuantiles, colRanges, colRanks, colSdDiffs, colSds,
#>     colSums2, colTabulates, colVarDiffs, colVars, colWeightedMads,
#>     colWeightedMeans, colWeightedMedians, colWeightedSds,
#>     colWeightedVars, rowAlls, rowAnyNAs, rowAnys, rowAvgsPerColSet,
#>     rowCollapse, rowCounts, rowCummaxs, rowCummins, rowCumprods,
#>     rowCumsums, rowDiffs, rowIQRDiffs, rowIQRs, rowLogSumExps,
#>     rowMadDiffs, rowMads, rowMaxs, rowMeans2, rowMedians, rowMins,
#>     rowOrderStats, rowProds, rowQuantiles, rowRanges, rowRanks,
#>     rowSdDiffs, rowSds, rowSums2, rowTabulates, rowVarDiffs, rowVars,
#>     rowWeightedMads, rowWeightedMeans, rowWeightedMedians,
#>     rowWeightedSds, rowWeightedVars
#> Loading required package: GenomicRanges
#> Loading required package: stats4
#> Loading required package: BiocGenerics
#> Loading required package: generics
#> 
#> Attaching package: ‘generics’
#> The following objects are masked from ‘package:base’:
#> 
#>     as.difftime, as.factor, as.ordered, intersect, is.element, setdiff,
#>     setequal, union
#> 
#> Attaching package: ‘BiocGenerics’
#> The following objects are masked from ‘package:stats’:
#> 
#>     IQR, mad, sd, var, xtabs
#> The following objects are masked from ‘package:base’:
#> 
#>     Filter, Find, Map, Position, Reduce, anyDuplicated, aperm, append,
#>     as.data.frame, basename, cbind, colnames, dirname, do.call,
#>     duplicated, eval, evalq, get, grep, grepl, is.unsorted, lapply,
#>     mapply, match, mget, order, paste, pmax, pmax.int, pmin, pmin.int,
#>     rank, rbind, rownames, sapply, saveRDS, table, tapply, unique,
#>     unsplit, which.max, which.min
#> Loading required package: S4Vectors
#> 
#> Attaching package: ‘S4Vectors’
#> The following object is masked from ‘package:utils’:
#> 
#>     findMatches
#> The following objects are masked from ‘package:base’:
#> 
#>     I, expand.grid, unname
#> Loading required package: IRanges
#> 
#> Attaching package: ‘IRanges’
#> The following object is masked from ‘package:sp’:
#> 
#>     %over%
#> Loading required package: GenomeInfoDb
#> Loading required package: Biobase
#> Welcome to Bioconductor
#> 
#>     Vignettes contain introductory material; view with
#>     'browseVignettes()'. To cite Bioconductor, see
#>     'citation("Biobase")', and for packages 'citation("pkgname")'.
#> 
#> Attaching package: ‘Biobase’
#> The following object is masked from ‘package:MatrixGenerics’:
#> 
#>     rowMedians
#> The following objects are masked from ‘package:matrixStats’:
#> 
#>     anyMissing, rowMedians
#> 
#> Attaching package: ‘SummarizedExperiment’
#> The following object is masked from ‘package:Seurat’:
#> 
#>     Assays
#> The following object is masked from ‘package:SeuratObject’:
#> 
#>     Assays

sce <- SingleCellExperiment(
  assays = list(counts = matrix(1:5, 5L, 3L)),
  colData = DataFrame(cell = 1:3, row.names = paste0("Cell", 1:3)),
  rowData = DataFrame(gene = 1:5, row.names = paste0("Gene", 1:5))
)

as_AnnData(sce)
#> AnnData object with n_obs × n_vars = 3 × 5
#>     obs: 'cell'
#>     var: 'gene'
#>     layers: 'counts'
