Benchmarking principal component analysis for large-scale single-cell RNA-sequencing.: Social discussions and analytics

Benchmarking principal component analysis for large-scale single-cell RNA-sequencing.

Koki Tsuyuzaki, Hiroyuki Sato, Kenta Sato, Itoshi Nikaido

January 2020 Genome Biol

Abstract

Background: Principal component analysis (PCA) is an essential method for analyzing single-cell RNA-seq (scRNA-seq) datasets, but for large-scale scRNA-seq datasets, computation time is long and consumes large amounts of memory.

Results: In this work, we review the existing fast and memory-efficient PCA algorithms and implementations and evaluate their practical application to large-scale scRNA-seq datasets. Our benchmark shows that some PCA algorithms based on Krylov subspace and randomized singular value decomposition are fast, memory-efficient, and more accurate than the other algorithms.

Conclusion: We develop a guideline to select an appropriate PCA implementation based on the differences in the computational environment of users and developers.

Download full-text PDF	Link	Source
Download Source 1	https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1900-3	Web Search
Download Source 2	http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6970290	PMC
Download Source 3	http://dx.doi.org/10.1186/s13059-019-1900-3	DOI Listing

Publication Analysis

Top Keywords

scrna-seq datasets

principal component

component analysis

large-scale scrna-seq

fast memory-efficient

pca algorithms

benchmarking principal

analysis large-scale

large-scale single-cell

single-cell rna-sequencing

Socials:

Benchmarking principal component analysis for large-scale single-cell RNA-sequencing.

Abstract

Download full-text PDF

Publication Analysis

Top Keywords