EasyCellType: marker-based cell-type annotation by automatically querying multiple databases

Ruoxing Li, Jianjun Zhang, Ziyi Li

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Motivation: Cell label annotation is a challenging step in the analysis of single-cell RNA sequencing (scRNA-seq) data, especially for tissue types that are less commonly studied. The accumulation of scRNA-seq studies and biological knowledge leads to several well-maintained cell marker databases. Manually examining the cell marker lists against these databases can be difficult due to the large amount of available information. Additionally, simply overlapping the two lists without considering gene ranking might lead to unreliable results. Thus, an automated method with careful statistical testing is needed to facilitate the usage of these databases. Results: We develop a user-friendly computational tool, EasyCellType, which automatically checks an input marker list obtained by differential expression analysis against the databases and provides annotation recommendations in graphical outcomes. The package provides two statistical tests, gene set enrichment analysis and a modified version of Fisher's exact test, as well as customized database and tissue type choices. We also provide an interactive shiny application to annotate cells in a user-friendly graphical user interface. The simulation study and real-data applications demonstrate favorable results by the proposed method. Availability and implementation: https://biostatistics.mdanderson.org/shinyapps/EasyCellType/; https://bioconduc tor.org/packages/devel/bioc/html/EasyCellType.html.

Original languageEnglish (US)
Article numbervbad029
JournalBioinformatics Advances
Volume3
Issue number1
DOIs
StatePublished - 2023

ASJC Scopus subject areas

  • Structural Biology
  • Molecular Biology
  • Genetics
  • Computer Science Applications

MD Anderson CCSG core facilities

  • Biostatistics Resource Group

Fingerprint

Dive into the research topics of 'EasyCellType: marker-based cell-type annotation by automatically querying multiple databases'. Together they form a unique fingerprint.

Cite this