Chapter 7 Appendix

7.1 Useful resources

Here, we include a list of useful websites and publications related to the computational analysis of single-cell data. This mainly comprises general reviews / tutorials as well as benchmarking studies on certain aspects of single-cell analysis. If you think that there is a useful paper that can be included here, please contact me at

7.1.1 Useful websites

  • https://www.scrna-tools.org/: Database recording details of computational tools designed for analyzing scRNA-seq data, maintained by the Oshlack lab.

7.1.2 General features of single-cell data

7.1.3 General reviews / tutorials

  • (Lun, McCarthy, and Marioni 2016) “A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor”: One of the pioneering scRNA-seq analysis tutorials covering quality control, normalization, feature selection and dimension reduction

  • (Yuan et al. 2017) “Challenges and emerging directions in single-cell analysis”: Challenges in data preprocessing and potential applications and extensions of scRNA-seq

  • (Hwang, Lee, and Bang 2018) “Single-cell RNA sequencing technologies and bioinformatics pipelines”: Computational challenges (e.g. data preprocessing) and unsolved problems (cell type identification, inferring regulatory networks, cell hierarchy reconstruction)

  • (Chen, Ning, and Shi 2019) “Single-Cell RNA-Seq Technologies and Related Computational Data Analysis”: Summarizes the various steps in single cell analysis (e.g. read mapping, quality control, batch correction, normalization, imputation, dimension reduction, cell population identification, differential expression, pseudotime) and comprehensively list tools that are used in these analysis steps

  • (Luecken and Theis 2019) “Current best practices in single‐cell RNA‐seq analysis: a tutorial”: Summarizes the various steps in single cell analysis (e.g. quality control, normalization, data integration, dimension reduction, cluster analysis, trajectory analysis, cell-vs-gene-level analysis) with special attention to “pitfalls and reccomendations”

  • (Lahnemann et al. 2020) “Eleven grand challenges in single-cell data science”

7.1.4 Dimension reduction / trajectory inferrence

  • (Saelens et al. 2019) “A comparison of single-cell trajectory inference methods”

  • (Sun et al. 2019) “Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis”

  • (Tsuyuzaki et al. 2020) “Benchmarking principal component analysis for large-scale single-cell RNA-sequencing”

7.1.5 Clustering / differential expression / cell identification

7.1.6 Network / pathway-based algorithms

  • (Holland et al. 2020) “Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data”

  • (Pratapa et al. 2020) “Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data”

7.1.7 Others

  • (Tian et al. 2019) “Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments”

  • (Tran et al. 2020) “A benchmark of batch-effect correction methods for single-cell RNA sequencing data”

  • (Wagner and Klein 2020) “Lineage tracing meets single-cell omics: opportunities and challenges”

  • (Xi and Li 2021) “Benchmarking Computational Doublet-Detection Methods for Single-Cell RNA Sequencing Data”

7.1.8 Experimental protocols

  • (Ding et al. 2020) “Systematic comparison of single-cell and single-nucleus RNA-sequencing methods”

  • (Mereu et al. 2020) “Benchmarking single-cell RNA-sequencing protocols for cell atlas projects”

7.2 Installing packages: some tips

Most analysis performed in this guide uses the R programming language. Furthermore, it is useful to install RStudio, which is the preferred IDE (in layman’s term, a graphic user interface) for R. There are two ways to install R and RStudio. The first installation method (reccomended) is to install it via the Anaconda platform here. The second installation method is to install it via the Rstudio website here.

Next, we need to install the required packages:

install.packages("data.table")
install.packages("Matrix")
install.packages("ggplot2")
install.packages("gridExtra")
install.packages("RColorBrewer")
install.packages("Seurat")
install.packages("clustree")

References

Abdelaal, Tamim, Lieke Michielsen, Davy Cats, Dylan Hoogduin, Hailiang Mei, Marcel J. T. Reinders, and Ahmed Mahfouz. 2019. “A Comparison of Automatic Cell Identification Methods for Single-Cell RNA Sequencing Data.” Genome Biology 20: 194. https://doi.org/10.1186/s13059-019-1795-z.
Chen, Geng, Baitang Ning, and Tieliu Shi. 2019. “Single-Cell RNA-Seq Technologies and Related Computational Data Analysis.” Frontiers in Genetics 10: 317. https://doi.org/10.3389/fgene.2019.00317.
Ding, Jiarui, Xian Adiconis, Sean K. Simmons, Monika S. Kowalczyk, Cynthia C. Hession, Nemanja D. Marjanovic, Travis K. Hughes, et al. 2020. “Systematic Comparison of Single-Cell and Single-Nucleus RNA-Sequencing Methods.” Nature Biotechnology. https://doi.org/10.1038/s41587-020-0465-8.
Holland, Christian H., Jovan Tanevski, Javier Perales-Paton, Jan Gleixner, Manu P. Kumar, Elisabetta Mereu, Brian A. Joughin, et al. 2020. “Robustness and Applicability of Transcription Factor and Pathway Analysis Tools on Single-Cell RNA-Seq Data.” Genome Biology 21: 36. https://doi.org/10.1186/s13059-020-1949-z.
Hwang, Byungjin, Ji Hyun Lee, and Duhee Bang. 2018. “Single-Cell RNA Sequencing Technologies and Bioinformatics Pipelines.” Experimental & Molecular Medicine 50: 96. https://doi.org/10.1038/s12276-018-0071-8.
Kiselev, Vladimir Yu, Tallulah S. Andrews, and Martin Hemberg. 2019. “Challenges in Unsupervised Clustering of Single-Cell RNA-Seq Data.” Nature Reviews Genetics 20: 273–82. https://doi.org/10.1038/s41576-018-0088-9.
Lahnemann, David, Johannes Koster, Ewa Szczurek, Davis J. McCarthy, Stephanie C. Hicks, Mark D. Robinson, Catalina A. Vallejos, et al. 2020. “Eleven Grand Challenges in Single-Cell Data Science.” Genome Biology 21: 31. https://doi.org/10.1186/s13059-020-1926-6.
Luecken, Malte D, and Fabian J Theis. 2019. “Current Best Practices in Single-Cell RNA-Seq Analysis: A Tutorial.” Molecular Systems Biology 15: e8746. https://doi.org/10.15252/msb.20188746.
Lun, Aaron T. L., Davis J. McCarthy, and John C. Marioni. 2016. “A Step-by-Step Workflow for Low-Level Analysis of Single-Cell RNA-Seq Data with Bioconductor.” F1000Research 5: 2122. https://doi.org/10.12688/f1000research.9501.2.
Mereu, Elisabetta, Atefeh Lafzi, Catia Moutinho, Christoph Ziegenhain, Davis J. McCarthy, Adrian Alvarez-Varela, Eduard Batlle, et al. 2020. “Benchmarking Single-Cell RNA-Sequencing Protocols for Cell Atlas Projects.” Nature Biotechnology, 1–9. https://doi.org/10.1038/s41587-020-0469-4.
Pratapa, Aditya, Amogh P. Jalihal, Jeffrey N. Law, Aditya Bharadwaj, and T. M. Murali. 2020. “Benchmarking Algorithms for Gene Regulatory Network Inference from Single-Cell Transcriptomic Data.” Nature Methods 17: 147–54. https://doi.org/10.1038/s41592-019-0690-6.
Saelens, Wouter, Robrecht Cannoodt, Helena Todorov, and Yvan Saeys. 2019. “A Comparison of Single-Cell Trajectory Inference Methods.” Nature Biotechnology 37: 547–54. https://doi.org/10.1038/s41587-019-0071-9.
Soneson, Charlotte, and Mark D Robinson. 2018. “Bias, Robustness and Scalability in Single-Cell Differential Expression Analysis.” Nature Methods 15: 255–61. https://doi.org/10.1038/nmeth.4612.
Sun, Shiquan, Jiaqiang Zhu, Ying Ma, and Xiang Zhou. 2019. “Accuracy, Robustness and Scalability of Dimensionality Reduction Methods for Single-Cell RNA-Seq Analysis.” Genome Biology 20: 269. https://doi.org/10.1186/s13059-019-1898-6.
Svensson, Valentine. 2020. “Droplet scRNA-Seq Is Not Zero-Inflated.” Nature Biotechnology 38: 147–50. https://doi.org/10.1038/s41587-019-0379-5.
Tian, Luyi, Xueyi Dong, Saskia Freytag, Kim-Anh Le Cao, Shian Su, Abolfazl JalalAbadi, Daniela Amann-Zalcenstein, et al. 2019. “Benchmarking Single Cell RNA-Sequencing Analysis Pipelines Using Mixture Control Experiments.” Nature Methods 16: 479–87. https://doi.org/10.1038/s41592-019-0425-8.
Tran, Hoa Thi Nhu, Kok Siong Ang, Marion Chevrier, Xiaomeng Zhang, Nicole Yee Shin Lee, Michelle Goh, and Jinmiao Chen. 2020. “A Benchmark of Batch-Effect Correction Methods for Single-Cell RNA Sequencing Data.” Genome Biology 21: 12. https://doi.org/10.1186/s13059-019-1850-9.
Tsuyuzaki, Koki, Hiroyuki Sato, Kenta Sato, and Itoshi Nikaido. 2020. “Benchmarking Principal Component Analysis for Large-Scale Single-Cell RNA-Sequencing.” Genome Biology 21: 9. https://doi.org/10.1186/s13059-019-1900-3.
Wagner, Daniel E., and Allon M. Klein. 2020. “Lineage Tracing Meets Single-Cell Omics: Opportunities and Challenges.” Nature Reviews Genetics. https://doi.org/10.1038/s41576-020-0223-2.
Xi, Nan M., and Jingyi J. Li. 2021. “Benchmarking Computational Doublet-Detection Methods for Single-Cell RNA Sequencing Data.” Cell Systems. https://doi.org/10.1016/j.cels.2020.11.008.
Yuan, Guo-Cheng, Long Cai, Michael Elowitz, Tariq Enver, Guoping Fan, Guoji Guo, Rafael Irizarry, et al. 2017. “Challenges and Emerging Directions in Single-Cell Analysis.” Genome Biology 18: 84. https://doi.org/10.1186/s13059-017-1218-y.