Deep learning: new computational modelling techniques for genomics

As a data-driven science, genomics largely utilizes machine learning to capture dependencies in data and derive novel biological hypotheses. However, the ability to extract new insights from the exponentially increasing volume of genomics data requires more expressive machine learning models. By effectively leveraging large data sets, deep learning has transformed fields such as computer vision and natural language processing. Now, it is becoming the method of choice for many genomics modelling tasks, including predicting the impact of genetic variation on gene regulatory mechanisms such as DNA accessibility and splicing.

Acknowledgements

Ž.A. was supported by the German Bundesministerium für Bildung und Forschung (BMBF) through the project MechML (01IS18053F). The authors acknowledge M. Heinig and A. Raue for valuable feedback.

Reviewer information

Nature Reviews Genetics thanks C. Greene and the other anonymous reviewer(s) for their contribution to the peer review of this work.

Author information

These authors contributed equally: Gökcen Eraslan, Žiga Avsec.

Authors and Affiliations

Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany Gökcen Eraslan & Fabian J. Theis
School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany Gökcen Eraslan & Fabian J. Theis
Department of Informatics, Technical University of Munich, Garching, Germany Žiga Avsec & Julien Gagneur
Department of Mathematics, Technical University of Munich, Garching, Germany Fabian J. Theis

Gökcen Eraslan