RNA-Seq data analysis

Accessible, user-friendly RNA sequencing software tools designed for biologists

Intuitive analysis of RNA-Seq data

Once the domain of bioinformatics experts, RNA sequencing (RNA-Seq) data analysis is now more accessible than ever. Illumina offers pushbutton RNA-Seq software solutions packaged in intuitive user interfaces designed for biologists.

These user-friendly tools support a broad range of next-generation sequencing (NGS) studies, from gene expression analysis to total RNA expression profiling and more.

Close up clip of the back of two female scientists discussing results on the monitor of a NextSeq 1000/2000 sequencer, one woman explaining while the other nods.

Popular RNA-Seq data analysis applications

Our RNA-Seq data analysis solutions support a wide range of applications, including differential gene expression, transcriptome profiling, and more, enabling biologists to maximize the discovery power of their gene expression studies. Explore popular RNA-Seq data analysis applications.

Transcript quantification

Transcript quantification, or gene expression profiling, is a widely used application for RNA-Seq analysis. By aligning sequencing reads to a reference genome, quantification tools deliver expression levels of transcripts across samples, conditions, or cell types. These expression profiles are often used as the foundation for further analyses, including biomarker discovery, pathway enrichment, and differential expression.

Differential gene expression analysis

During secondary analysis, differential gene expression analysis measures differences in normalized read count data in a group of genes or RNA transcripts to quantify changes in expression levels across experimental groups. This data can be combined with phenotype information, biological knowledge from functional genomics, or other data sources to help researchers understand the broader impact of the secondary analysis results.

Gene fusion calling

Identifies hybrid genes formed as a result of translocation, deletions, or chromosomal inversions. Identification of these hybrid genes enables researchers to obtain insights into genome instability and to understand the role of these genes in disease mechanisms, especially cancer.1

Analysis of RNA-Seq data: key considerations and solutions

Learn about integrated, accessible gene expression and regulation data analysis solutions that support researchers across the entire informatics workflow, from lab and sample management through insight generation.

Benefits of RNA-Seq data analysis with Illumina tools

RNA-Seq data can be quickly and securely transferred, stored, and analyzed in Illumina Connected Analytics or BaseSpace Sequence Hub, the Illumina cloud computing platforms. Both platforms offer in-cloud access to DRAGEN secondary analysis for accurate, ultrarapid analysis of RNA-Seq and other NGS data. 

DRAGEN secondary analysis uses lossless genomic data compression to reduce the data storage footprint by as much as 5×, all while preserving data integrity. Illumina Connected Multiomics offers user-friendly analysis and visualization of RNA-Seq and multiomics data, and accommodates input files from DRAGEN secondary analysis or select third-party platforms for maximum flexibility. Learn more about:

DRAGEN secondary analysis

Illumina Connected Analytics

BaseSpace Sequence Hub

Illumina Connected Multiomics

Our RNA-Seq analysis tools are:

  • Accessible to any researcher, regardless of bioinformatics experience
  • An expert-preferred suite of RNA-Seq software tools, developed or optimized by Illumina or from a growing ecosystem of third-party app providers
  • Designed to support common transcriptome studies, from gene expression quantification to detection of novel transcripts, coding single nucleotide polymorphisms (cSNPs), gene fusions, and more
  • Suitable for human, mouse, and rat RNA-Seq analysis (certain apps support additional species)
  • Compatible with all Illumina sequencing systems

How to analyze RNA-Seq data

View these resources and technical tips to learn how to analyze and interpret data from RNA-Seq experiments.

Thumbnail

Introduction to sequencing data analysis

Learn about key concepts in bioinformatics analysis with our Introduction to Sequencing Data Analysis video as our team of experts discusses experimental design, RNA-Seq data analysis, variant calling, and de novo assembly.

Streamlining bulk RNA-Seq analysis

Hear from our Sr. Bioinformatics Applications Scientist, Matteo Luberti, who discusses a start-to-finish bulk RNA-Seq analysis process. This webinar demonstrates how to perform each analysis step with simplified point-and-click actions.

Thumbnail

Spatial data analysis with Illumina Connected Multiomics

Watch as our technical expert provides an end-to-end overview on spatial data analysis using Illumina Connected Multiomics.

Thumbnail

How to analyze single-cell transcriptomics data

Join our experts as they walk through the features of Illumina Connected Multiomics and the steps to analyze single-cell transcriptomics data and unlock potential biological insights.

Interpretation of RNA-seq data

After data analysis, results can be transferred easily to Correlation Engine for functional annotation to understand the biological effects of gene expression changes. This omics research database and suite of tools contain data sets from thousands of public studies that inform biological interpretation. The information is curated and normalized, giving the data additional power to help researchers define associations by tissue, disease, compound, and/or genetic perturbation.

Common applications of Correlation Engine include connecting differential gene expression data from RNA-Seq experiments with disease associations and visualizing correlated genes and microRNA targets.

Providing biological context

Learn how Correlation Engine helps University of Pittsburgh researchers understand gene upregulation and downregulation, gene function, drug activity, and other mechanisms, and contextualize their results for publications and grants.

Profile view of a female scientist looking at a screen and typing on a keyboard of a laptop; located in a lab office with printed reports on the table, a microscope, another scientist, and other monitors blurry in the background.

Core lab uses BaseSpace Sequence Hub to analyze mRNA-Seq data

The School of Biomedical Engineering (SBME) Sequencing Core (formerly BRC-Seq) at the University of British Columbia uses BaseSpace Sequence Hub to analyze and share data. Their studies frequently involve analysis of mRNA sequencing data to assess the impact of different experimental treatments on the transcriptome. The BaseSpace workflow helps the lab track samples and deliver high-quality sequencing data to its customers.

RNA-Seq data examples for common workflows

View sample data sets

See sample data sets for various RNA-Seq methods in BaseSpace Sequence Hub or test BaseSpace Apps and evaluate the results interactively.

Note that a customer login is required to access BaseSpace Sequence Hub and view specific data sets.

Learn more about BaseSpace Sequence Hub

Register for BaseSpace Sequence Hub

Browse demo data in BaseSpace Sequence Hub

See all Illumina software and informatics tools

RNA-Seq data analysis FAQ

A primary goal of RNA-Seq data analysis is to identify differential gene expression and coregulated genes and transform raw sequencing reads into biological insights. Sources of material commonly used for RNA-Seq studies include sorted cells, whole-tissue homogenates, and cells cultured in vitro.

RNA-Seq is important as it provides a quantitative, genome-wide view of the transcriptome. Data analysis bridges raw sequencing data to actionable biological insights, allowing researchers to understand how gene expression changes across tissues, conditions, and treatments.2

Visit our RNA sequencing page or watch our Introduction to RNA sequencing webinar to learn more about RNA-Seq, library prep kits, input quantity, and data quality recommendations.

Several key steps in RNA-Seq data analysis are commonly used to make sure that raw RNA-Seq reads are transformed into biological insights, for example, identifying differentially expressed genes.

Key steps in RNA-Seq data analysis include the following:

  1. Quality control and read cleaning: Using tools such as FastQC to evaluate raw reads for sequencing issues, including adapter contamination, read quality, and removing low-quality reads.
  2. Read alignment and quantification: DRAGEN secondary analysis software and other related tools map reads to a reference genome or transcriptome and quantify gene or transcript expression.
  3. Data preprocessing and normalization: Account for library size and normalize counts to correct for biases to compare gene expression across samples.
  4. Differential expression (DE) analysis and downstream analysis: Identify genes with statistically significant differences in expression between samples or conditions and interpret biological insights through pathway enrichment and visualizations.3

Quality control and trimming are important steps in RNA-Seq preprocessing where QC is used to check raw reads for issues, such as sequencing errors, while trimming removes low-quality bases and adapter sequences. These steps ultimately help improve read alignment accuracy and reliable downstream analyses.4

The Illumina DRAGEN Single Cell RNA app is like the 10X Genomics Cell Ranger pipeline in terms of functionality (read alignment, transcript counting, output metrics, and cell-by-gene matrix analysis metrics). Outputs from both pipelines can be loaded into Illumina Connected Multiomics for downstream analysis. Cell Ranger is specific to 10X Genomics data only. The DRAGEN Single Cell RNA app can support multiple assays, although we have optimized parameters for the Illumina Single Cell 3′ RNA prep kit.

/ Results

Related content

Lossless genomic data compression

Lossless genomic data compression technology reduces the data storage footprint by as much as five times, and decreases data storage and transfer costs.

Speak to a specialist

Talk to an expert to learn more about RNA sequencing data analysis solutions.

References

  1. An S, Koh HH, Chang ES, et al. Unearthing novel fusions as therapeutic targets in solid tumors using targeted RNA sequencing. Front Oncol. 2022;12:892918. Published 2022 Aug 10. doi:10.3389/fonc.2022.892918
  2. Koch CM, Chiu SF, Akbarpour M, et al. A Beginner's Guide to Analysis of RNA Sequencing Data. Am J Respir Cell Mol Biol. 2018;59(2):145-157. doi:10.1165/rcmb.2017-0430TR
  3. Deshpande D, Chhugani K, Chang Y, et al. RNA-seq data science: From raw data to effective interpretation. Front Genet. 2023;14:997383. Published 2023 Mar 13. doi:10.3389/fgene.2023.997383
  4. Sheng Q, Vickers K, Zhao S, et al. Multi-perspective quality control of Illumina RNA sequencing data analysis. Brief Funct Genomics. 2017;16(4):194-204. doi:10.1093/bfgp/elw035