Search
Close this search box.
Choosing Between Whole Transcriptome Total RNA-Seq and 3′ mRNA-Seq: A Practical Guide and Comparison

Choosing Between Whole Transcriptome Total RNA-Seq and 3′ mRNA-Seq: A Practical Guide and Comparison

Every successful RNA-Seq experiment starts with a thorough and meticulous experimental plan. Designing the experiment is key and one of the main challenges for RNA sequencing. The potential of the technology to answer a variety of biological questions can only be fully exploited with the right setup to ensure the conclusions are meaningful and interpretations are biologically sound (Fig. 1).
The success of any Next-Generation Sequencing (NGS) project depends on its initial experimental planning. Key factors like the experiment's goal, sample characteristics, and overall design directly shape the wet lab procedures and influence the selection of the entire data analysis pipeline.
Figure 1 | Experimental planning is the most crucial step of any NGS experiment. The aim of the experiment, sample type and setup determine the wet lab workflow and impact data analysis pipelines.

This blog is also available as a recorded webinar!

Key considerations for experimental planning

Several factors influence the experimental setup, for example, which type of data is needed to provide the answer for your specific project? Do you need quantitative (e.g., gene expression levels) or qualitative data (e.g., transcript resolution, splicing information, isoform abundance, structure, etc.)? Further, the type of RNA of interest is also a decisive factor for the design of the lab workflow. If you are interested in small RNAs, you not only need to use a suitable small RNA-Seq library preparation kit but also pick an RNA Extraction method or kit that retains small RNAs. In case, you are interested in non-coding RNAs, you should use ribosomal depletion during preprocessing and avoid mRNA-selection as most lncRNAs are not polyadenylated and would be lost. In addition, choosing the right conditions, including controls, optimizing the sample plate layouts, and providing replicates are crucial for the success of the experiment. For more information on experimental design and planning, see our recent blog Planning for Success.

The distinction between quantitative and qualitative data also influences the choice of library prep in the lab. For quantitative gene expression information, 3′ mRNA-Seq library preps, such as QuantSeq are ideal. For quantitative transcript-level information, whole transcriptome library preps such as CORALL RNA-Seq are used in combination with poly(A) selection or rRNA-depletion depending on the RNA species under investigation.

Choosing between 3' mRNA-Seq and Whole Transcriptome RNA-Seq Technologies

Both, 3′ mRNA-Seq and whole transcriptome sequencing (WTS) can be used to assess expression information during RNA-Seq experiments. Before diving into practical examples how these methods influence results and conclusions of real-life studies, this section briefly introduces the methodologies, advantages, and differences.

3′ mRNA-Seq libraries are generated from total RNA using in-prep poly(A) selection through an initial oligo(dT) priming step – streamlining library preparation and omitting several steps needed for classical library preps. Sequencing reads are localized to the 3′ end of polyadenylated RNAs which is sufficient to identify gene expression patterns even at low sequencing depth of 1 – 5 M reads/sample (Fig. 2). As QuantSeq generates one fragment per transcript, data analysis is straight-forward and results can be obtained extremely fast by read counting without the need to normalize to transcript coverage and concentration estimates (Moll et al., 2014). Due to the robustness of 3′ mRNA-Seq library preparation protocols, they are the method of choice for gene expression profiling from demanding samples including degraded and FFPE material. Our blog article “Transcriptomics on FFPE samples” provides several examples of successful studies utilizing 3′ mRNA-Seq on this sample type.

Transcript body coverage plot illustrating global transcriptome coverage for whole transcriptome (WTS, green) and 3' mRNA-Seq library preparations (blue). While WTS generates even coverage across the entire transcript, 3’ mRNA-Seq libraries generate reads at the 3' end of polyadenylated transcripts and coverage accumulates at transcript end sites covering the 3' untranslated region (UTR) and the ultimate exon(s).
Figure 2 | Comparison between whole transcriptome sequencing (WTS) and 3′ mRNA-Seq for transcript body coverage. 3' mRNA-Seq, e.g., QuantSeq, generates reads from the 3'-end by accessing the poly(A) tail and covering the 3' untranslated region (UTR) and 3' exon(s). WTS libraries use random primers and generate reads that are randomly distributed across the entire transcript including 5' UTR, coding sequence (CDS), and 3' UTR resulting in an even coverage.

Whole transcriptome library preps are most common for RNA-Seq projects. For WTS, cDNA synthesis is initiated with random primers ultimately distributing sequencing reads across the entire transcript (Fig. 2). To prevent random primers from binding to highly abundant ribosomal RNA (rRNA) which would result in a high percentage of unnecessary sequencing reads, rRNA must be effectively removed prior to library preparation – either by selecting polyadenylated RNAs or by specifically depleting rRNA. As this enrichment / depletion step is done prior to the library preparation, WTS workflows typically take longer than 3′ mRNA-Seq workflows. Higher read depth is required to provide sufficient coverage across the entire transcript, reads are aligned, normalized, and individual transcript concentrations are estimated.

Compared to 3′ mRNA-Seq, WTS data offers additional information such as differences in the expression between isoforms of a single gene, alternative splicing, gene fusions, etc. WTS also typically detects more differentially expressed genes, however, biological conclusions, e.g., the effects of various conditions or compounds on biological processes, networks and (de)activated pathways are highly similar between both methods. As a result, 3′ mRNA-Seq is often used for large-scale projects addressing expression patterns, initial experiments to find conditions of interest or to identify drug effects. Whole transcriptome sequencing is then employed on smaller sample sets to investigate the mode-of-action of compounds, unravel effects on transcript level or determine the influence of non-coding transcripts, fusions or splice variations.

When to choose whole transcriptome sequencing (WTS) versus 3' mRNA-Seq

Ultimately, the choice between WTS and 3′ mRNA-Seq depends on your specific research question:

Choose WTS/Total RNA-Seq if you need:

  • A global view of all RNA types (coding and non-coding).
  • Information about alternative splicing, novel isoforms, or fusion genes.
  • To work with samples where the poly(A) tail might be absent or highly degraded (e.g., prokaryotic RNA, some highly degraded clinical samples without good 3′ end preservation).

Choose 3′ mRNA-Seq if you need:

  • Accurate and cost-effective gene expression quantification (i.e., how much of each gene is expressed).
  • High-throughput screening of many samples.
  • A streamlined workflow with simpler data analysis.
  • To efficiently profile mRNA expression from degraded RNA and challenging sample types (like FFPE).

A practical example: Comparing WTS and 3' mRNA-Seq data sets for differential expression, gene set enrichment and pathway analysis using Omics Playground

Several publicly available studies have assessed 3′ mRNA-Seq workflows in comparison with traditional whole transcriptome sequencing. The following study by Ma et al., 2019 compares both RNA-Seq methods to assess differential expression in murine livers following a normal or high iron diet for 5 weeks. The authors conclude that both methods generate highly similar results and show a reproducibility between biological replicates.

Background
3’ RNA sequencing provides an alternative to whole transcript analysis. However, we do not know a priori the relative advantage of each method. Thus, a comprehensive comparison between the whole transcript and the 3′ method is needed to determine their relative merits. To this end, we used two commercially available library preparation kits, the KAPA Stranded mRNA-Seq kit (traditional method) and the Lexogen QuantSeq 3’ mRNA-Seq kit (3′ method), to prepare libraries from mouse liver RNA. We then sequenced and analyzed the libraries to determine the advantages and disadvantages of these two approaches.

Results
We found that the traditional whole transcript method and the 3’ RNA-Seq method had similar levels of reproducibility. As expected, the whole transcript method assigned more reads to longer transcripts, while the 3′ method assigned roughly equal numbers of reads to transcripts regardless of their lengths. We found that the 3’ RNA-Seq method detected more short transcripts than the whole transcript method. With regard to differential expression analysis, we found that the whole transcript method detected more differentially expressed genes, regardless of the level of sequencing depth.

Conclusions
The 3’ RNA-Seq method was better able to detect short transcripts, while the whole transcript RNA-Seq was able to detect more differentially expressed genes. Thus, both approaches have relative advantages and should be selected based on the goals of the experiment.

As expected, WTS detects more differentially expressed genes and assigns more reads to longer transcripts requiring stringent length normalization (Fig. 3). 3′ mRNA-Seq is less sensitive detecting fewer differentially expressed due to reads localizing to the less diverse 3′ UTR, which contains many common regulatory features such as protein binding motifs. Nevertheless, the conclusions for gene enrichment and pathway analysis remain the same regardless which method is chosen – with the additional benefit of 3′ mRNA-Seq requiring less read depth and thus allowing more samples to be sequenced in parallel.

To use 3′ mRNA-Seq effectively, a well-curated 3′ annotation is crucial! Model organisms such as human and mouse annotations are updated regularly and well annotated on transcript end sites. For non-model organisms, it is important to ensure transcript end site information is available and to improve the annotation if needed. Insufficient 3′ annotation leads to reduced mapping rates even if the library preparation and sequencing workflow was optimal and resulted in high-quality data. For more information on alignments and read mapping, please see our Lexicon chapter on secondary data analysis.

Figure 3 | Whole Transcriptome Sequencing (WTS) generates randomly distributed reads across the complete transcript, requiring subsequent fragment assembly and complex normalization for analysis. 3' mRNA-Seq methods like QuantSeq concentrate reads at the 3' end of transcripts. This focused coverage enables a more computationally efficient analysis through direct read counting, leading to faster results.

The authors focused mostly on differential expression for pathways associated with iron metabolism. Reanalysis of the dataset with Omics Playground confirmed the expected findings of increased detection of differentially expressed genes in WTS approaches (Fig. 4). Nevertheless, 3′ mRNA-Seq reliably captures the majority of key differentially expressed genes, providing highly similar results to WTS approaches and consistent biological conclusions at the level of enriched gene sets (Table 1) and differentially regulated pathways (Table 2). In addition to iron metabolism, reanalysis revealed further gene sets affected by an iron-rich diet, such as genes involved in the regulation of the circadian rhythm or inflammatory responses that are robustly detected with both methods. Tables 1 and 2 compare results from gene set enrichment and pathway analysis.

Comparison of differentially expressed genes detected by whole transcriptome sequencing (WTS) or 3' mRNA-Seq. While WTS approaches detect more differentially expressed genes, as expected, 3' mRNA-Seq protocols reliably capture key differentially expressed genes, 78 % of which are also detected by WTS preps. On gene set enrichment and pathway-level, both methods provide highly comparable results with 3' mRNA-Seq offering a viable cost-efficient alternative especially for larger experiments.
Figure 4 | Differentially expressed genes by sequencing method. More differentially expressed (DE) genes are detected by whole transcriptome sequencing (WTS) than by 3’ mRNA-seq. The 3' mRNA sequencing strategy identifies a smaller number of DE genes compared to the WTS strategy. However, most DE genes detected by the 3' mRNA-Seq strategy are also identified by the WTS approach, indicating strong overlap between the two methods in the detection of significant gene expression changes. This demonstrates that, while 3' mRNA-Seq detects a lower number of DE genes overall, it reliably captures the majority of key DE genes identified by the more comprehensive WTS method.
Gene setRank WTSRank 3’mRNAseq
PATHWAY_REACTOME:Response Of EIF2AK1 (HRI) To Heme Deficiency11
GO_BP:negative regulation of circadian rhythm24
PATHWAY_WIKI:Photodynamic therapy-induced unfolded protein response36
PATHWAY_WIKI:Cholesterol biosynthesis pathway411
GO_BP:negative regulation of acute inflammatory response53
PATHWAY_WIKI:Matrix metalloproteinases.639
GO_BP:myeloid dendritic cell chemotaxis77
GO_BP:prostaglandin secretion882
PATHWAY_BIOPLANET:PERK-regulated gene expression98
PATHWAY_REACTOME:ATF4 Activates Genes In Response To Endoplasmic Reticulum Stress105
GO_BP:eosinophil migration1123
GO_BP:leukocyte aggregation1262
PATHWAY_REACTOME:PERK Regulates Gene Expression139
GO_BP:negative regulation of immune effector process14172
GO_BP:eosinophil chemotaxis1524

Among the top 15 upregulated gene sets, the 3′ mRNA-seq method captures all the genesets identified by the WTS approach, though with shifts in rank for specific categories, especially below the very top hits. This suggests that for this dataset and analytical approach, all major strongest signals detected by WTS were also detected by 3’ mRNA-seq. However, the strength of association for non-top gene sets differed, which may impact pathway prioritization and secondary biological inferences.

PathwayRank WSTRank 3’mRNAseq
PATHWAY_WIKI:Photodynamic therapy-induced unfolded protein response_11
PATHWAY_WIKI:Cholesterol biosynthesis pathway28
PATHWAY_WIKI:Matrix metalloproteinases326
PATHWAY_WIKI:Mammary gland development pathway – Embryonic development (Stage 1 of 4)418
PATHWAY_WIKI:Cholesterol synthesis disorders531
PATHWAY_WIKI:Cholestasis_WikiPathways6
PATHWAY_WIKI:Chronic hyperglycemia impairment of neuron function7
PATHWAY_WIKI:Differentiation of white and brown adipocyte8
PATHWAY_WIKI:NRF2-ARE regulation916
PATHWAY_WIKI:Platelet-mediated interactions with vascular and circulating cells10
PATHWAY_WIKI:Statin inhibition of cholesterol production114
PATHWAY_WIKI:Unfolded protein response1210
PATHWAY_WIKI:Prostaglandin signaling13
PATHWAY_WIKI:Cytokines and inflammatory response14
PATHWAY_WIKI:nsp1 from SARS-CoV-2 inhibits translation initiation in the host cell1536
For Wikipathways enrichment, the 3′ mRNA-seq approach recovers most—roughly 60%—of the strongest signals detected by the WTS method among the top 15, but several strong WTS signals are undetectable or non-significant in 3′ mRNA-seq (notably immunity-related pathways), highlighting the limitations of the 3’ mRNA-seq approach.

Choosing suitable data analysis tools is of utmost importance to generate conclusive and comparable results and drawing valid, scientifically sound conclusions. Cloud-based plug-and-play platforms such as Lexogen’s Kangooroo and Omics Playground offer curated differential expression analysis pipelines with interactive visualization facilitating analysis and offering a fast, easy, and straightforward view into the data set. In addition, a variety of different options for differential expression and gene set enrichment are typically available and both platforms are suitable for analysis of different types of datasets – specifically 3′ mRNA-Seq and WTS data.

Replicates and choosing the right control conditions are the key determinants for sound biological conclusions

Control conditions are fundamental for robust and interpretable differential expression analyses. Neutral control conditions provide a meticulously defined baseline as transcriptomic reference, against which the profiles of experimentally perturbed model systems or disease states can be compared. Without such controls, it becomes impossible to attribute observed changes in gene expression to the specific experimental variable of interest. Instead, detected variations could merely reflect inherent stochastic biological variation, technical noise introduced during sample handling and processing, or the influence of uncontrolled confounding variables. Consequently, the inclusion of appropriate control groups enables the application of sound statistical inference, minimizes the risk of both false positive and false negative errors.

Technical and biological replicates ensure reliable results

The importance of replicates cannot be emphasized enough in scientific research, especially when conducting NGS experiments producing complex data sets. Replicates are essential for ensuring the reliability, robustness, and statistical validity of an experiments conclusions. Without replicates, it’s virtually impossible to distinguish true biological signals from random noise or experimental artifacts (Table 3). Not only is the majority of information (up to 90%) lost, the reliability of the experiment suffers tremendously. While technical replicates capture variabilities in the workflow, including changes to the environment, equipment or reagents used, biological replicates additionally capture the natural variance between individual biological samples.

Reducing the number of replicates in the experiment in silico resulted in more false positives and other gene sets reported to be differentially regulated. When analyzing the top 15 hits for for gene sets enriched in the data set containing three replicates, 4 of these are not found at all when only one replicate is used. In addition, four other gene sets significantly drop in rank below 50th place – and would also not be considered as likely hits in an experiment (Table 3). Hence, the absence or presence of replicates can completely change the outcome, interpretation and biological relevance of an NGS experiment.

Gene setRank – 3 Replicates per conditionRank – 1 Replicates per condition
PATHWAY_REACTOME:Response Of EIF2AK1 (HRI) To Heme Deficiency11
GO_BP:negative regulation of circadian rhythm27
PATHWAY_WIKI:Photodynamic therapy-induced unfolded protein response32
PATHWAY_WIKI:Cholesterol biosynthesis pathway4
GO_BP:negative regulation of acute inflammatory response511
PATHWAY_WIKI:Matrix metalloproteinases6113
GO_BP:myeloid dendritic cell chemotaxis7
GO_BP:prostaglandin secretion853
PATHWAY_BIOPLANET:PERK-regulated gene expression93
PATHWAY_REACTOME:ATF4 Activates Genes In Response To Endoplasmic Reticulum Stress104
GO_BP:eosinophil migration11152
GO_BP:leukocyte aggregation12
PATHWAY_REACTOME:PERK Regulates Gene Expression135
GO_BP:negative regulation of immune effector process14
GO_BP:eosinophil chemotaxis15201

Lack of biological replicates leads to missing or severely downgraded detection of roughly 47% of the top upregulated gene sets detected by the three replicate analysis, highlighting the risk of false negatives or loss of statistical significance when experiment replication is insufficient.

Summary

This blog is also available as a recorded webinar!

References

Ma, F., Fuqua, B.K., Hasin, Y. et al. (2019) A comparison between whole transcript and 3’ RNA sequencing methods using Kapa and Lexogen library preparation methods. BMC Genomics 20. DOI: 10.1186/s12864-018-5393-3

Moll, P., Ante, M., Seitz, A. and Reda, T. (2014) QuantSeq 3′ mRNA sequencing for RNA quantification. Nat Methods 11, i–iii (2014). DOI: 10.1038/nmeth.f.376

Written by Dr. Yvonne Göpel

Share This Story, Choose Your Platform!

Related Posts

Cart

Product

Product Short Description
PRICE

Product

Product Short Description
PRICE

Product

Product Short Description
PRICE

My Account

Login

Register