🧬

RNA-Seq Overview and Workflow

Jul 2, 2025

Overview

This webinar introduced RNA-Seq, highlighting its background, workflow, and applications in gene expression analysis using next-generation sequencing (NGS), followed by Q&A on common experimental and technical considerations.

Gene Expression Analysis Background

Gene expression can be studied at the DNA, RNA, or protein level.
Earlier methods included Northern blot (RNA detection), RT-qPCR (RNA quantification), and microarrays (simultaneous gene expression profiling).
Each earlier method required prior knowledge of target sequences and had limitations for discovery of novel genes.

Introduction to RNA-Seq and NGS

RNA-Seq uses NGS to quantify and discover all expressed genes in a sample, including novel transcripts.
Advantages of RNA-Seq: high throughput, single-nucleotide resolution, no need for prior sequence data.
The widespread adoption of RNA-Seq followed advances in NGS technologies, such as Illumina.

NGS Workflow and Key Considerations

Basic NGS workflow: input material (DNA/RNA) → fragmentation → adapter ligation → sequencing.
A “read” is the nucleotide sequence generated from a DNA or RNA fragment; can be single-end or paired-end.
Read length (e.g., 75, 150, 300 nucleotides) affects downstream analysis and transcriptome assembly.

RNA-Seq Experimental Design

Most cellular RNA is ribosomal (rRNA) or transfer RNA (tRNA); mRNA, typically the target, comprises only 2-3% of total RNA.
Enrichment methods: poly(A) enrichment (for eukaryotic mRNA) and rRNA depletion (for both eukaryotes and prokaryotes).
Project goals determine sequencing depth, read length, and single- vs paired-end reads.

RNA-Seq Workflow and Quality Control

Four main steps: library preparation, bridge PCR (cluster generation), sequencing by synthesis, and data analysis.
Quality control at each step: check RNA integrity (gel), DNA concentration (Qubit), fragment size (Bioanalyzer), and sequenceable library fraction (qPCR).
Avoid over- or under-clustering to ensure optimal sequencing results.

Data Analysis and Interpretation

Raw sequencing data is converted to FASTQ files containing sequence and quality info.
Reads must be aligned to a reference genome/transcriptome for downstream analysis.
Normalization methods (e.g., FPKM—Fragments Per Kilobase per Million mapped reads) adjust for varying gene lengths and sequencing depths.
Analytical outputs: heatmaps, principal component analysis, and functional annotation of differentially expressed genes.

Applications and Example Projects

ENCODE and modENCODE projects: mapped genomic regulatory elements.
Cancer Genome Atlas: used RNA-Seq to profile cancer transcriptomes.
RNA-Seq supports advances in personalized medicine for genetic diseases.

Q&A Highlights

Typical throughput from sample receipt to data: 4-6 weeks.
ABM offers bioinformatics services for analysis of user-provided sequencing data.
If samples fail QC, clients are contacted for resubmission or alternative solutions.
Separate processing (or double sample amount) is needed for both RNA-Seq and microRNA-Seq.

Key Terms & Definitions

RNA-Seq — sequencing technique to quantify and discover RNA transcripts in a sample.
Next-Generation Sequencing (NGS) — high-throughput sequencing technologies for DNA/RNA.
Read — sequence of nucleotides generated from a DNA/RNA fragment during sequencing.
FPKM — normalization metric: Fragments Per Kilobase of transcript per Million mapped reads.

Action Items / Next Steps

Expect webinar slides and answers to Q&A via email.
Visit ABM’s website for educational resources and technical support.
Watch for an invitation to the upcoming webinar on whole genome sequencing.

Full transcript