Get Help Sign In
ProcessingProcessing
Icons Library_Sky_Circle_100x100_NGS

Whole Genome Sequencing

Overview

The entire DNA sequence of a living organism is that organism’s genome. The human genome is ~3 billion base pairs long and contains all the instructions necessary for forming a living, breathing person. Whole-genome sequencing determines the precise order of every single base pair in a genome, providing comprehensive information about both protein-coding regions and non-coding regions that may have other functions. Next-generation sequencing methods have made whole genome sequencing faster, cheaper, and more powerful than ever.

How do you sequence an entire genome?

The first genome to be fully sequenced was the genome of the bacterium that causes bacterial influenza, Haemophilus influenzae. This bacterial genome was sequenced in 1995, using shotgun sequencing, which breaks the genome into small DNA pieces that are cloned into bacteria for growth, isolation, and sequencing. Sequences are then reassembled into the full genome using bioinformatics tools [1]. When the Human Genome Project attempted to sequence the first human genome in 2001, shotgun sequencing was also used, but, since the human genome is so much larger than a bacterial genome, the entire human genome could not be fully sequenced. Although faster than other sequencing methods at the time, it still took over ten years to fully sequence the human genome using shotgun sequencing.

Next generation sequencing (NGS) is a much faster sequencing approach that does not require cloning DNA fragments. Instead, DNA is extracted from an organism’s tissue and fragmented, then sequencing libraries are created by adding adapters that are later recognized by the sequencing platform. The libraries are then loaded onto the sequencer which uses a platform-specific technology to detect nucleotides one by one. Using NGS, a human genome can be sequenced in just a single day [2].

What is the difference between WGS and NGS?

WGS is an approach to sequencing the entire DNA sequence of an organism, while NGS is one of several sequencing technologies (as described below):

  • Sanger sequencing: also known as chain termination or Dideoxy sequencing, Sanger sequencing requires four different PCR reactions (one per A, T, C, G) that are then heat denatured and separated by gel electrophoresis; bands on the gel are visualized and ordered to determine the DNA sequence
  • Shotgun sequencing: randomly shears DNA sequences into smaller fragments and computationally assembles them by looking for overlaps to piece the whole sequence back together
  • NGS: also known as massively parallel sequencing, NGS is a second-generation, high-throughput sequencing technology; single DNA strands are adhered to a solid surface and as the second strand is built, a fluorescent signal (e.g., Illumina platforms) or a pH measurement (Ion Torrent platforms) indicates the sequence of DNA bases
  • Nanopore sequencing: a third-generation sequencing technology that uses protein nanopores to determine DNA sequences; when DNA passes through the nanopore, a current change occurs, with each base associated with a slightly different change

Whole Genome Sequencing workflow

Whole genome sequencing can be performed on as little as 1 ng of DNA that has been extracted from a target tissue or cell samples. Libraries are then prepared using one of several available library preparation kits and adapters which fragment the DNA then add adapters to the ends of the resulting pieces. Most sequencing libraries will contain indexing barcodes―short, fixed DNA sequences―unique to each sample, enabling multiple samples to be sequenced at one time (i.e., multiplexing). These barcodes also allow sequencing libraries to be separated during the sequence analysis step. Currently, there are companies and research cores that offer a wide range of sequencing services. A variety of free bioinformatics tools are available to analyze your data as well.

Icons_Ocean_85x85_Extraction
Extraction

  • Beckman Coulter Life Sciences Genomics Reagents
Icons_Ocean_85x85_Sequence
Sequencing

  • Illumina
  • Pacific Biosciences
  • Oxford Nanopore Technologies
  • Ion Torrent
  • Element
  • MGI
  • Singular
  • Ultima

Advantages of Whole Genome Sequencing

Whole genome sequencing (WGS) provides the most comprehensive data about a given organism. NGS can deliver large amounts of data in a short amount of time. Profiling the entire genome facilitates discovery of novel genes and variants associated with disease, particularly those in non-coding areas of the genome. Although it can be more expensive and time-consuming than targeted sequencing approaches and technologies like microarrays, these key advantages to sequencing entire genomes with WGS may prove worth it:

  • High-resolution view of the entire genome, including coding and non-coding regions
  • Capturing large and small variants that might be missed with targeted approaches
  • Finds novel, potentially causative variants for follow-up studies
  • Delivers large amounts of data in a short period of time
  • Facilitates novel genome assembly
Icons_Ocean_85x85_Discovery-Research
Discovery
Icons_Ocean_85x85_Ease-of-use
Ease-of-use
Icons_Ocean_85x85_Scalability
Scalable

Applications of Whole Genome Sequencing

WGS is a powerful tool for variant discovery with several downstream applications including cancer research, genetic diseases research, epidemiology, and genotyping. WGS can be used not only to determine variant frequencies or how often a difference occurs within a population of organisms, but also to associate genetic variants with disease through genome-wide association studies (GWAS). As the price of WGS decreases, it is becoming more common to use it as a translational research tool. Having achieved the “$1000 genome,” multiple companies are pushing towards the next goal of the “$100 genome” [2-4].

Icons_Ocean_85x85_Safety Data Sheet

NGS 101 application guide

This detailed overview walks you through major advances in sequencing technology, types of next generation sequencing, their applications and more.

Whole Genome Sequencing (WGS) vs. Whole Exome Sequencing (WES)

Of the roughly 3 billion base pairs in the human genome, only about 1–2% are translated into functional proteins. The areas of the genome that encode functional proteins are called exons Sequencing only exons (whole exome sequencing; WES) is cheaper and faster than sequencing the entire genome and is more than a suitable approach for research groups that are only interested in protein-coding regions of the genome. The main differences between WGS and WES are:

  • Search depth. Whole genome sequencing will capture variation in any area of the genome, including non-coding areas. If there is a chance the disease(s) you’re studying are associated with such variation, WGS is a better option than WES.
  • Sequencing depth. Because a much smaller percentage of the genome is sequenced through WES, you will achieve deeper sequencing coverage of regions of interest. This is important for researchers requiring comprehensive coverage of SNVs and indels (common in population genetics, genetic diseases, and cancer genetics research).
  • Reagents. WGS requires more sequencing reagents than WES, but WES requires additional preparation reagents (probes) and additional protocol steps (hybridization).
  • Cost. WES is cheaper and faster than WGS.
  • Data analysis. WGS produces large datasets that are more complex to analyze than WES datasets. Such datasets often need sophisticated bioinformatics expertise.

Get started with Whole Genome Sequencing

Are you working in an area that would benefit from WGS? See how we can help you easily improve your workflows and results.

Explore whole genome sequencing resources

References

  1. Fleischmann RD, Adams MD, White O, et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269(5223):496-512.
  2. Welch JS, Westervelt P, Ding L, et al. Use of whole-genome sequencing to diagnose a cryptic fusion oncogene. JAMA. 2011;305(15):1577-1584.
  3. Lunshof JE, Bobe J, Aach J, et al. Personal genomes in progress: from the human genome project to the personal genome project. Dialogues Clin Neurosci. 2010;12(1):47-60.
  4. Hayden EC. Technology: The $1,000 genome. Nature. 2014;507(7492):294-295.
  5. Herper M (2017) ) Illumina promises to sequence human genome for $100—but not quite yet. Forbes. [Accessed Dec 30, 2019]
  6. McMorrow D (2010) The $100 genome: Implications for the DoD. The MITRE Corporation. Report number JSR-10-100.
RUO22-1174_001