06. Shotgun Sequencing

Shotgun sequencing is a type of de novo sequencing, meaning it can assemble an entire genome that has not yet been sequenced before.

Shotgun sequence is used to analyze DNA sequences longer than 1000 base pairs, up to entire chromosomes. The basic methodology is to break up multiple sequences of the same genome in various places, and reassemble them based on overlapping regions.


  1. Genomic DNA is fragmented by sonification or hydrodynamic shearing.
  2. All sticky-end fragments are blunt ended with T4 DNA polymerase and exonuclease activity.
  3. T4 polynucleotide kinase is added so that 5' ends are phosphorylated.
  4. Fragments seaprated into either small (~1kb), medium (~8kb) and large (~40kb) fragments.
  5. A library is created per each size in plasmids and transformed into E. coli cells.
  6. Vector DNA is purified from each library and amplified.
  7. Each DNA strand is sequenced (can attach a primer upstream of our vector, then use any sequencing by synthesis method).
  8. Computer program called a base caller filters out poor calls.
  9. The assembler finds overlapping segments and generates long successive continguous stretches of nucleotides, called contigs.
Assembler aligning reads into contigs.
An assembler aligning reads to a contig. A contig that is said to occur by chance is known as a false contig.


Statistically speaking, there are chances of false contigs coming up. This occurs when the assembler finds overlapping segments that occurred by chance. This may be corrected by paired-ends or mate-pairs sequencing.

Additionally, transfecting bacteria cells can take a long time.

Become a Bioinformatics Whiz!

Bioinformatics Data Skills

Become a Bioinformatics Whiz! Try Bioinformatics

Learn the best practices used by academic and industry professionals. Bioinformatics Data Skills give a great overview to the Linux Command Line, Github, and other essential tools used in the trade. This book bridges the gap between knowing a few programming languages and being able to utilize the tools to analyze large amounts of biological data.

$ Check price
49.9949.99Amazon 4.5 logo(7+ reviews)

More Bioinformatics resources

Take your Linux skills to the next level!

Linux for Beginners

Take your Linux skills to the next level! Try Linux & UNIX

Linux for Beginners doesn't make any assumptions about your background or knowledge of Linux. You need no prior knowledge to benefit from this book. You will be guided step by step using a logical and systematic approach. As new concepts, commands, or jargon are encountered they are explained in plain language, making it easy for anyone to understand.

$ Check price
24.9924.99Amazon 4.5 logo(101+ reviews)

More Linux & UNIX resources