06. Shotgun Sequencing

Shotgun sequencing is a type of de novo sequencing, meaning it can assemble an entire genome that has not been sequenced before, without relying on a reference sequence.

Shotgun sequencing is used to analyze DNA sequences longer than 1000 base pairs, up to entire chromosomes. The basic methodology is to break multiple copies of the same genome at random places and reassemble the fragments based on their overlapping regions.
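The fragmentation idea can be illustrated with a small simulation. This is only a sketch under simplifying assumptions: the `shear` function, its parameters, and the 200-base random "genome" are hypothetical, and break points are drawn uniformly at random, which real shearing only approximates.

```python
import random

def shear(genome: str, copies: int, mean_len: int, rng: random.Random) -> list[str]:
    """Cut `copies` copies of `genome` at random positions and return all fragments."""
    fragments = []
    for _ in range(copies):
        # Pick about len(genome) / mean_len distinct break points for this copy.
        n_cuts = max(1, len(genome) // mean_len)
        cuts = sorted(rng.sample(range(1, len(genome)), n_cuts))
        start = 0
        for cut in cuts + [len(genome)]:
            fragments.append(genome[start:cut])
            start = cut
    return fragments

rng = random.Random(0)  # fixed seed so the sketch is reproducible
genome = "".join(rng.choice("ACGT") for _ in range(200))  # toy genome (assumption)
fragments = shear(genome, copies=5, mean_len=40, rng=rng)
```

Because each copy is cut at different positions, fragments from different copies overlap one another, and those overlaps are what make reassembly possible.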


  1. Genomic DNA is fragmented by sonication or hydrodynamic shearing.
  2. All sticky-end fragments are blunt-ended using T4 DNA polymerase, whose polymerase and 3'→5' exonuclease activities fill in or trim the overhangs.
  3. T4 polynucleotide kinase is added so that the 5' ends are phosphorylated.
  4. Fragments are separated by size into small (~1 kb), medium (~8 kb), and large (~40 kb) fragments.
  5. A library is created for each size class in plasmids and transformed into E. coli cells.
  6. Vector DNA is purified from each library and amplified.
  7. Each DNA strand is sequenced (a primer can be attached to the vector upstream of the insert, and then any sequencing-by-synthesis method can be used).
  8. A computer program called a base caller assigns a base to each position and filters out poor-quality calls.
  9. The assembler finds overlapping segments and generates long contiguous stretches of nucleotides, called contigs.
Figure: An assembler aligning reads into a contig. A contig built from an overlap that occurred by chance is known as a false contig.
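The assembly step can be sketched as a greedy overlap merge. This is a minimal illustration, not how any particular assembler is implemented: the function names, the `min_len` threshold, and the toy reads are assumptions, and the sketch ignores sequencing errors, reverse-complement reads, and repeats.

```python
def overlap(a: str, b: str, min_len: int) -> int:
    """Length of the longest suffix of `a` equal to a prefix of `b` (0 if below min_len)."""
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0

def greedy_assemble(reads: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of sequences with the longest overlap."""
    unique = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    # Drop reads that are fully contained in another read.
    contigs = [r for r in unique if not any(r != o and r in o for o in unique)]
    while True:
        best_k, best_i, best_j = 0, -1, -1
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            return contigs  # no remaining overlaps of at least min_len
        merged = contigs[best_i] + contigs[best_j][best_k:]
        contigs = [c for idx, c in enumerate(contigs)
                   if idx not in (best_i, best_j)] + [merged]

reads = ["ATGCGT", "CGTACC", "ACCTGA"]  # toy reads (assumption)
contigs = greedy_assemble(reads)
```

With these toy reads, the shared `CGT` and `ACC` segments let the three reads collapse into a single contig. Lowering `min_len` makes merges easier but also makes chance overlaps more likely to be accepted.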


Statistically, some contigs will be false. This happens when the assembler joins reads whose overlapping segments match by chance rather than because they come from the same region of the genome. False contigs can be detected and corrected with paired-end or mate-pair sequencing.
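One way paired reads can expose a false contig is by checking that both mates land in the contig at roughly the expected distance from each other. The sketch below assumes a hypothetical `pair_is_consistent` helper, made-up sequences, and a known insert size. It ignores strand orientation and repeated sequences, both of which a real pipeline would have to handle.

```python
def pair_is_consistent(contig: str, left: str, right: str,
                       insert_size: int, tolerance: int) -> bool:
    """True if both mates occur in the contig, in order, about insert_size apart."""
    i = contig.find(left)
    if i == -1:
        return False
    j = contig.find(right, i + 1)
    if j == -1:
        return False
    span = (j + len(right)) - i  # distance from start of left mate to end of right mate
    return abs(span - insert_size) <= tolerance

# Toy data (assumption): mates sequenced from the two ends of a 20-base fragment.
left_mate, right_mate = "ATGCA", "ACGTC"
good_contig = "ATGCAGTTCAGGCTAACGTC"  # mates span 20 bases, as expected
false_contig = "ATGCAGTACGTC"         # chance join puts the mates only 12 bases apart

good_ok = pair_is_consistent(good_contig, left_mate, right_mate, 20, 2)
false_ok = pair_is_consistent(false_contig, left_mate, right_mate, 20, 2)
```

A contig in which many pairs fail this check is a candidate misassembly that can be broken and reassembled.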

Additionally, transforming bacterial cells can take a long time.
