05. BED format

BED is a tabs-delimited file format allows users to define how data lines of an annotation track are displayed.

If you're unfamiliar with an annotation track, they're simply the lines that are displayed on a genome browser.

UCSC Annotation Track
itemA and itemB are sample annotation tracks.

BED files can have up to 12 columns, but only three are required for the UCSC browser, Galaxy browser and bedtools. The number of columns must be consisted throughout each row of the file.

Let's look at all 12 BED fields, as explained by the UCSC Genome Browser Information section.

3 Required BED fields

The following 3 fields are required for all BED files.

Name of chromosome - chr5, chrX, chr2_random. or scaffold - scaffold10671
Starting position of chrom.
First base starts at 0.
Ending position.
This value does not get displayed. For example, the first 20 bases would have chromStart value of 0 to and chromEnd value of 20.

9 Optional BED fields

These 9 BED fields are optional.

Name of the BED line.
Score between 0 and 1000. If useScore is set to 1, the score will determine the level of gray that is displayed. A higher number equates to a darker shade.
Which strand - either '+' or '-'.
The position when the feature is drawn thickly (the start codon for gene display).
Ending position of where the feature is drawn thickly.
Determines the color of the data contained in the BED line. (255,0,0) for red.
Use the Color Picker to translate a color.
Number of blocks (exons) in the BED line.
Comma-separated list of block sizes.
Size of list should correspond to blockCount.
A comma-separated list of block starts.
Should be calculated relative to chromStart.
Size of list should correspond to blockCount.

Example custom track

UCSC Genome BED file display.
Here we can see the track annotations that are uploaded as a BED file is onto the UCSC Genome Browser.


BEDtools - Read the Docs

UCSC Genome Browser - BED format

Become a Bioinformatics Whiz!

Introduction to Bioinformatics Vol. 2

Become a Bioinformatics Whiz! Try Bioinformatics

This is Volume 2 of Bioinformatics Algorithms: An Active Learning Approach. This book presents students with a light-hearted and analogy-filled companion to the author's acclaimed course on Coursera. Each chapter begins with an interesting biological question that further evolves into more and more efficiently solutions of solving it.

$ Check price
49.9949.99Amazon 5 logo(5+ reviews)

More Bioinformatics resources

Take your Linux skills to the next level!

Command Line Kung Fu

Take your Linux skills to the next level! Try Linux & UNIX

Command Line Kung Fu is packed with dozens of tips and practical real-world examples. You won't find theoretical examples in this book. The examples demonstrate how to solve actual problems. The tactics are easy to find, too. Each chapter covers a specific topic and groups related tips and examples together.

$ Check price
14.9914.99Amazon 4.5 logo(27+ reviews)

More Linux & UNIX resources