GFF, or General Feature Format, is used to describe genes and other features of DNA, RNA and protein sequences. GFF files use the .gff extension.
GFF extends a basic record made up of name, start and end parameters (NSE). For example, the NSE (Chromosome2,2000,4000) specifies a two-kilobase segment of chromosome 2. GFF allows these segments to be annotated.
GFF allows users to perform common operations on features, such as intersection, exclusion, union, filtration, sorting, transformation and dereferencing.
Several types of bioinformatics software use GFF. These include viewers such as GBrowse, Jalview and IGB.
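As a rough illustration of three of these operations (intersection, filtration and sorting), here is a minimal Python sketch that works on plain NSE records. The `NSE` class, the `intersect` helper and the sample coordinates are hypothetical names chosen for this example, and coordinates are assumed to be inclusive on both ends.

```python
from typing import NamedTuple, Optional


class NSE(NamedTuple):
    """A bare name/start/end record (hypothetical helper for this sketch)."""
    name: str
    start: int
    end: int


def intersect(a: NSE, b: NSE) -> Optional[NSE]:
    """Return the overlapping segment of a and b, or None if they do not overlap.

    Assumes inclusive coordinates on both ends.
    """
    if a.name != b.name:
        return None
    start = max(a.start, b.start)
    end = min(a.end, b.end)
    if start > end:
        return None
    return NSE(a.name, start, end)


features = [
    NSE("Chromosome2", 2000, 4000),
    NSE("Chromosome1", 500, 900),
    NSE("Chromosome2", 3500, 5000),
]

# Intersection: the part shared by the two Chromosome2 segments.
overlap = intersect(features[0], features[2])

# Filtration: keep only the features on one sequence.
on_chr2 = [f for f in features if f.name == "Chromosome2"]

# Sorting: order by sequence name, then by start and end position.
ordered = sorted(features, key=lambda f: (f.name, f.start, f.end))
```

Dedicated tools perform the same operations far more efficiently on large files; the sketch is only meant to show what each operation means for a list of segments.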
There are several versions of GFF. The ones used today are GFF2, GTF and GFF3.
GFF2 (General Feature Format version 2) was limited in that it could only represent one level of feature nesting, so it could not express three-level hierarchies such as gene -> transcript -> exon. The Sequence Ontology and GMOD projects addressed this limitation with GFF3.
GTF (Gene Transfer Format) has also been known as GFF version 2.5, since it improves on version 2 but not as much as version 3 does.
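In GFF3, a hierarchy such as gene -> transcript -> exon is expressed through the ID and Parent attributes in the ninth column. The following Python sketch shows one way to recover the parent-to-child links from those attributes. The function names and the sample attribute strings are hypothetical, and the parsing ignores details such as URL-escaped characters.

```python
from collections import defaultdict


def parse_attributes(column9):
    """Parse a GFF3-style 'key=value;key=value' string into a dict."""
    attrs = {}
    for pair in column9.strip().split(";"):
        if pair:
            key, _, value = pair.partition("=")
            attrs[key] = value
    return attrs


def build_children(attribute_columns):
    """Map each parent ID to the list of IDs that name it as Parent."""
    children = defaultdict(list)
    for column9 in attribute_columns:
        attrs = parse_attributes(column9)
        if "ID" in attrs and "Parent" in attrs:
            # A feature may list several parents, separated by commas.
            for parent in attrs["Parent"].split(","):
                children[parent].append(attrs["ID"])
    return dict(children)


# Made-up attribute columns: one gene, two transcripts, one shared exon.
rows = [
    "ID=gene1",
    "ID=tx1;Parent=gene1",
    "ID=tx2;Parent=gene1",
    "ID=exon1;Parent=tx1,tx2",
]
children = build_children(rows)
```

Here `children["gene1"]` lists both transcripts, and the shared exon appears under each transcript that names it as a parent.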
GFF consists of one line per feature, each containing 9 columns of data. Columns are separated by tabs, making it a tab-delimited file.
Within the file, we can also include optional track definition lines. Each one goes immediately before the list of features it applies to.
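The nine-column layout can be read with a few lines of Python. This is a minimal sketch rather than a full parser: the column names follow the GFF3 convention (GFF2 and GTF label some columns differently), and `parse_gff_line` and the sample line are hypothetical.

```python
# Column names as used in GFF3; other versions name some of these differently.
COLUMNS = (
    "seqid", "source", "type", "start", "end",
    "score", "strand", "phase", "attributes",
)


def parse_gff_line(line):
    """Split one feature line into a dict keyed by column name."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 tab-separated columns, got {len(fields)}")
    record = dict(zip(COLUMNS, fields))
    # Start and end are integer coordinates; the other columns stay as text.
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


# A made-up feature line; "." marks a column with no value.
example = "Chromosome2\texample\tgene\t2000\t4000\t.\t+\t.\tID=gene1"
record = parse_gff_line(example)
print(record["seqid"], record["type"], record["start"], record["end"])
```

Rejecting lines that do not have exactly nine fields catches a common problem, which is a file where tabs were converted to spaces.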
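To illustrate how a track definition line governs the features that follow it, the Python sketch below groups feature lines under the most recent track line. It assumes the UCSC Genome Browser convention that a track definition line begins with the word `track`. The function name and the sample lines are hypothetical.

```python
def group_by_track(lines):
    """Group feature lines under the track definition line that precedes them."""
    groups = []
    current = {"track": None, "features": []}
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # Assumption: a track definition line starts with the word "track".
        if line.split(None, 1)[0] == "track":
            if current["track"] is not None or current["features"]:
                groups.append(current)
            current = {"track": line, "features": []}
        else:
            current["features"].append(line)
    if current["track"] is not None or current["features"]:
        groups.append(current)
    return groups


# Made-up input: two tracks, each followed by one feature line.
lines = [
    'track name=genes description="Example genes"',
    "Chromosome2\texample\tgene\t2000\t4000\t.\t+\t.\tgene1",
    'track name=repeats description="Example repeats"',
    "Chromosome2\texample\trepeat\t5000\t5200\t.\t-\t.\trep1",
]
groups = group_by_track(lines)
```

Features that appear before any track line end up in a group whose `track` value is `None`.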
Validators allow us to ensure that a file is formatted properly. To validate a GFF3 file, go to the GFF3 validator.
Wellcome Trust Sanger Institute. GFF: an exchange format for feature description.