A Method for Improving the Accuracy and Efficiency of Bacteriophage Genome Annotation

Document Type

Article

Publication Date

7-10-2019

Publication Title

International Journal of Molecular Sciences

Volume

20

Issue

3391

First page number:

1

Last page number:

17

Abstract

Bacteriophages are the most numerous entities on Earth. The number of sequenced phage genomes is approximately 8000 and increasing rapidly. Sequencing of a genome is followed by annotation, where genes, start codons, and functions are putatively identified. The mainstays of phage genome annotation are auto-annotation programs such as Glimmer and GeneMark. Due to the relatively small size of phage genomes, many groups choose to manually curate auto-annotation results to increase accuracy. An additional benefit of manual curation of auto-annotated phage genomes is that the process is amenable to be performed by students, and has been shown to improve student recruitment to the sciences. However, despite its greater accuracy and pedagogical value, manual curation suffers from high labor cost, lack of standardization and a degree of subjectivity in decision making, and susceptibility to mistakes. Here, we present a method developed in our lab that is designed to produce accurate annotations while reducing subjectivity and providing a degree of standardization in decision-making. We show that our method produces genome annotations more accurate than auto-annotation programs while retaining the pedagogical benefits of manual genome curation.

Keywords

Bacteriophages; Genomics; Genome Annotation; SEA-PHAGES; Lambda

Disciplines

Genetics and Genomics | Life Sciences | Molecular Genetics

Language

English

UNLV article access

Search your library

Share

COinS