↓ Skip to main content

PLOS

Artificial Neural Networks Trained to Detect Viral and Phage Structural Proteins

Overview of attention for article published in PLoS Computational Biology, August 2012
Altmetric Badge

Mentioned by

twitter
12 X users

Readers on

mendeley
163 Mendeley
citeulike
3 CiteULike
Title
Artificial Neural Networks Trained to Detect Viral and Phage Structural Proteins
Published in
PLoS Computational Biology, August 2012
DOI 10.1371/journal.pcbi.1002657
Pubmed ID
Authors

Victor Seguritan, Nelson Alves, Michael Arnoult, Amy Raymond, Don Lorimer, Alex B. Burgin, Peter Salamon, Anca M. Segall

Abstract

Phages play critical roles in the survival and pathogenicity of their hosts, via lysogenic conversion factors, and in nutrient redistribution, via cell lysis. Analyses of phage- and viral-encoded genes in environmental samples provide insights into the physiological impact of viruses on microbial communities and human health. However, phage ORFs are extremely diverse of which over 70% of them are dissimilar to any genes with annotated functions in GenBank. Better identification of viruses would also aid in better detection and diagnosis of disease, in vaccine development, and generally in better understanding the physiological potential of any environment. In contrast to enzymes, viral structural protein function can be much more challenging to detect from sequence data because of low sequence conservation, few known conserved catalytic sites or sequence domains, and relatively limited experimental data. We have designed a method of predicting phage structural protein sequences that uses Artificial Neural Networks (ANNs). First, we trained ANNs to classify viral structural proteins using amino acid frequency; these correctly classify a large fraction of test cases with a high degree of specificity and sensitivity. Subsequently, we added estimates of protein isoelectric points as a feature to ANNs that classify specialized families of proteins, namely major capsid and tail proteins. As expected, these more specialized ANNs are more accurate than the structural ANNs. To experimentally validate the ANN predictions, several ORFs with no significant similarities to known sequences that are ANN-predicted structural proteins were examined by transmission electron microscopy. Some of these self-assembled into structures strongly resembling virion structures. Thus, our ANNs are new tools for identifying phage and potential prophage structural proteins that are difficult or impossible to detect by other bioinformatic analysis. The networks will be valuable when sequence is available but in vitro propagation of the phage may not be practical or possible.

X Demographics

X Demographics

The data shown below were collected from the profiles of 12 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 163 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 8 5%
Netherlands 1 <1%
Israel 1 <1%
Chile 1 <1%
Iran, Islamic Republic of 1 <1%
Canada 1 <1%
Unknown 150 92%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 36 22%
Researcher 31 19%
Student > Master 21 13%
Student > Bachelor 14 9%
Other 11 7%
Other 27 17%
Unknown 23 14%
Readers by discipline Count As %
Agricultural and Biological Sciences 57 35%
Biochemistry, Genetics and Molecular Biology 25 15%
Computer Science 12 7%
Immunology and Microbiology 9 6%
Engineering 7 4%
Other 19 12%
Unknown 34 21%