TY - JOUR
T1 - Functional classification of protein toxins as a basis for bioinformatic screening
AU - Negi, Surendra S.
AU - Schein, Catherine H.
AU - Ladics, Gregory S.
AU - Mirsky, Henry
AU - Chang, Peter
AU - Rascle, Jean Baptiste
AU - Kough, John
AU - Sterck, Lieven
AU - Papineni, Sabitha
AU - Jez, Joseph M.
AU - Pereira Mouriès, Lucilia
AU - Braun, Werner
N1 - Publisher Copyright:
© 2017 The Author(s).
PY - 2017/12/1
Y1 - 2017/12/1
N2 - Proteins are fundamental to life and exhibit a wide diversity of activities, some of which are toxic. Therefore, assessing whether a specific protein is safe for consumption in foods and feeds is critical. Simple BLAST searches may reveal homology to a known toxin, when in fact the protein may pose no real danger. Another challenge to answer this question is the lack of curated databases with a representative set of experimentally validated toxins. Here we have systematically analyzed over 10,000 manually curated toxin sequences using sequence clustering, network analysis, and protein domain classification. We also developed a functional sequence signature method to distinguish toxic from non-toxic proteins. The current database, combined with motif analysis, can be used by researchers and regulators in a hazard screening capacity to assess the potential of a protein to be toxic at early stages of development. Identifying key signatures of toxicity can also aid in redesigning proteins, so as to maintain their desirable functions while reducing the risk of potential health hazards.
AB - Proteins are fundamental to life and exhibit a wide diversity of activities, some of which are toxic. Therefore, assessing whether a specific protein is safe for consumption in foods and feeds is critical. Simple BLAST searches may reveal homology to a known toxin, when in fact the protein may pose no real danger. Another challenge to answer this question is the lack of curated databases with a representative set of experimentally validated toxins. Here we have systematically analyzed over 10,000 manually curated toxin sequences using sequence clustering, network analysis, and protein domain classification. We also developed a functional sequence signature method to distinguish toxic from non-toxic proteins. The current database, combined with motif analysis, can be used by researchers and regulators in a hazard screening capacity to assess the potential of a protein to be toxic at early stages of development. Identifying key signatures of toxicity can also aid in redesigning proteins, so as to maintain their desirable functions while reducing the risk of potential health hazards.
UR - http://www.scopus.com/inward/record.url?scp=85032179790&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85032179790&partnerID=8YFLogxK
U2 - 10.1038/s41598-017-13957-1
DO - 10.1038/s41598-017-13957-1
M3 - Article
C2 - 29066768
AN - SCOPUS:85032179790
SN - 2045-2322
VL - 7
JO - Scientific reports
JF - Scientific reports
IS - 1
M1 - 13940
ER -