Molecular Medicine Faculty Publications

DeepAdd: Protein Function Prediction from K-mer Embedding and Additional Features

Zhihua Du, Shenzhen University
Yufeng He, Shenzhen University
Jianqiang Li, Shenzhen University
Vladimir N. Uversky, University of South Florida

Document Type

Article

Publication Date

2020

Digital Object Identifier (DOI)

https://doi.org/10.1016/j.compbiolchem.2020.107379

Abstract

With the application of new high-throughput sequencing technologies, a large number of protein sequences are becoming available. Determining the functional characteristics of these proteins experimentally is expensive and time-consuming. Furthermore, at the organismal level, such experimental functional analyses can be conducted for only a few selected model organisms. Computational function prediction methods can be used to fill this gap. Protein functions are classified by the Gene Ontology (GO), which contains more than 40,000 classes in three domains: Molecular Function (MF), Biological Process (BP), and Cellular Component (CC). Additionally, because a protein can have many functions, function prediction is a multi-label, multi-class problem. We developed a new method to predict protein function from sequence. To this end, a natural language model was used to generate word embeddings of each sequence, features were learned from these embeddings by deep learning, and additional features were used to locate every protein. Our method uses the dependencies between GO classes as background information to construct a deep learning model. We evaluated our method using the standards established by the Critical Assessment of Functional Annotation (CAFA) and observed a noticeable improvement over several algorithms, such as FFPred, DeepGO, and GoFDR, as well as other methods compared on the CAFA3 datasets.
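
The abstract names two ingredients that are easy to picture in code: splitting a sequence into k-mer "words" that an embedding layer can consume, and using the dependencies between GO classes when building multi-label targets. The Python sketch below is only an illustration under stated assumptions, not the DeepAdd implementation. The choice k = 3, the reserved index 0 for unknown k-mers, the helper names, and the toy three-term hierarchy are all hypothetical and do not come from the paper.

```python
from itertools import product

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_kmer_vocab(k=3):
    """Map every k-mer over the standard amino acids to an index >= 1.

    Index 0 is reserved (an assumption of this sketch) for k-mers that
    contain a non-standard residue.
    """
    return {"".join(p): i + 1
            for i, p in enumerate(product(AMINO_ACIDS, repeat=k))}


def kmer_indices(sequence, vocab, k=3):
    """Slide a window of width k over the sequence and look up each k-mer."""
    return [vocab.get(sequence[i:i + k], 0)
            for i in range(len(sequence) - k + 1)]


def propagate_to_ancestors(terms, parents):
    """Close a set of GO terms under the parent relation.

    If a protein is annotated with a term, it is also annotated with
    every ancestor of that term. `parents` maps a term to its direct parents.
    """
    closed = set()
    stack = list(terms)
    while stack:
        term = stack.pop()
        if term not in closed:
            closed.add(term)
            stack.extend(parents.get(term, ()))
    return closed


def multi_hot(terms, term_order):
    """Encode a set of terms as a 0/1 vector, one position per GO class."""
    return [1 if t in terms else 0 for t in term_order]


if __name__ == "__main__":
    vocab = build_kmer_vocab(3)
    print(len(vocab))                    # number of distinct 3-mers
    print(kmer_indices("MKVLA", vocab))  # one index per overlapping 3-mer

    # Toy hierarchy with made-up term names, for illustration only.
    toy_parents = {"protein_binding": ["binding"], "binding": ["root"]}
    toy_order = ["root", "binding", "protein_binding", "catalysis"]
    labels = propagate_to_ancestors({"protein_binding"}, toy_parents)
    print(multi_hot(labels, toy_order))
```

In a setup like this, the index list would feed an embedding layer and the multi-hot vector would serve as the training target, so a prediction for a specific class stays consistent with its more general ancestors.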

Was this content written or created while at USF?

Yes

Citation / Publisher Attribution

Computational Biology and Chemistry, v. 89, art. 107379

Scholar Commons Citation

Du, Zhihua; He, Yufeng; Li, Jianqiang; and Uversky, Vladimir N., "DeepAdd: Protein Function Prediction from K-mer Embedding and Additional Features" (2020). Molecular Medicine Faculty Publications. 210.
https://digitalcommons.usf.edu/mme_facpub/210
