3rd IEEE International Conference on Electronic Engineering (ICEEM-2023)
Protein classification based on graphical encoding and convolutional neural network
Paper ID : 1102-ICEEM2023 (R2)
Authors:
Heba Mohamed Wassfy *1, wafaa Mohamed shalaby2, Prof. Fathi Sayed2, Hanaa Torky2, moawad Dessouky ‎3, Said elkhamy4
1Faculty of Electronic Engineering, Menoufia University
2Department of Electronics and Electrical Communication, Menoufia University, Menouf, Egypt
3Electronics and Electrical ‎Communications Engineering ‎Department Faculty of Electronic Engineering, ‎Menoufia University
4Department of Electronics and Electrical Communication, Alexandria University, Alexandria, Egypt
Abstract:
Sequence signature methods to analyze protein sequences have lately attracted the attention of bioinformaticians and computational biologists. These methods are based on alignment-free approaches using Frequency Chaos Game Representation (FCGR). Indeed, there are many successful applications of these approaches such as DNA classification, sequence comparison, and phylogeny studies. Consequently, applying such methods to explore the properties of protein sequences opens a new research area in recent years. However, there is difficulty in dealing with proteins due to high space dimensionality and the physicochemical characteristics embedded in the 20 amino acids the protein sequence is made of. Here, we explore the effectiveness of applying different strategies for encoding amino acid (protein) sequences into images using FCGR for assigning these sequences into their corresponding families and subsequently predicting their function. Reverse translation and 20-flake representation of protein sequence FCGR strategies have been examined. A Convolutional Neural Network (CNN) has been implemented to classify the performed protein images. The results propose that the 20-flake strategy has improved the accuracy of the multi-class classification task compared to the reverse translation technique.
Keywords:
Protein function prediction, Chaos Game Representation, Deep Learning.
Status : Paper Accepted