ET-MSF: a model stacking framework to identify electron transport proteins

^*Correspondence: qingfengpan1074@163.com (Qingfeng Pan); viplxb163@163.com (Xiaobin Liu); wuxi_dyj@163.com (Yijie Ding)
Academic Editor: Graham Pawelec

Front. Biosci. (Landmark Ed) 2022, 27(1), 12; https://doi.org/10.31083/j.fbl2701012

Submitted: 18 October 2021 | Revised: 26 November 2021 | Accepted: 9 December 2021 | Published: 11 January 2022

(This article belongs to the Special Issue Computational biomarker detection and analysis)

This is an open access article under the CC BY 4.0 license.

Abstract

Introduction: The electron transport chain is closely related to cellular respiration and has been implicated in various human diseases. However, the traditional “wet” experimental method is time consuming. Therefore, it is key to identify electron transport proteins by computational methods. Many approaches have been proposed, but performance of them still has room for further improvement. Methodological issues: In our study, we propose a model stacking framework, which combines multiple base models. The protein features are extracted via PsePSSM from protein sequences. Features are fed into the base model including support vector machines (SVM), random forest (RF), XGBoost, etc. The results of base model are entered into logistic regression model for final process. Results: On the independent dataset, the accuracy and Matthew’s correlation coefficient (MCC) of proposed method are 95.70% and 0.8756, respectively. Furthermore, we show that the model stacking framework outperforms single machine learning classifiers statistically. Conclusion: Our models are better than most known strategies for identifying electron transport proteins. Our model can be used to more precisely identify electron transport proteins.

Keywords

Electron transport chain

Ensemble learning

Model stacking

Logistic regression

Transport protein

Figures

Fig. 1.

Previous article in this issue

Next article in this issue

Front. Biosci. (Landmark Ed) Print ISSN 2768-6701 Electronic ISSN 2768-6698