Automatic Feature Selection in Markov State Models Using Genetic Algorithm

Qihua Chen, Jiangyan Feng, Shriyaa Mittal, and Diwakar Shukla

Volume 9, Issue 2 (December 2018), pp. 14–22

https://doi.org/10.22369/issn.2153-4136/9/2/2

PDF icon Download PDF

BibTeX
@article{jocse-9-2-2,
  author={Qihua Chen and Jiangyan Feng and Shriyaa Mittal and Diwakar Shukla},
  title={Automatic Feature Selection in Markov State Models Using Genetic Algorithm},
  journal={The Journal of Computational Science Education},
  year=2018,
  month=dec,
  volume=9,
  issue=2,
  pages={14--22},
  doi={https://doi.org/10.22369/issn.2153-4136/9/2/2}
}
Copied to clipboard!

Markov State Models (MSMs) are a powerful framework to reproduce the long-time conformational dynamics of biomolecules using a set of short Molecular Dynamics (MD) simulations. However, precise kinetics predictions of MSMs heavily rely on the features selected to describe the system. Despite the importance of feature selection for large system, determining an optimal set of features remains a difficult unsolved problem. Here, we introduce an automatic approach to optimize feature selection based on genetic algorithms (GA), which adaptively evolves the most fitted solution according to natural selection laws. The power of the GA-based method is illustrated on long atomistic folding simulations of four proteins, varying in length from 28 to 80 residues. Due to the diversity of tested proteins, we expect that our method will be extensible to other proteins and drive MSM building to a more objective protocol.