Benchmarking Machine Learning Models on a Dielectric Constant Database for Bandgap Prediction
Mohammad Hadi Yazdani, Paulo S. Branicio, and Ken-ichi Nomura
Volume 15, Issue 1 (March 2024), pp. 10–12
https://doi.org/10.22369/issn.2153-4136/15/1/2
BibTeX
@article{jocse-15-1-2,
  author={Mohammad Hadi Yazdani and Paulo S. Branicio and Ken-ichi Nomura},
  title={Benchmarking Machine Learning Models on a Dielectric Constant Database for Bandgap Prediction},
  journal={The Journal of Computational Science Education},
  year=2024,
  month=mar,
  volume=15,
  issue=1,
  pages={10--12},
  doi={https://doi.org/10.22369/issn.2153-4136/15/1/2}
}
In this study, we benchmark several regression models on a database of dielectric constants. The database is first processed with the Matminer Python library to generate features and is then divided into training, validation, and testing subsets. We evaluate six models (Linear Regression, Random Forest, Gradient Boosting, XGBoost, Support Vector Regression, and a Feedforward Neural Network) on the task of predicting bandgap values. The results show that the tree-based ensemble models outperform Linear Regression and Support Vector Regression. A Feedforward Neural Network with two hidden layers achieves comparable performance in capturing the relationship between the Matminer-generated features and the bandgap target values.
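The workflow described in the abstract (featurize, split three ways, fit several regressors, compare errors) can be sketched with scikit-learn as follows. This is a minimal illustration under stated assumptions, not the authors' code: a synthetic feature matrix stands in for the Matminer-generated features, the 70/15/15 split, the mean-absolute-error metric, and all hyperparameters (including the 64-unit hidden layers) are arbitrary choices, and XGBoost is omitted so that the sketch depends only on NumPy and scikit-learn. The `benchmark` helper is a hypothetical name introduced here.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

# ASSUMPTION: synthetic data stands in for Matminer features and bandgap
# targets. The target is a nonlinear, non-negative function of the features.
rng = np.random.default_rng(0)
X = rng.normal(size=(600, 8))
y = np.abs(2.0 * X[:, 0] + np.sin(3.0 * X[:, 1]) + X[:, 2] * X[:, 3]
           + 0.1 * rng.normal(size=600))

# ASSUMPTION: 70/15/15 train/validation/test split (ratios are illustrative).
X_train, X_tmp, y_train, y_tmp = train_test_split(
    X, y, test_size=0.3, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(
    X_tmp, y_tmp, test_size=0.5, random_state=0)

# Scale-sensitive models (SVR, neural network) are wrapped with a scaler
# that is fitted on the training subset only.
models = {
    "Linear Regression": LinearRegression(),
    "Random Forest": RandomForestRegressor(n_estimators=100, random_state=0),
    "Gradient Boosting": GradientBoostingRegressor(random_state=0),
    "Support Vector Regression": make_pipeline(StandardScaler(), SVR()),
    # Feedforward network with two hidden layers; layer widths are assumed.
    "Feedforward Neural Network": make_pipeline(
        StandardScaler(),
        MLPRegressor(hidden_layer_sizes=(64, 64), max_iter=2000,
                     random_state=0),
    ),
}


def benchmark(models, X_train, y_train, X_val, y_val, X_test, y_test):
    """Fit each model on the training subset and report validation/test MAE."""
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = {
            "val_mae": mean_absolute_error(y_val, model.predict(X_val)),
            "test_mae": mean_absolute_error(y_test, model.predict(X_test)),
        }
    return scores


scores = benchmark(models, X_train, y_train, X_val, y_val, X_test, y_test)

# Rank models by validation error; the test error is reported but not used
# for ranking, so it remains an unbiased final check.
for name, s in sorted(scores.items(), key=lambda kv: kv[1]["val_mae"]):
    print(f"{name:28s} val MAE = {s['val_mae']:.3f}  test MAE = {s['test_mae']:.3f}")
```

In a real run, `X` would be replaced by the Matminer feature matrix and `y` by the bandgap column of the dielectric constant database. The validation subset is the natural place to tune hyperparameters, with the test subset held back until the final comparison.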