Cross-domain assessment of feature effectiveness in functional materials prediction via multiple machine larning models

dc.contributor.authorRogti, Maroua
dc.contributor.authorBenghia, Ali
dc.contributor.authorMechraoui, Bouchra Kaima
dc.date.accessioned2025-10-06T13:12:35Z
dc.date.available2025-10-06T13:12:35Z
dc.date.issued2025
dc.description.abstractThis study explores the impact of feature selection on the accuracy of band gap prediction across two distinct material datasets: nonlinear optical (NLO) compounds and perovskite-structured ABX₃ materials. Features were extracted from the Pymatgen library in Python. The NLO dataset includes 224 chemically diverse compounds collected from literature, while the perovskite dataset consists of 643 ABX₃ materials with X being O, S, Se, F, Cl, Br, or I, sourced from the OQMD database. Several machine learning models were employed, with the Random Forest Regressor (RFR) consistently achieving the best performance. For the ABX₃ dataset, RFR yielded a mean absolute error (MAE) of 0.4941 eV using the top 10 features and 0.5044 eV with all features; for the NLO dataset, the model achieved 0.3309 eV and 0.3238 eV, respectively. Dimensionality reduction through Principal Component Analysis (PCA) did not enhance the results compared to manually selected or full feature sets. These findings highlight the significance of proper feature engineering, especially when relying on descriptors generated by automated tools like Pymatgen, to improve the efficiency and reliability of machine learning applications in materials discovery.
dc.identifier.urihttps://dspace.lagh-univ.dz/handle/123456789/13665
dc.language.isoen
dc.publisherLaghouat : Université Amar Telidji - Département des sciences de la matière
dc.titleCross-domain assessment of feature effectiveness in functional materials prediction via multiple machine larning models
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MP 01-116.pdf
Size:
3.46 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: