Core Concepts
This work presents optimal machine learning models that reliably learn and predict the temporal degradation of power conversion efficiency in polymeric organic solar cells with a multilayer structure ITO/PEDOT:PSS/P3HT:PCBM/Al.
Abstract
The key highlights and insights from the content are:
The authors generated a database with 996 entries, including up to 7 variables regarding the manufacturing process and environmental conditions for over 180 days of 45 organic solar cell (OSC) devices.
They utilized an automated machine learning framework, ROBERT, to benchmark and hyper-optimize various regression models including Random Forests, Gradient Boosting, Neural Networks, and others.
The optimal machine learning models achieved high accuracy, with coefficient of determination (R2) widely exceeding 0.90 and root mean squared error (RMSE), sum of squared error (SSE), and mean absolute error (MAE) around 1% of the target power conversion efficiency (PCE) value.
The machine learning models outperformed classical Bayesian regression fitting based on non-linear least squares, which only performed sufficiently for univariate cases of single OSC devices.
The machine learning models demonstrated advantages in capturing multivariate relationships, learning from the entire dataset, and predicting the behavior of unseen OSC devices.
Analysis of variable importance using permutation feature importance and SHAP provided insights into the dependencies between the manufacturing and environmental variables and their implications for optimal OSC performance and stability.
Stats
The dataset includes up to 7 variables regarding the manufacturing process and environmental conditions for over 180 days of 45 organic solar cell devices.
Quotes
"The accuracy achieved reaches values of the coefficient determination (R2) widely exceeding 0.90, whereas the root mean squared error (RMSE), sum of squared error (SSE), and mean absolute error (MAE) À1% of the target value, the PCE."
"The machine learning models demonstrated advantages in capturing multivariate relationships, learning from the entire dataset, and predicting the behavior of unseen OSC devices."