Predictive modeling of photocatalytic NO removal is highly desirable for efficient air pollution abatement. However, great challenges remain in precisely predicting photocatalytic performance and understanding interactions of diverse features in the catalytic systems. Herein, a dataset of g-C3N4-based catalysts with 255 data points was collected from peer-reviewed publications and machine learning (ML) model was proposed to predict the NO removal rate. The result shows that the Gradient Boosting Decision Tree (GBDT) demonstrated the greatest prediction accuracy with R2 of 0.999 and 0.907 on the training and test data, respectively. The SHAP value and feature importance analysis revealed that the empirical categories for NO removal rate, in the order of importance, were catalyst characteristics > reaction process > preparation conditions. Moreover, the partial dependence plots broke the ML black box to further quantify the marginal contributions of the input features (e.g., doping ratio, flow rate, and pore volume) to the model output outcomes. This ML approach presents a pure data-driven, interpretable framework, which provides new insights into the influence of catalyst characteristics, reaction process, and preparation conditions on NO removal.
| 科 Family | 属数 Number of genus | 种数 Number of species | 占总种数比例 Percentage of total species (%) | 属 Genus | 种数 Number of species | 占总种数比例 Percentage of total species (%) |
|---|---|---|---|---|---|---|
| 鹅膏菌科Amanitaceae | 2 | 11 | 5.26 | 鹅膏菌属 Amanita | 10 | 4.78 |
| 小菇科 Mycenaceae | 2 | 12 | 5.74 | 丝盖伞属 Inocybe | 5 | 2.39 |
| 多孔菌科 Polyporaceae | 8 | 14 | 6.70 | 蜡蘑属 Laccaria | 5 | 2.39 |
| 红菇科 Russulaceae | 3 | 23 | 11.00 | 小皮伞属 Marasmius | 6 | 2.87 |
| 小菇属 Mycena | 11 | 5.26 | ||||
| 光柄菇属 Pluteus | 5 | 2.39 | ||||
| 红菇属 Russula | 17 | 8.13 | ||||
| 栓菌属 Trametes | 5 | 2.39 |