Based on water quality indicators, climate indicators, and wetland operation parameters, data from previous studies were collected to predict the effluent concentrations of ammonia nitrogen (NH4+-N), COD, sulfamethoxazole (SMX), and some heavy metals in constructed wetlands using three machine learning models. The results showed that the Random Forest model slightly outperformed XGBoost and LightGBM in overall performance, demonstrating more stable R2 and RMSE values. In particular, it achieved higher accuracy in predicting NH4+-N and SMX concentrations, with R2 values of 0.93, 0.89, and 0.87, respectively, for NH4+-N. In contrast, the models performed relatively weaker in COD predictions, with R2 values of 0.71, 0.61, and 0.64, respectively. By incorporating the SMOTE data augmentation technique, the prediction performance and accuracy of the models were significantly enhanced, especially for COD, where improvements ranged from 7.04% to 26.23%. This study combines scientific data analysis with machine learning algorithms, providing a feasible approach for practical engineering applications.
| 科 Family | 属数 Number of genus | 种数 Number of species | 占总种数比例 Percentage of total species (%) | 属 Genus | 种数 Number of species | 占总种数比例 Percentage of total species (%) |
|---|---|---|---|---|---|---|
| 鹅膏菌科Amanitaceae | 2 | 11 | 5.26 | 鹅膏菌属 Amanita | 10 | 4.78 |
| 小菇科 Mycenaceae | 2 | 12 | 5.74 | 丝盖伞属 Inocybe | 5 | 2.39 |
| 多孔菌科 Polyporaceae | 8 | 14 | 6.70 | 蜡蘑属 Laccaria | 5 | 2.39 |
| 红菇科 Russulaceae | 3 | 23 | 11.00 | 小皮伞属 Marasmius | 6 | 2.87 |
| 小菇属 Mycena | 11 | 5.26 | ||||
| 光柄菇属 Pluteus | 5 | 2.39 | ||||
| 红菇属 Russula | 17 | 8.13 | ||||
| 栓菌属 Trametes | 5 | 2.39 |