Modern air traffic management systems require efficient and accurate identification and classification of hazard-related text data to ensure flight safety. Air traffic control hazard data covers potential factors, conditions, or events that may adversely affect aviation safety. Existing text classification methods struggle with the diversity of data categories and with imbalance among classes. An enhanced ensemble model based on the Stacking framework and incorporating a dual-weighting mechanism is proposed to improve classification performance. A dual-protection strategy is used to categorize hazards and safety risks systematically. The method applies the term frequency-inverse document frequency (TF-IDF) algorithm to extract and vectorize features from preprocessed hazard texts. To address class imbalance, the synthetic minority over-sampling technique (SMOTE) and the adaptive synthetic sampling approach (ADASYN) are used to generate synthetic samples for minority classes. The Stacking ensemble is refined by dynamically weighting the base learners according to the F1 scores obtained from their cross-validation and by integrating a sensitivity assessment mechanism across the ensemble. Experimental results on the constructed dataset show that the ADASYN-enhanced ensemble model improves precision, recall, and F1 score by 0.9%, 1.1%, and 1.0%, respectively, while mitigating overfitting to majority classes. The proposed algorithm improves the classification of imbalanced hazard text categories and contributes to safety risk management in air traffic control.
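The F1-based weighting of base learners can be illustrated with a minimal sketch. The abstract does not give the exact weighting formula, so the code below assumes the simplest reading: each base learner's cross-validated F1 score is normalized into a weight, and the weights scale the learners' class-probability outputs before they are combined. The learner names, scores, and function names are hypothetical and chosen only for illustration; the sensitivity assessment mechanism is not modeled.

```python
from typing import Dict, List


def f1_weights(cv_f1: Dict[str, float]) -> Dict[str, float]:
    """Turn cross-validated F1 scores into weights that sum to 1.

    Assumption: plain normalization. The paper's actual scheme may differ.
    """
    total = sum(cv_f1.values())
    if total <= 0:
        raise ValueError("at least one base learner needs a positive F1 score")
    return {name: score / total for name, score in cv_f1.items()}


def weighted_probabilities(
    probas: Dict[str, List[float]], weights: Dict[str, float]
) -> List[float]:
    """Combine per-learner class probabilities using the given weights."""
    n_classes = len(next(iter(probas.values())))
    combined = [0.0] * n_classes
    for name, class_probs in probas.items():
        if len(class_probs) != n_classes:
            raise ValueError(f"learner {name!r} has a different number of classes")
        for k, p in enumerate(class_probs):
            combined[k] += weights[name] * p
    return combined


def predict(probas: Dict[str, List[float]], weights: Dict[str, float]) -> int:
    """Return the index of the class with the highest weighted probability."""
    combined = weighted_probabilities(probas, weights)
    return max(range(len(combined)), key=combined.__getitem__)


# Hypothetical cross-validated F1 scores for three base learners.
cv_f1 = {"svm": 0.8, "naive_bayes": 0.6, "random_forest": 0.6}
weights = f1_weights(cv_f1)

# Hypothetical class probabilities for one hazard text (two classes).
probas = {
    "svm": [0.2, 0.8],
    "naive_bayes": [0.6, 0.4],
    "random_forest": [0.7, 0.3],
}
combined = weighted_probabilities(probas, weights)
label = predict(probas, weights)
```

In a full Stacking setup, the weighted probabilities would typically serve as input features for a meta-learner rather than being reduced to a label directly; the direct `predict` step here only keeps the sketch short.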
| Family | Number of genera | Number of species | Percentage of total species (%) | Genus | Number of species | Percentage of total species (%) |
|---|---|---|---|---|---|---|
| Amanitaceae | 2 | 11 | 5.26 | Amanita | 10 | 4.78 |
| Mycenaceae | 2 | 12 | 5.74 | Inocybe | 5 | 2.39 |
| Polyporaceae | 8 | 14 | 6.70 | Laccaria | 5 | 2.39 |
| Russulaceae | 3 | 23 | 11.00 | Marasmius | 6 | 2.87 |
| | | | | Mycena | 11 | 5.26 |
| | | | | Pluteus | 5 | 2.39 |
| | | | | Russula | 17 | 8.13 |
| | | | | Trametes | 5 | 2.39 |