In order to improve the safety condition of highways,26 320 highway traffic accident records in France from 2018 to 2022 were selected as the research object. Three representative algorithms were selected to impute missing values in the data,including the RF algorithm,the expectation-maximization (EM) algorithm,and the K-nearest neighbors (KNN) algorithm. The impact of different imputation algorithms on data stability was compared based on the changes in variable variance before and after imputation. The Apriori association rule algorithm was then applied to analyze the causes of highway accidents with different severity levels using the completed dataset. The results indicate that after missing value imputation,the RF algorithm demonstrates superior stability. Compared to the model trained on the original data,the accuracy is improved by 5.66%,the recall rate is increased by 9.22%,and the F1 score is enhanced by 9.91%. It is found that passenger vehicles are more likely to cause property damage accidents; motorcycles are prone to cause injury accidents on roads with lower speed limits and fatal accidents on roads with higher speed limits. The use of safety equipment is significantly related to the severity level of accidents.
| 科 Family | 属数 Number of genus | 种数 Number of species | 占总种数比例 Percentage of total species (%) | 属 Genus | 种数 Number of species | 占总种数比例 Percentage of total species (%) |
|---|---|---|---|---|---|---|
| 鹅膏菌科Amanitaceae | 2 | 11 | 5.26 | 鹅膏菌属 Amanita | 10 | 4.78 |
| 小菇科 Mycenaceae | 2 | 12 | 5.74 | 丝盖伞属 Inocybe | 5 | 2.39 |
| 多孔菌科 Polyporaceae | 8 | 14 | 6.70 | 蜡蘑属 Laccaria | 5 | 2.39 |
| 红菇科 Russulaceae | 3 | 23 | 11.00 | 小皮伞属 Marasmius | 6 | 2.87 |
| 小菇属 Mycena | 11 | 5.26 | ||||
| 光柄菇属 Pluteus | 5 | 2.39 | ||||
| 红菇属 Russula | 17 | 8.13 | ||||
| 栓菌属 Trametes | 5 | 2.39 |