Clustering and pedestrian crashes prediction modelling: Amman case

This study analyses the spatial distribution of pedestrian casualties to define contributory factors and delineate the means for their prediction. Three years of crash data were collected along with other factors and analysed using kernel density estimation (KDE), spatial autocorrelation (Moran's I), cluster K-Means, spatial regression, and general linear regressions (GLM). Kernel density estimate defines a cluster of pedestrian deaths within 1250 meters. According to Moran's I, 17/22 attributes about casualties, road networks, demographics, and land use have positive values, indicating similar importance clustering. The spatial pattern of pedestrian casualties is random and insignificant and does not change with time. Casualties are negatively related to the surrounding attributes, indicating a tendency towards dispersion. A K-Means analysis of multiple variables revealed that when variables included in the clustering were higher, the variance explanation percentage was lower. In the multi-variable GLM assuming Poisson distribution, the road network length alone or with the house permits combined were the best predictors of casualties. Classic regressions were not significantly enhanced by spatial dimension, and none of the autoregressive coefficients were significant. The predictions from the Poisson-based GLM model are similar to the classic regressions.PMID:37357318 | DOI:10.1080/17457300.2023.2214900
Source: International Journal of Injury Control and Safety Promotion - Category: Accident Prevention Authors: Source Type: research