Abstract:
|
Geographically Weighted Regression (GWR) has been developed to capture the strong effect of the local variations. For binary classification, GWR assumes linear relationship between log-odds of the response and the independent variables. Support Vector Machines (SVM) does not require a specific relationship between response and independent variables. Further, it can automatically account for the interactions in the data. However, SVM has never been applied to geographically correlated data. Therefore, we developed a method called Geographical Support Vector Machines (GSVM), which combines geographically related data with SVM. This approach works by creating separate SVM for each local context and weighting observations based on their distance to the local context. To test our method, we build a model to predict counties with increase in the urologists’ availability from 2010 to 2018. We used socioeconomic variables of each county as a predictive parameter. In this situation, our GSVM (AUC 0.80) model performs significantly (p< 0.05) better than SVM (0.70). In conclusion, we have developed GSVM, that can be used to classify geographically correlated data.
|