Abstract
Mining geographic location of social media users is a crucial technology for realizing the mapping of cyberspace to geographical world, which can provide strong support for wide-ranging location-based services. As a typical approach, user geolocation methods based on relationships rely on the assumption of location homophily between users and their neighbors. However, these methods only utilize the geographic influence between pair-wise relationship, resulting in undesired geolocation performance. In this paper, a social media user geolocation method based on geographically compact social subgraphs (SMUG-GCS) is proposed. Firstly, we analyze the relationship pattern among users in geographic proximity, and find a phenomenon that users who are geographically close tend to have tightly social groups. Based on this finding, a subgraph partitioning algorithm is presented which integrates structure compactness and geographical credibility to identify a set of subgraphs, whose nodes are more tightly connected and geographically proximity. Finally, user locations are inferred using the propagation of user information only based on the geographically compact subgraph. Extensive experiments are conducted on three real-world social media datasets. The results show that, compared with 5 typical relationship-based methods, SMUG-GCS improves the geolocating accuracy while reducing storage costs, leading to a significant reduction in median error distance ranging from 26.7% to 82.9%, as well as decrease in storage requirements by up to 56.5%.
Get full access to this article
View all access options for this article.
