标题: Urban neighborhood socioeconomic status (SES) inference: A machine learning approach based on semantic and sentimental analysis of online housing advertisements
作者: Wang, LQ (Wang, Lingqi); He, SJ (He, Shenjing); Su, SL (Su, Shiliang); Li, Y (Li, Yu); Hu, LR (Hu, Lirong); Li, GE (Li, Guie)
来源出版物: HABITAT INTERNATIONAL 卷: 124 文献号: 102572 DOI: 10.1016/j.habitatint.2022.102572 出版年: JUN 2022
摘要: Understanding the dynamic distribution of residents' socioeconomic status (SES) across neighborhoods within cities is essential for urban planning and policy-making aligning to the Sustainable Development Goals 2030. Whereas the promise in explicitly linking geographical features to SES has been highlighted fairly clear in previous works, scholars hold an eclectic attitude in their outlook, given the absence of theoretical ground, the heavy reliance on nontransparent proprietary data sources and the relatively coarse resolution predictions. Drawing on a case study of Hangzhou metropolitan in China, this paper aims to address these problems by demonstrating a novel approach to neighborhood SES inference based on online housing advertisements. We first revisit the theoretical debates on the linkage between neighborhood SES and online housing advertisements. Then, the Naive Bayes classifier is employed to semantically identify the topics from online housing advertise-ments and the associated sentiments are quantified using the lexicon-based approach. Following that, seven commonly used machine learning algorithms are compared and utilized to infer the fine-grained neighborhood SES at residential quarters scale based on the housing attributes and extracted topics from online housing ad-vertisements. Results show that machine learning algorithms vary with predictive ability and the tree-based algorithms are much more powerful in inferring neighborhood SES. More specifically, we distinguish 8 reli-able features which not only present relative high importance estimated by all the machine learning algorithms but also exhibit great robustness in inferring neighborhood SES and show promising potential to being applied for unraveling social inequalities. We also observe noteworthy spatial heterogeneity in neighborhood SES across the research site. The demonstrated approach not only enables the policymakers to take stock of deprived neighborhoods in a timely manner, but also lays firm ground for framing contextualized strategies of urban governance. This study is among the first attempts to bridge the theoretical interpretation of housing attributes with the proxy indicator-based approach for fine-grained neighborhood SES measurement.
作者关键词: Neighborhood socioeconomic status; Area deprivation; Machine learning; Open data; Social inequalities; Online housing listings
地址: [Wang, Lingqi; Su, Shiliang; Li, Yu; Hu, Lirong] Wuhan Univ, Sch Resource & Environm Sci, Wuhan, Peoples R China.
[He, Shenjing] Univ Hong Kong, Dept Urban Planning & Design, Hong Kong, Peoples R China.
[He, Shenjing; Li, Guie] China Univ Min & Technol, Sch Publ Policy & Management, Xuzhou, Peoples R China.
[Su, Shiliang] 129 Luoyu Rd, Wuhan, Hubei, Peoples R China.
Univ Hong Kong, Social Infrastruct Equ & Wellbeing SIEW Lab, Hong Kong, Peoples R China.
通讯作者地址: Su, SL (通讯作者),129 Luoyu Rd, Wuhan, Hubei, Peoples R China.
电子邮件地址: shiliangsu@163.com
影响因子:5.369
版权所有 © bwin·必赢(中国)唯一官方网站
地址:湖北省武汉市珞喻路129号 邮编:430079
电话:027-68778381,68778284,68778296 传真:027-68778893 邮箱:sres@whu.edu.cn