Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/117892
PIRA download icon_1.1View/Download Full Text
Title: A data driven approach to urban area delineation using multi source geospatial data
Authors: Fang, C
Zhou, L
Gu, X 
Liu, X
Werner, M
Issue Date: 2025
Source: Scientific reports, 2025, v. 15, 8708
Abstract: This study introduces a data-driven, bottom-up approach to urban delineation, integrating feature engineering with the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm, which represents a significant improvement in precision and methodology compared to traditional approaches that rely on simplistic OpenStreetMap (OSM) road node data aggregations. By employing a broad array of OSM categories and refining data selection through feature engineering, our research significantly enhances the precision and relevance of urban clustering. Using Bavaria, Germany, as a case study, we demonstrate that feature engineering effectively reduces noise and mitigates common DBSCAN clustering pitfalls by filtering out irrelevant and autocorrelated data. The robustness of the proposed method is validated through a comprehensive assessment involving three key elements: (1) a 5% improvement in average accuracy, (2) optimal clustering selections based on entropy values that eliminate the need for prior knowledge, and (3) validation through nighttime light data and Zipf’s law, where a high p-value of 0.99 confirms a good fit, supporting the power law. This study contributes to urban studies by providing a scalable, replicable model that incorporates advanced data processing techniques and multidimensional data sources, supporting improved urban planning and policy-making while effectively delineating urban areas in varied settings.
Keywords: Data-Driven City
DBSCAN
Feature Engineering (FE)
OpenStreetMap
Publisher: Nature Publishing Group
Journal: Scientific reports 
EISSN: 2045-2322
DOI: 10.1038/s41598-025-93366-x
Rights: Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
© The Author(s) 2025
The following publication Fang, C., Zhou, L., Gu, X. et al. A data driven approach to urban area delineation using multi source geospatial data. Sci Rep 15, 8708 (2025) is available at https://doi.org/10.1038/s41598-025-93366-x.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
s41598-025-93366-x.pdf2.71 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

SCOPUSTM   
Citations

9
Citations as of May 8, 2026

WEB OF SCIENCETM
Citations

6
Citations as of Apr 23, 2026

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.