Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/117892
| Title: | A data driven approach to urban area delineation using multi source geospatial data | Authors: | Fang, C Zhou, L Gu, X Liu, X Werner, M |
Issue Date: | 2025 | Source: | Scientific reports, 2025, v. 15, 8708 | Abstract: | This study introduces a data-driven, bottom-up approach to urban delineation, integrating feature engineering with the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm, which represents a significant improvement in precision and methodology compared to traditional approaches that rely on simplistic OpenStreetMap (OSM) road node data aggregations. By employing a broad array of OSM categories and refining data selection through feature engineering, our research significantly enhances the precision and relevance of urban clustering. Using Bavaria, Germany, as a case study, we demonstrate that feature engineering effectively reduces noise and mitigates common DBSCAN clustering pitfalls by filtering out irrelevant and autocorrelated data. The robustness of the proposed method is validated through a comprehensive assessment involving three key elements: (1) a 5% improvement in average accuracy, (2) optimal clustering selections based on entropy values that eliminate the need for prior knowledge, and (3) validation through nighttime light data and Zipf’s law, where a high p-value of 0.99 confirms a good fit, supporting the power law. This study contributes to urban studies by providing a scalable, replicable model that incorporates advanced data processing techniques and multidimensional data sources, supporting improved urban planning and policy-making while effectively delineating urban areas in varied settings. | Keywords: | Data-Driven City DBSCAN Feature Engineering (FE) OpenStreetMap |
Publisher: | Nature Publishing Group | Journal: | Scientific reports | EISSN: | 2045-2322 | DOI: | 10.1038/s41598-025-93366-x | Rights: | Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. © The Author(s) 2025 The following publication Fang, C., Zhou, L., Gu, X. et al. A data driven approach to urban area delineation using multi source geospatial data. Sci Rep 15, 8708 (2025) is available at https://doi.org/10.1038/s41598-025-93366-x. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| s41598-025-93366-x.pdf | 2.71 MB | Adobe PDF | View/Open |
SCOPUSTM
Citations
9
Citations as of May 8, 2026
WEB OF SCIENCETM
Citations
6
Citations as of Apr 23, 2026
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



