Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/91894
| Title: | Screening patents of ICT in construction using deep learning and NLP techniques | Authors: | Wu, H Shen, G Lin, X Li, M Zhang, B Li, CZ |
Issue Date: | 21-Sep-2020 | Source: | Engineering, construction and architectural management, 21 Sept. 2020, v. 27, no. 8, p. 1891-1912 | Abstract: | Purpose - This study proposes an approach to solve the fundamental problem in using query-based methods (i.e. searching engines and patent retrieval tools) to screen patents of information and communication technology in construction (ICTC). The fundamental problem is that ICTC incorporates various techniques and thus cannot be simply represented by man-made queries. To investigate this concern, this study develops a binary classifier by utilizing deep learning and NLP techniques to automatically identify whether a patent is relevant to ICTC, thus accurately screening a corpus of ICTC patents. Design/methodology/approach - This study employs NLP techniques to convert the textual data of patents into numerical vectors. Then, a supervised deep learning model is developed to learn the relations between the input vectors and outputs. Findings - The validation results indicate that (1) the proposed approach has a better performance in screening ICTC patents than traditional machine learning methods; (2) besides the United States Patent and Trademark Office (USPTO) that provides structured and well-written patents, the approach could also accurately screen patents form Derwent Innovations Index (DIX), in which patents are written in different genres. Practical - implications This study contributes a specific collection for ICTC patents, which is not provided by the patent offices. Social implications The proposed approach contributes an alternative manner in gathering a corpus of patents for domains like ICTC that neither exists as a searchable classification in patent offices, nor is accurately represented by man-made queries. Originality/value A deep learning model with two layers of neurons is developed to learn the non-linear relations between the input features and outputs providing better performance than traditional machine learning models. This study uses advanced NLP techniques lemmatization and part-of-speech POS to process textual data of ICTC patents. This study contributes specific collection for ICTC patents which is not provided by the patent offices. |
Keywords: | ICT in construction NLP Deep learning Information management |
Publisher: | Emerald Group Publishing Limited | Journal: | Engineering, construction and architectural management | ISSN: | 0969-9988 | EISSN: | 1365-232X | DOI: | 10.1108/ECAM-09-2019-0480 | Rights: | © 2020, Emerald Publishing Limited. This AAM is provided for your own personal use only. It may not be used for resale, reprinting, systematic distribution, emailing, or for any other commercial purpose without the permission of the publisher The following publication Wu, H., Shen, G., Lin, X., Li, M., Zhang, B. and Li, C.Z. (2020), "Screening patents of ICT in construction using deep learning and NLP techniques", Engineering, Construction and Architectural Management, Vol. 27 No. 8, pp. 1891-1912 is published by Emerald and is available at https://doi.org/10.1108/ECAM-09-2019-0480 |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Wu_Screening_Patents_ICT.pdf | Pre-Published version | 5.15 MB | Adobe PDF | View/Open |
Page views
143
Last Week
2
2
Last month
Citations as of Apr 14, 2025
Downloads
237
Citations as of Apr 14, 2025
SCOPUSTM
Citations
15
Citations as of Jun 21, 2024
WEB OF SCIENCETM
Citations
19
Citations as of Dec 18, 2025
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



