Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/108680
| Title: | Crop disease identification by fusing multiscale convolution and vision transformer |
| Authors: | Zhu, D; Tan, J; Wu, C; Yung, K; Ip, AWH |
| Issue Date: | Jul-2023 |
| Source: | Sensors, July 2023, v. 23, no. 13, 6015 |
| Abstract: | With the development of smart agriculture, deep learning is playing an increasingly important role in crop disease recognition. Existing crop disease recognition models are mainly based on convolutional neural networks (CNNs). Although traditional CNN models perform well at modeling local relationships, they have difficulty extracting global features. This study combines the strength of CNNs in extracting local disease information with that of the vision transformer in obtaining global receptive fields to design a hybrid model called MSCVT. The model incorporates a multiscale self-attention module, which combines multiscale convolution and self-attention mechanisms and enables the fusion of local and global features at both the shallow and deep levels of the model. In addition, the model uses inverted residual blocks in place of standard convolutions to keep the number of parameters low. To verify the validity and adaptability of MSCVT on crop disease data, experiments were conducted on the PlantVillage dataset and the Apple Leaf Pathology dataset, yielding recognition accuracies of 99.86% and 97.50%, respectively. Compared with other CNN models, the proposed model achieved advanced performance in both cases. The experimental results show that MSCVT achieves high recognition accuracy in crop disease recognition and adapts well to multidisease recognition and small-scale disease recognition. |
| Keywords: | Convolutional neural network; Crop disease recognition; Image classification; Self-attention mechanism; Vision transformer |
| Publisher: | MDPI AG |
| Journal: | Sensors |
| EISSN: | 1424-8220 |
| DOI: | 10.3390/s23136015 |
| Rights: | © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). The following publication Zhu D, Tan J, Wu C, Yung K, Ip AWH. Crop Disease Identification by Fusing Multiscale Convolution and Vision Transformer. Sensors. 2023; 23(13):6015 is available at https://doi.org/10.3390/s23136015. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| sensors-23-06015.pdf | | 5.2 MB | Adobe PDF | View/Open |
| Metric | Count | As of |
|---|---|---|
| Page views | 81 | Nov 10, 2025 |
| Downloads | 22 | Nov 10, 2025 |
| Scopus citations | 21 | Dec 19, 2025 |
| Web of Science citations | 7 | Dec 18, 2025 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



