Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/108680
Title: Crop disease identification by fusing multiscale convolution and vision transformer
Authors: Zhu, D
Tan, J
Wu, C
Yung, K 
Ip, AWH
Issue Date: Jul-2023
Source: Sensors, July 2023, v. 23, no. 13, 6015
Abstract: With the development of smart agriculture, deep learning is playing an increasingly important role in crop disease recognition. Existing crop disease recognition models are mainly based on convolutional neural networks (CNNs). Although traditional CNN models excel at modeling local relationships, they struggle to extract global features. This study combines the strength of CNNs in extracting local disease information with the ability of the vision transformer to obtain global receptive fields, and designs a hybrid model called MSCVT. The model incorporates a multiscale self-attention module, which combines multiscale convolution and self-attention mechanisms and enables the fusion of local and global features at both the shallow and deep levels of the model. In addition, the model uses the inverted residual block in place of normal convolution to keep the number of parameters low. To verify the validity and adaptability of MSCVT on crop disease datasets, experiments were conducted on the PlantVillage dataset and the Apple Leaf Pathology dataset, yielding recognition accuracies of 99.86% and 97.50%, respectively. In comparison with other CNN models, the proposed model achieved advanced performance in both cases. The experimental results show that MSCVT obtains high recognition accuracy in crop disease recognition and adapts well to multidisease recognition and small-scale disease recognition.
Keywords: Convolutional neural network
Crop disease recognition
Image classification
Self-attention mechanism
Vision transformer
Publisher: MDPI AG
Journal: Sensors 
EISSN: 1424-8220
DOI: 10.3390/s23136015
Rights: © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
The following publication Zhu D, Tan J, Wu C, Yung K, Ip AWH. Crop Disease Identification by Fusing Multiscale Convolution and Vision Transformer. Sensors. 2023; 23(13):6015 is available at https://doi.org/10.3390/s23136015.
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: sensors-23-06015.pdf
Size: 5.2 MB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record

Page views: 81 (as of Nov 10, 2025)
Downloads: 22 (as of Nov 10, 2025)
Scopus citations: 21 (as of Dec 19, 2025)
Web of Science citations: 7 (as of Dec 18, 2025)
