Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/113613
Title: | Deep learning-based intraoperative video analysis for cataract surgery instrument identification | Authors: | Guo, H Chan, YH Law, NF |
Issue Date: | 2024 | Source: | 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Macau, Macao, 2024, p. 1-7, https://doi.org/10.1109/APSIPAASC63619.2025.10848777 | Abstract: | Surgical instrument detection and classification is a critical task for enhancing surgical procedures monitoring, assisting surgical operations, supporting medical education, and enabling the development of intelligent surgical systems. However, there are a few challenges in this domain. The foremost concern is the impact of varying background conditions. Additionally, class imbalance presents another challenge, potentially leading to biased classification results. To solve these challenges, this study proposes a deep learning-based system consisting of two key components: an attention region detection module and a ResNet50 classification model. The attention region detection employs an optical flow-based method to incorporate both temporal and spatial information from the surgical video so that critical attention regions covering surgical instruments are identified. Our experimental results show that the classification accuracy can be improved from 58.7% to 81.9% by using the attention region detection component. To deal with the challenge of class imbalance, we use focal loss and interleaved sampling strategy as solutions. Interleaved sampling uses both the spatial and temporal information of surgical videos to balance the number of samples across different instrument classes, through which some scarce surgical instrument classes are expanded, thus preventing biased learning of the model. And the validation accuracy on the balanced dataset achieves 87.1%. This study demonstrates the effectiveness of deep learning techniques in addressing challenges in cataract surgery video analysis. | Publisher: | Institute of Electrical and Electronics Engineers | ISBN: | 979-8-3503-6733-1 | DOI: | 10.1109/APSIPAASC63619.2025.10848777 | Description: | 2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 3-6 Dec. 2024, Macau, China |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Guo_Deep_Learning-based_Intraoperative.pdf | Pre-Published version | 2.63 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.