View: session overviewtalk overview
09:00 | 2D-3D Registration Method for X-Ray Image Using 3D Reconstruction based on Deep Neural Network ABSTRACT. This paper proposes a method for registering X-ray images with its 3D CT model by estimating 3D point clouds from X-ray images and their corresponding points on the image. Many conventional methods generate a simulated X-ray image from a 3D CT model and optimize the pose by using the similarity metrics be-tween the simulated X-ray and the input X-ray image. On the other hand, deep learning approaches that predict pose information need a canonical coordinate system defined manually on the pre-operative CT to properly utilize the estimated pose. Therefore, we devise a fully automatic registration pipeline that is independent of coordinate system by recovering 3D point clouds from X-ray images, estimating the corresponding points on the images, and aligning them with the given 3D CT model. |
09:20 | White Blood Cells Classification with YOLOv7: Single and Cascade Classification Aproaches for Images Segmented by CellaVision DM96 ABSTRACT. In this work, we study the use of YOLOv7 in the reclassification of blood cell images, segmented by CellaVisionTM DM96, into 11 classes, i.e., Band Neutrophil, Segmented Neutrophil, Basophil, Eosinophil, Erythroblast, Thrombocyte, Lymphocyte, Variant Lymphocyte, Metamyelocyte, Monocyte, and Myelocyte, in simple and cascade classification. The classification made by CellaVisionTM DM96 achieved an accuracy of 76.44%, simple classification achieved an accuracy of 94.22%, and cascade classification an accuracy of 94.44% for the same database. Both methods proved effective in increasing the performance and, mainly the cascade classification, reduced the rate of more relevant mistakes. |
09:40 | PRESENTER: Phuong Thao Nguyen ABSTRACT. Convolutional Neural Network (CNN) in medical image processing has lately received a lot of interest. Computer-aided polyp detection in gastrointestinal endoscopy has been the subject of research over the past few decades. However, despite significant advances, automatic polyp detection in real-time is still an unsolved problem. In this paper, we propose a Deep Learning method for reliable real-time polyp detection on endoscopic images and videos. We improve the performance of YOLOv8 model by modifying YOLOv8 model architecture with Ghost Convolution and Spatial and Channel Attention mechanisms (GhostAtt-YOLOv8). These techniques are integrated into the backbone network to enhance detection result. The proposed method is applied on Showa University and Nagoya University polyp database (SUN) dataset. Experimental results show that a better performance is archived with mAP@50 of 80.13% compared to the original YOLOv8, and FPS of our proposed model is 294, faster than original YOLOv8. |
10:00 | A Computer Aided Diagnosis System for Hallux Valgus ABSTRACT. Doctors have been required to record the numerical information for clarifying the basis of their diagnosis against cases of Hallux Valgus. So, they had to find landmarks in each patient’s X-ray image and calculate the Hallux Valgus (HV) angle. These works were very time-consuming and prevented doctors from seeing many patients. Kind of computer-aided diagnosis systems have been required. We propose a Segment-Anything-based method for X-ray measurement of HV angles. In the method, we first recognize the 1st proximal phalanx and the 1st metatarsal bone by using Segment Anything. Then, both longitudinal directions and their difference were calculated by applying a Principal Component Analysis. In a preliminary experiment using 14 radiographs, the HV angle obtained by the proposed method showed good agreement with the independent X-ray measurements by two doctors. |
14:35 | Jewelry Image-to-Image Translation with Consistency Regularization and Data Augmentations ABSTRACT. Image enhancement of jewelry is a difficult task because of the shape of the jewelry, its color, background elements such as shadows and glass stand, as well as the blurring of the boundary between the jewelry and the background and unique light reflections. Our preliminary results indicate that CycleGAN is effective in correcting jewelry images and that background elements in jewelry images adversely affect jewelry image correction. In this study, we propose a method to correct jewelry images with strong background elements. The results show that the target consistency of TC-ShadowGAN is effective in correcting images with strong background elements, and that the removal of background elements is further improved by introducing data augmentation with Balanced Consistency Regularization (BCR) and Dense Consistency Regularization (DCR). |
14:55 | Effect of visual saliency on emotions while viewing paintings PRESENTER: Yuma Sasaki ABSTRACT. On viewing a painting, emotions such as happiness and sadness are evoked. However, it is unclear whether these emotions are truly aroused or not, and what factors are responsible for the arousal of these emotions. It has been demonstrated that the pupil diameter changes while looking at emotional objects. In this study, using pupillary response as a physiological index of emotion, pupillary response and eye movements were measured when viewing paintings. We analyzed their relationship with the evaluation of the painting (valence, arousal, liking) to examine the influence of lower-order features of the painting on emotions. |
15:15 | Deformation Invariant Palmprint Recognition with Multiple-Resolution Feature Matching ABSTRACT. Palmprint recognition from high-resolution images involves a lot of ridge and minutiae-based features. A high-resolution image of palmprint involves many small ridges and these crossing points. To identify such features of palmprint images, we need to extract micro feature points of the images and find combinations of the feature points that are consistent to physical constraints. The palm, however, has a soft skin, and can deform, and, as a result, we cannot assume a linear transformation of the image. Our approach is loosely-coupled feature-pair matching using neural network. First, we analyze two palm images and extracts multiple-resolution features using convolutional neural network with skip connections. Second, we compare coarse-grained features and find matching parts roughly using graph neural network. Finally, we explore fine-grained features around the rough matching pairs, and determine the identity according to the number of matching pairs. All of these process is done by neural network. We evaluated the proposed scheme and showed a fine result for palmprint identification. |