Download PDFOpen PDF in browser

Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

EasyChair Preprint no. 5758

5 pagesDate: June 7, 2021


This paper presents a novel pothole detection approach based on single-modal semantic segmentation. It first extracts visual features from input images using a convolutional neural network. A channel attention module then reweighs the channel features to enhance the consistency of different feature maps. Subsequently, we employ an atrous spatial pyramid pooling module (comprising of atrous convolutions in series, with progressive rates of dilation) to integrate the spatial context information. This helps better distinguish between potholes and undamaged road areas. Finally, the feature maps in the adjacent layers are fused using our proposed multi-scale feature fusion module. This further reduces the semantic gap between different feature channel layers. Extensive experiments were carried out on the Pothole-600 dataset to demonstrate the effectiveness of our proposed method. The quantitative comparisons suggest that our method achieves the state-of-the-art (SoTA) performance on both RGB images and transformed disparity images, outperforming three SoTA single-modal semantic segmentation networks.

Keyphrases: Convolutional Neural Network, feature fusion, pothole detection, single-modal semantic segmentation

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
  author = {Jiahe Fan and Mohammud J. Bocus and Brett Hosking and Rigen Wu and Yanan Liu and Sergey Vityazev and Rui Fan},
  title = {Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection},
  howpublished = {EasyChair Preprint no. 5758},

  year = {EasyChair, 2021}}
Download PDFOpen PDF in browser