Timezone: »

Poster
Region Mutual Information Loss for Semantic Segmentation
Shuai Zhao · Yang Wang · Zheng Yang · Deng Cai

Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #82
Semantic segmentation is a fundamental problem in computer vision.
It is considered as a pixel-wise classification problem in practice,
and most segmentation models use a pixel-wise loss as their optimization criterion.
However, the pixel-wise loss ignores the dependencies between pixels in an image.
Several ways to exploit the relationship between pixels have been investigated,
\eg, conditional random fields (CRF) and pixel affinity based methods.
Nevertheless, these methods usually require additional model
branches, large extra memories, or more inference time.
In this paper, we develop a region mutual information (RMI) loss
to model the dependencies among pixels more simply and efficiently.
In contrast to the pixel-wise loss which treats the pixels as independent samples,
RMI uses one pixel and its neighbour pixels to represent this pixel.
Then for each pixel in an image,
we get a multi-dimensional point that encodes the relationship between pixels,
and the image is cast into a multi-dimensional distribution of
these high-dimensional points.
The prediction and ground truth thus can achieve high order consistency
through maximizing the mutual information (MI) between their multi-dimensional distributions.
Moreover, as the actual value of the MI is hard to calculate,
we derive a lower bound of the MI and maximize the lower bound to maximize the
real value of the MI.
RMI only requires a few extra computational resources in the training stage,
and there is no overhead during testing.
Experimental results demonstrate
that RMI can achieve substantial and consistent improvements in performance on PASCAL VOC 2012 and CamVid datasets.
The code is available at \url{https://github.com/ZJULearning/RMI}.
`