Modeling Object Dissimilarity for Deep Saliency Prediction

Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang, Seungryong Kim, Mathieu Salzmann, Sabine Süsstrunk

Research output: Contribution to journalArticlepeer-review

2   Link opens in a new tab Citations (SciVal)

Abstract

Saliency prediction has made great strides over the past two decades, with current techniques modeling low-level information, such as color, intensity and size contrasts, and high-level ones, such as attention and gaze direction for entire objects. Despite this, these methods fail to account for the dissimilarity between objects, which affects human visual attention. In this paper, we introduce a detection-guided saliency prediction network that explicitly models the differences between multiple objects, such as their appearance and size dissimilarities. Our approach allows us to fuse our object dissimilarities with features extracted by any deep saliency prediction network. As evidenced by our experiments, this consistently boosts the accuracy of the baseline networks, enabling us to outperform the state-of-the-art models on three saliency benchmarks, namely SALICON, MIT300 and CAT2000. Our project page is at https://github.com/IVRL/DisSal.

Original languageEnglish
JournalTransactions on Machine Learning Research
Volume2022-November
Early online date18 Nov 2022
Publication statusPublished - 18 Nov 2022

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Modeling Object Dissimilarity for Deep Saliency Prediction'. Together they form a unique fingerprint.

Cite this