Projects per year
Abstract
We propose a novel problem revolving around two tasks: (i) given a scene, recommend objects to insert, and (ii) given an object category, retrieve suitable background scenes. A bounding box for the inserted object is predicted in both tasks, which helps downstream applications such as semiautomated advertising and video composition. The major challenge lies in the fact that the target object is neither present nor localized in the input, and furthermore, available datasets only provide scenes with existing objects. To tackle this problem, we build an unsupervised algorithm based on object-level contexts, which explicitly models the joint probability distribution of object categories and bounding boxes using a Gaussian mixture model. Experiments on our own annotated test set demonstrate that our system outperforms existing baselines on all sub-tasks, and does so using a unified framework. Future extensions and applications are suggested.
Original language | English |
---|---|
Pages (from-to) | 79-93 |
Number of pages | 15 |
Journal | Computational Visual Media |
Volume | 6 |
Issue number | 1 |
DOIs | |
Publication status | Published - 31 Mar 2020 |
Funding
We would like to thank all reviewers for their thoughtful comments, and we would like to thank Prof. Ralph Martin for his valuable suggestions on paper revision. This work was supported by the National Key Technology R&D Program (Project Number 2016YFB1001402), the National Natural Science Foundation of China (Project Numbers 61521002, 61772298), Research Grant of Beijing Higher Institution Engineering Research Center, and Tsinghua-Tencent Joint Laboratory for Internet Innovation Technology.
Keywords
- bounding box prediction
- image composition
- object recommendation
- object-level context
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Computer Graphics and Computer-Aided Design
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'What and where: A context-based recommendation system for object insertion'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Centre for the Analysis of Motion, Entertainment Research and Applications (CAMERA)
Cosker, D. (PI), Bilzon, J. (CoI), Campbell, N. (CoI), Cazzola, D. (CoI), Colyer, S. (CoI), Fincham Haines, T. (CoI), Hall, P. (CoI), Kim, K. I. (CoI), Lutteroth, C. (CoI), McGuigan, P. (CoI), O'Neill, E. (CoI), Richardt, C. (CoI), Salo, A. (CoI), Seminati, E. (CoI), Tabor, A. (CoI) & Yang, Y. (CoI)
Engineering and Physical Sciences Research Council
1/09/15 → 28/02/21
Project: Research council