Classifying Images Regardless of Depictive Style

Project: Research council

Project Details

Description

Computer's today can recognise objects in photographs. This ability has formed the basis of many familiar applications such as Facebook tagging, Google Image Search, Google Goggles, and automated passport checking at UK borders. Yet a significant restriction remains: computers can only recognise objects in photographs. At least, their ability to recognise objects in drawings and paintings - in artwork of any kind - is strictly limited. If this limitation can be overcome then many more applications will become possible. One is a new way to search the web for images in which a drawing (say) is dragged from the desktop into a search bar, and paintings and photographs are given back to the user (at the moment a user gets the same sort of image back as was dragged into the search bar). Another is the automated production of catalogues for taxonomy - which is important to scientists faces with tens of thousands of microscopic creatures; species catalogues are hand-drawn right now so automation would be a significant advance for them. The output of the programme would also allow ordinary photographs to be converted to icons. This is not as dry as it sounds, but could help the visually impaired to gain access to photographic content. If photographs and drawings can be linked in the way this project has in mind, then objects in photographs could be turned into icons rendered by a set of raised pins. So there would be a symbol for car, say, not unlike that which might be drawn by a child - and in fact this is very close to the icons blind artists draw. This would allow the visually impaired to read photographs in newspapers, or in text books, and allow them to share the holiday snaps of family and friends. This proposal is about building the basic technology that underpins these applications, and quite possibly others too. Key to it is lifting the barrier that computers of today face - allowing them to recognise objects no matter how they are depicted.
StatusFinished
Effective start/end date24/06/1323/06/16

Funding

  • Engineering and Physical Sciences Research Council

Fingerprint

Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.
  • Detecting people in artwork with CNNs

    Westlake, N., Cai, H. & Hall, P., 18 Sept 2016, Computer Vision - ECCV 2016 Workshops. ECCV 2016.. Hua, G. & Jégou, H. (eds.). Springer Verlag, p. 825-841 17 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 9913 LNCS).

    Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

    49 Citations (SciVal)
  • Cross-depiction problem: Recognition and Synthesis of Photographs and Artwork

    Hall, P., Cai, H., Wu, Q. & Corradi, T., 2015, In: Computational Visual Media. 1, 2, p. 91-103

    Research output: Contribution to journalArticle

    Open Access
    File
    31 Citations (SciVal)
    208 Downloads (Pure)
  • Learning graphs to model visual objects across different depictive styles

    Wu, Q., Cai, H. & Hall, P., 22 Sept 2014, Computer Vision – ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part VII. Fleet , D., Pajdla , T., Schiele , B. & Tuytelaars , T. (eds.). Cham, Switzerland: Springer, p. 313-328 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 8695).

    Research output: Chapter or section in a book/report/conference proceedingChapter or section

    File
    17 Citations (SciVal)
    410 Downloads (Pure)