Computer Science > Computer Vision and Pattern Recognition

arXiv:1812.09903 (cs)
[Submitted on 24 Dec 2018 (v1), last revised 7 Oct 2019 (this version, v3)]

Title: Adaptive Confidence Smoothing for Generalized Zero-Shot Learning

Authors: Yuval Atzmon, Gal Chechik
Abstract: Generalized zero-shot learning (GZSL) is the problem of learning a classifier where some classes have training samples and others are learned from side information, such as semantic attributes or text descriptions, in a zero-shot learning (ZSL) fashion. Training a single model that operates in these two regimes simultaneously is challenging. Here we describe a probabilistic approach that breaks the model into three modular components and then combines them in a consistent way. Specifically, our model consists of three classifiers: a "gating" model that makes a soft decision about whether a sample is from a "seen" class, and two experts: a ZSL expert and an expert model for seen classes.
We address two main difficulties in this approach: how to provide an accurate estimate of the gating probability without any training samples for unseen classes, and how to use an expert's predictions when it observes samples outside its domain. The key insight behind our approach is to pass information between the three models to improve each one's accuracy, while maintaining the modular structure. We test our approach, adaptive confidence smoothing (COSMO), on four standard GZSL benchmark datasets and find that it largely outperforms state-of-the-art GZSL models. COSMO is also the first model that closes the gap and surpasses the performance of generative models for GZSL, even though it is a lightweight model that is much easier to train and tune.
Notably, COSMO offers a new view for developing zero-shot models. Thanks to COSMO's modular structure, instead of trying to perform well both on seen and on unseen classes, models can focus on accurate classification of unseen classes, and later consider seen class models.
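To make the modular structure concrete, the following is a minimal sketch of how a soft gate could combine the two experts via the law of total probability, p(y|x) = p(y|x, seen) p(seen|x) + p(y|x, unseen) p(unseen|x). It is an illustration under stated assumptions, not the authors' implementation: the function name `combine_experts`, the argument names, and the toy numbers are hypothetical, and the adaptive smoothing step that gives COSMO its name is not modeled here.

```python
import numpy as np

def combine_experts(p_seen_expert, p_unseen_expert, p_gate_seen):
    """Soft-gated mixture over the union of seen and unseen classes.

    p_seen_expert   : p(y | x, seen), a distribution over seen classes
    p_unseen_expert : p(y | x, unseen), a distribution over unseen classes
    p_gate_seen     : scalar p(seen | x) produced by the gating model

    Returns one vector with seen classes first, then unseen classes.
    (Hypothetical helper; the smoothing term from the paper is omitted.)
    """
    seen_part = p_gate_seen * np.asarray(p_seen_expert, dtype=float)
    unseen_part = (1.0 - p_gate_seen) * np.asarray(p_unseen_expert, dtype=float)
    return np.concatenate([seen_part, unseen_part])

# Toy inputs (made up for illustration): two seen and two unseen classes.
p_seen_expert = [0.7, 0.3]
p_unseen_expert = [0.2, 0.8]
p_gate_seen = 0.25  # the gate believes the sample is probably from an unseen class

p_joint = combine_experts(p_seen_expert, p_unseen_expert, p_gate_seen)
predicted_class = int(np.argmax(p_joint))
```

Because each expert outputs a normalized distribution and the gate weights sum to one, the combined vector is itself a distribution over all classes. With a hard gate (0 or 1) the mixture reduces to picking a single expert; a soft gate lets a confident expert still influence the result when the gate is uncertain.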
Comments: (1) Accepted to CVPR 2019. (2) Previous title was "Domain-Aware Generalized Zero-Shot Learning". (3) This arXiv version is the same as the final CVPR version, with the following modifications: (a) corrected typos found in Table 3; (b) updated "Related Work" with [52, 10, 20]; (c) added a paragraph to the abstract; (d) added a probabilistic explanation for the smoothing term
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1812.09903 [cs.CV]
  (or arXiv:1812.09903v3 [cs.CV] for this version)
  http://doi.org/10.48550/arXiv.1812.09903
arXiv-issued DOI via DataCite

Submission history

From: Yuval Atzmon
[v1] Mon, 24 Dec 2018 11:54:41 UTC (774 KB)
[v2] Mon, 13 May 2019 10:25:53 UTC (1,273 KB)
[v3] Mon, 7 Oct 2019 16:01:33 UTC (1,274 KB)