This website requires JavaScript.
DOI: 10.1016/j.jvcir.2022.103629

Semantic guided knowledge graph for large-scale zero-shot learning

Jiwei WeiHaotian SunYang YangXing XuJingjing LiHeng Tao Shen
Journal of Visual Communication and Image Representation
Sep 2022
Zero-shot learning has received growing attention, which aims to improve generalization to unseen concepts. The key challenge in zero-shot tasks is to precisely model the relationship between seen and unseen classes. Most existing zero-shot learning methods capture inter-class relationships via a shared embedding space, leading to inadequate use of relationships and poor performance. Recently, knowledge graph-based methods have emerged as a new trend of zero-shot learning. These methods use a knowledge graph to accurately model the inter-class relationships. However, the currently dominant method for zero-shot learning directly extracts the fixed connection from off-the-shelf WordNet, which will inherit the inherent noise in WordNet. In this paper, we propose a novel method that adopts class-level semantic information as a guidance to construct a new semantic guided knowledge graph (SG-KG) , which can correct the errors in the existing knowledge graph and accurately model the inter-class relationships. Specifically, our method includes two main steps: noise suppression and semantic enhancement. Noise suppression is used to eliminate noise edges in the knowledge graph, and semantic enhancement is used to connect two classes with strong relations. To promote high efficient information propagation among classes, we develop a novel multi-granularity fusion network (MGFN) that integrates discriminative information from multiple GCN branches. Extensive experiments on the large-scale ImageNet-21K dataset and AWA2 dataset demonstrate that our method consistently surpasses existing methods and achieves a new state-of-the-art result. • Semantic guided knowledge graph that models the relationship between classes. • Modify connections in WordNet by integrating the relationships of word embeddings. • A novel multi-granularity fusion network for efficient zero-shot recognition. • Experimental results show that our MGFN achieves a new state-of-the-art result.