GOOD: Exploring Geometric Cues for Detecting Objects in an Open World

Haiwen Huang, Andreas Geiger, Dan Zhang
Dec 2022
Abstract
We address the task of open-world class-agnostic object detection, i.e., detecting every object in an image by learning from a limited number of base object classes. State-of-the-art RGB-based models suffer from overfitting the training classes and often fail at detecting novel-looking objects. This is because RGB-based models primarily rely on appearance similarity to detect novel objects and are also prone to overfitting short-cut cues such as textures and discriminative parts. To address these shortcomings of RGB-based object detectors, we propose incorporating geometric cues such as depth and normals, predicted by general-purpose monocular estimators. Specifically, we use the geometric cues to train an object proposal network for pseudo-labeling unannotated novel objects in the training set. Our resulting Geometry-guided Open-world Object Detector (GOOD) significantly improves detection recall for novel object categories and already performs well with only a few training classes. Using a single "person" class for training on the COCO dataset, GOOD surpasses SOTA methods by 5.0% AR@100, a relative improvement of 24%.
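The pseudo-labeling step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that a proposal network trained on geometric cues has already produced scored boxes, and that a proposal counts as a candidate novel object when it overlaps little with every annotated base-class box. The function names `iou` and `select_pseudo_labels`, the `top_k` cutoff, and the `max_iou` threshold are all hypothetical choices made for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def select_pseudo_labels(
    proposals: List[Tuple[Box, float]],  # (box, objectness score)
    annotated: List[Box],                # ground-truth base-class boxes
    top_k: int = 3,                      # assumed cutoff, not from the paper
    max_iou: float = 0.5,                # assumed overlap threshold
) -> List[Box]:
    """Keep the highest-scoring proposals that do not overlap annotated boxes."""
    ranked = sorted(proposals, key=lambda p: p[1], reverse=True)
    kept: List[Box] = []
    for box, _score in ranked:
        if len(kept) >= top_k:
            break
        # A proposal that matches an annotated object is not a novel object.
        if all(iou(box, gt) < max_iou for gt in annotated):
            kept.append(box)
    return kept


annotated = [(0.0, 0.0, 10.0, 10.0)]
proposals = [
    ((0.0, 0.0, 10.0, 10.0), 0.9),    # duplicates the annotated box
    ((20.0, 20.0, 30.0, 30.0), 0.8),  # unannotated region
    ((1.0, 1.0, 10.0, 10.0), 0.7),    # heavy overlap with the annotated box
    ((40.0, 40.0, 50.0, 50.0), 0.6),  # unannotated region
]
pseudo = select_pseudo_labels(proposals, annotated, top_k=2)
```

In this toy input, the two proposals that coincide with the annotated box are discarded and the two boxes in unannotated regions are kept as pseudo-labels, which would then be added to the training set alongside the base-class annotations.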