2021 TowardsGeneralPurposeVisionSyst
- (Gupta et al., 2021) ⇒ Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, and Derek Hoiem. (2021). “Towards General Purpose Vision Systems.”
Subject Headings: Vision-Language System
Notes
Cited By
- Google Scholar: ~0 Citations.
Quotes
Abstract
A special purpose learning system assumes knowledge of admissible tasks at design time. Adapting such a system to unforeseen tasks requires architecture manipulation such as adding an output head for each new task or dataset. In this work, we propose a task-agnostic vision-language system that accepts an image and a natural language task description and outputs bounding boxes, confidences, and text. The system supports a wide range of vision tasks such as classification, localization, question answering, captioning, and more. We evaluate the system's ability to learn multiple skills simultaneously, to perform tasks with novel skill-concept combinations, and to learn new skills efficiently and without forgetting.
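The interface described in the abstract (image plus natural language task description in; bounding boxes, confidences, and text out) can be illustrated with a minimal sketch. This is not the authors' implementation: the names `TaskOutput`, `StubVisionSystem`, and `run`, the box coordinate convention, and the fixed stub answer are all hypothetical and chosen only to show that every task shares one signature, with no per-task output head.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical box convention: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class TaskOutput:
    """The three output types named in the abstract."""
    boxes: List[Box]
    confidences: List[float]
    text: str


class StubVisionSystem:
    """Placeholder standing in for a trained model (assumption, not the paper's code)."""

    def run(self, image: List[List[int]], task: str) -> TaskOutput:
        height, width = len(image), len(image[0])
        # The stub returns one box covering the whole image and echoes the task.
        return TaskOutput(
            boxes=[(0.0, 0.0, float(width), float(height))],
            confidences=[1.0],
            text=f"(stub answer for: {task})",
        )


system = StubVisionSystem()
image = [[0] * 4 for _ in range(3)]  # dummy 3x4 grayscale image

# Different tasks are expressed only through the task description string.
for task in ["What is this?", "Locate the dog.", "Describe the image."]:
    out = system.run(image, task)
    print(task, "->", out.boxes, out.confidences, out.text)
```

Under this sketch, a caller doing classification or question answering would read `text`, while a caller doing localization would read `boxes` and `confidences`; supporting a new task means supplying a new description string rather than changing the architecture.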
References
 | Author | title |
---|---|---|
2021 TowardsGeneralPurposeVisionSyst | Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, Derek Hoiem | Towards General Purpose Vision Systems |