VQA Benchmark Task

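A VQA Benchmark Task is a question answering (QA) benchmark task that evaluates systems on visual question answering (VQA), that is, answering natural-language questions about images.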

See: Vision Task, Language Task, Commonsense Task.

References

2015

http://visualqa.org/
- QUOTE: VQA is a new dataset containing open-ended questions about images. These questions require an understanding of vision, language and commonsense knowledge to answer.
  - 254,721 images (MSCOCO and abstract scenes)
  - 3 questions per image (764,163 total)
  - 10 ground truth answers per question
  - 3 plausible (but likely incorrect) answers per question
  - Open-ended and multiple-choice answering tasks.
  - Automatic evaluation metric.
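
The quote above mentions 10 ground truth answers per question and an automatic evaluation metric, but does not spell the metric out. A minimal sketch follows, assuming the consensus-style accuracy commonly reported for the open-ended VQA task, min(number of humans who gave that answer / 3, 1). The function name `vqa_accuracy`, the light string normalization, and the sample answers are illustrative assumptions, not the benchmark's official evaluation code, which applies further answer normalization and averaging.

```python
from collections import Counter
from typing import Sequence


def vqa_accuracy(predicted: str, human_answers: Sequence[str]) -> float:
    """Simplified consensus accuracy: min(#matching human answers / 3, 1)."""

    def normalize(text: str) -> str:
        # Assumption: lowercase and trim only; real evaluators normalize more.
        return text.strip().lower()

    counts = Counter(normalize(answer) for answer in human_answers)
    # Counter returns 0 for answers that no annotator gave.
    return min(counts[normalize(predicted)] / 3.0, 1.0)


# Hypothetical set of 10 ground-truth answers for a single question.
answers = ["yes"] * 6 + ["no"] * 2 + ["maybe"] * 2

print(vqa_accuracy("yes", answers))  # given by 6 annotators, capped at 1.0
print(vqa_accuracy("no", answers))   # given by 2 annotators, partial credit
print(vqa_accuracy("dog", answers))  # given by no annotator
```

Under this scheme an answer earns full credit once at least three annotators agree with it, which tolerates some disagreement among the 10 human answers.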
