related documents
- SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions Conference Proceeding
- Unconstrained Foreground Object Search Conference Proceeding
- VQA Therapy: Exploring Answer Differences by Visually Grounding Answers Conference Proceeding
- Why Does a Visual Question Have Different Answers? Conference Proceeding