Visual Dialog

Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh and Dhruv Batra

CVPR 2017 (Spotlight) [Bibtex] [PDF] [Code]

Acknowledgements

We thank Harsh Agrawal and Jiasen Lu for help on the AMT data collection interface; Xiao Lin, Ramprasaath Selvaraju and Latha Pemula for model discussions; Marco Baroni, Antoine Bordes, Mike Lewis, and Marc'Aurelio Ranzato for helpful discussions. Finally, we are grateful to the developers of Torch for building an excellent framework. This work was funded in part by the NSF CAREER awards to Dhruv Batra and Devi Parikh, ONR YIP awards to Dhruv Batra and Devi Parikh, ONR Grant N00014-14-1-0679 to Dhruv Batra, a Sloan Fellowship to Devi Parikh, ARO YIP awards to Dhruv Batra and Devi Parikh, an Allen Distinguished Investigator award to Devi Parikh from the Paul G. Allen Family Foundation, ICTAS Junior Faculty awards to Dhruv Batra and Devi Parikh, Google Faculty Research Awards to Dhruv Batra and Devi Parikh, Amazon Academic Research Awards to Dhruv Batra and Devi Parikh, AWS in Education Research grant to Dhruv Batra and NVIDIA GPU donations to Dhruv Batra.

License

Visual Dialog annotations and this website are licensed under a Creative Commons Attribution 4.0 International License.

Images

Visual Dialog does not own the copyright of the images. Use of the images must abide by the COCO and Flickr Terms of Use. The users of the images accept full responsibility for the use of the dataset, including but not limited to the use of any copies of copyrighted images that they may create from the dataset.

What is Visual Dialog?

News

Improving Generative Visual Dialog by Answering Diverse Questions

Visual Coreference Resolution in Visual Dialog using Neural Module Networks

Evaluating Visual Conversational Agents via Cooperative Human-AI Games

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning

Visual Dialog

Acknowledgements

License

Images

Sponsors