D3Net: A Speaker-Listener Architecture for Semi-supervised Dense Captioning and Visual Grounding in RGB-D Scans

User: Guest

Title:: D3Net: A Speaker-Listener Architecture for Semi-supervised Dense Captioning and Visual Grounding in RGB-D Scans
Document type:: Zeitschriftenaufsatz
Author(s):: Chen, Dave Zhenyu; Wu, Qirui; Nießner, Matthias; Chang, Angel X.
Year:: 2021
Fulltext / DOI:: doi:10.48550/ARXIV.2112.01551
Publisher:: arXiv
Date of publication:: 01.01.2021
BibTeX