User: Guest  Login

Title:

D3Net: A Speaker-Listener Architecture for Semi-supervised Dense Captioning and Visual Grounding in RGB-D Scans

Document type:
Zeitschriftenaufsatz
Author(s):
Chen, Dave Zhenyu; Wu, Qirui; Nießner, Matthias; Chang, Angel X.
Year:
2021
Fulltext / DOI:
doi:10.48550/ARXIV.2112.01551
Publisher:
arXiv
Date of publication:
01.01.2021
 BibTeX