Semantic Image Manipulation Using Scene Graphs

Dhamo, H.; Farshad, A.; Laina, I.; Navab, N.; Hager, G. D.; Tombari, F.; Rupprecht, C.

dhamo2020cvpr_1

Titel:: Semantic Image Manipulation Using Scene Graphs
Dokumenttyp:: Konferenzbeitrag
Autor(en):: Dhamo, H.; Farshad, A.; Laina, I.; Navab, N.; Hager, G. D.; Tombari, F.; Rupprecht, C.
Abstract:: Image manipulation can be considered a special case of image generation where the image to be produced is a modification of an existing image. Image generation and manipulation have been, for the most part, tasks that operate on raw pixels. However, the remarkable progress in learning rich image and object representations has opened the way for tasks such as text-to-image or layout-to-image generation that are mainly driven by semantics. In our work, we address the novel problem of image manipulation from scene graphs, in which a user can edit images by merely applying changes in the nodes or edges of a semantic graph that is generated from the image. Our goal is to encode image information in a given constellation and from there on generate new constellations, for example replacing and repositioning objects or even changing relationships between objects, while respecting the semantics and style from the original image. We introduce a spatio-semantic scene graph network that does not require direct supervision for constellation changes or image edits. This makes it possible to train the system from existing real-world data sets with no additional annotation effort. «
Image manipulation can be considered a special case of image generation where the image to be produced is a modification of an existing image. Image generation and manipulation have been, for the most part, tasks that operate on raw pixels. However, the remarkable progress in learning rich image and object representations has opened the way for tasks such as text-to-image or layout-to-image generation that are mainly driven by semantics. In our work, we address the novel problem of image manipul... »
Stichworte:: CAMP,CAMPComputerVision,ComputerVision,CVPR,CVPR2020,CNN,SceneGraphs,Deep Learning,deeplearning
Kongress- / Buchtitel:: Cvpr
Jahr:: 2020
BibTeX

Attachment-Browser öffnen...

Vorkommen:

mediaTUM Gesamtbestand Hochschulbibliographie 2020 Fakultäten Informatik Informatik 16 - Lehrstuhl für Anwendungen in der Medizin (Prof. Navab)

mediaTUM Gesamtbestand Einrichtungen Schools TUM School of Computation, Information and Technology Departments Computer Science Informatik 16 - Lehrstuhl für Anwendungen in der Medizin (Prof. Navab)Import