Towards High-performance Object Detection: Task-specific Design Considering Classification and Localization Separation

Kim, J.U.; Kim, S.T.; Kim, E.S.; Moon, S.K.; Ro, Y.M.

staekim2020icassp

Title:: Towards High-performance Object Detection: Task-specific Design Considering Classification and Localization Separation
Document type:: Conference paper
Author(s):: Kim, J.U.; Kim, S.T.; Kim, E.S.; Moon, S.K.; Ro, Y.M.
Abstract:: Object detection performs two tasks (classification and localization) simultaneously. Two tasks share a similarity: they need robust features that effectively represent the visual appearance of the objects. However, two tasks also have different properties. First, classification mainly requires features from discriminative parts of an object to determine the object category, whereas localization mainly requires features from the entire object regions for localizing by drawing a bounding box. Second, classification has a translation invariant property, whereas localization has a translation variant property. In order to increase the efficiency of object detection, it is necessary to design a network in consideration of the commonalities and differences of two tasks. In this work, we simply modified layers of the existing object detection networks into three parts by considering such characteristics: lower-layer feature sharing part, layer separation part, and feature fusion part. As a result, the performance of the proposed method was noticeably improved by properly sharing, separating, and fusing layers of the existing object detection networks.
Keywords:: ObjectDetection, DeepLearning
Conference / Book title:: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference organizer:: IEEE
Year:: 2020
Pages:: 4317-4321
