CenterNet
Last updated
Last updated
More recently, CenterNet has been introduced [66], which models an object as a single point, i.e., the center point of its bounding box. It uses keypoint estimation to find center points and regresses to all other object properties, such as size, 3D location, orientation, and pose. CenterNet is end-to-end differentiable, simpler, faster, and more accurate than corresponding bounding box-based detectors. Some of the results are shown in Figure 2-52 and Figure 2-53.
Figure 2-53 An object is modeled as the center point of its bounding box. The bounding box size and other object properties are inferred from the keypoint feature at the center [66].
Figure 2-54 Qualitative results. All images were picked thematically without considering algorithms performance. First row: object detection on COCO validation. Second and third row: Human pose estimation on COCO validation. For each pair, we show the results of center offset regression (left) and heatmap matching (right). fourth and fifth row: 3D bounding box estimation on KITTI validation. The authors show projected bounding box(left) and bird eye view map(right). The ground truth detections are shown in solid red solid box. The center heatmap and 3D boxes are shown overlaid on the original image [66].