DPOD: 6D Pose Object Detector and Refiner

Arch

Contribution
1. Proposed the Dense Pose Object Detector (DPOD), a method that regresses multi-class object masks and dense 2D-3D correspondences between image pixels and the corresponding 3D models.
2. Proposed a pose refinement approach that also performs very well, achieving high pose accuracy with a simpler and more lightweight backbone architecture. [Faster, simpler to train, and able to be trained on synthetic and real data.]

Model
1. Given an input RGB image, the correspondence block, featuring an encoder-decoder neural network, regresses the object ID mask and the correspondence map.
2. The correspondence map provides explicit 2D-3D correspondences, whereas the ID mask determines which correspondences should be taken for each detected object.
3. The respective 6D poses are then efficiently computed by the pose block based on PnP+RANSAC.
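
The hand-off between the two blocks can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `group_correspondences`, the `lookup_3d` callable, the nested-list layout, and the convention that ID 0 means background are all assumptions made for the example. It shows how the ID mask selects which entries of the correspondence map belong to each object, producing per-object lists of 2D image points and 3D model points.

```python
from collections import defaultdict

BACKGROUND_ID = 0  # assumption: ID 0 marks background pixels


def group_correspondences(id_mask, corr_map, lookup_3d):
    """Collect 2D-3D correspondences per detected object.

    id_mask:   H x W nested list of integer object IDs (hypothetical layout).
    corr_map:  H x W nested list of correspondence codes, one per pixel.
    lookup_3d: hypothetical callable (object_id, code) -> (X, Y, Z) that maps
               a correspondence code to a point on that object's 3D model.
    """
    per_object = defaultdict(lambda: {"image_points": [], "model_points": []})
    for row, (ids, codes) in enumerate(zip(id_mask, corr_map)):
        for col, (obj_id, code) in enumerate(zip(ids, codes)):
            if obj_id == BACKGROUND_ID:
                continue  # the ID mask says no object is present at this pixel
            per_object[obj_id]["image_points"].append((col, row))  # (x, y)
            per_object[obj_id]["model_points"].append(lookup_3d(obj_id, code))
    return dict(per_object)


def toy_lookup(obj_id, code):
    """Toy stand-in for a real model lookup; values are illustrative only."""
    u, v = code
    return (u / 255.0, v / 255.0, float(obj_id))


# Tiny 2 x 2 example: one background pixel, two pixels of object 1, one of object 2.
id_mask = [[0, 1],
           [2, 1]]
corr_map = [[(0, 0), (255, 0)],
            [(0, 255), (51, 102)]]
groups = group_correspondences(id_mask, corr_map, toy_lookup)
```

Each object's `model_points` and `image_points` could then be passed, together with the camera intrinsics, to a PnP+RANSAC solver such as OpenCV's `cv2.solvePnPRansac`, which needs at least four correspondences per object.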