This approach uses the available methods for image object classification in 2D then projects the results based on RGB-D depth information to the 3D space. Object classification (in this case) is made with Cascade classifier, but easily can be changed with a Deep Neuronal Network.
Processig steps:
- RGB and Depth information is loaded
Note: for visualization purposes, the pyvista and itk viewer is used. Please follow the installation instructions: https://github.com/InsightSoftwareConsortium/itkwidgets
More details on the implementation, algorithms can be found here:
- https://github.com/fvilmos/kinect_point_cloud - visualization of Kinect 3d data
- https://github.com/fvilmos/cascade_tools - train your cascade
- https://github.com/fvilmos/cascade_nms - false positive filtering with Non-Maximum-Suppression
- Optimize Classifier - change with i.e. Yolo or Mobilenet
- with a good classifier optimize away the Non-Maximum-Suppression step
Any contribution is welcomed!
/Enjoy.