Object classification from 2D images to 3D point cloud

This approach uses the available methods for image object classification in 2D then projects the results based on RGB-D depth information to the 3D space. Object classification (in this case) is made with Cascade classifier, but easily can be changed with a Deep Neuronal Network.

Processig steps:

RGB and Depth information is loaded

2. RGB image is applied to a Cascade classifier. Results are filtered with Non-Maximum-Suppression

3. Object coordinated are projected in 3D space using Kinect claibration parameters. Detection are plotted on the 3D Point Cloud

Note: for visualization purposes, the pyvista and itk viewer is used. Please follow the installation instructions: https://github.com/InsightSoftwareConsortium/itkwidgets

More details on the implementation, algorithms can be found here:

https://github.com/fvilmos/kinect_point_cloud - visualization of Kinect 3d data
https://github.com/fvilmos/cascade_tools - train your cascade
https://github.com/fvilmos/cascade_nms - false positive filtering with Non-Maximum-Suppression

TODO

Optimize Classifier - change with i.e. Yolo or Mobilenet
with a good classifier optimize away the Non-Maximum-Suppression step

Any contribution is welcomed!

Resources

/Enjoy.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
info		info
utils		utils
LICENSE		LICENSE
README.md		README.md
object_classification_2D_to_3D.ipynb		object_classification_2D_to_3D.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Object classification from 2D images to 3D point cloud

TODO

Resources

About

Releases

Packages

Languages

License

fvilmos/object_classification_2d_to_3d

Folders and files

Latest commit

History

Repository files navigation

Object classification from 2D images to 3D point cloud

TODO

Resources

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages