Distributed Data Mining and Machine Learning algorithms on top of Apache Hadoop, Apache Giraph, Apache Hama

yxjiang/bigmining

Computing frameworks

  • Apache Hadoop MapReduce (Well-known implementation of the MapReduce model)
  • Apache Hama (General BSP-based distributed computing framework)
  • Apache Giraph (Distributed graph computing framework)
  • Apache Spark (In-memory distributed computing framework)
  • Apache Storm (Real-time distributed computing framework)

Implemented Algorithms

Hadoop MapReduce

Ridge Regression trained by coordinate descent
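
The repository's own implementation is not reproduced here. As a rough illustration of how coordinate descent for ridge regression can be phrased as map and reduce steps, the sketch below runs in a single Python process, with plain lists standing in for data partitions. The function names (`map_partition`, `reduce_sums`, `ridge_coordinate_descent`) and the objective scaling (half the squared error plus `lam / 2` times the squared norm of the weights) are assumptions made for this example, not taken from the project.

```python
from functools import reduce


def map_partition(rows, w, j):
    """Map step (hypothetical): partial sums for coordinate j over one partition."""
    num = 0.0  # sum of x_j * (y - prediction made without coordinate j)
    den = 0.0  # sum of x_j squared
    for x, y in rows:
        pred_without_j = sum(x[k] * w[k] for k in range(len(w)) if k != j)
        num += x[j] * (y - pred_without_j)
        den += x[j] * x[j]
    return num, den


def reduce_sums(a, b):
    """Reduce step (hypothetical): add the partial sums of two partitions."""
    return a[0] + b[0], a[1] + b[1]


def ridge_coordinate_descent(partitions, n_features, lam=1.0, n_sweeps=50):
    """Minimise 0.5 * sum((y - x.w)^2) + 0.5 * lam * |w|^2, one coordinate at a time."""
    w = [0.0] * n_features
    for _ in range(n_sweeps):
        for j in range(n_features):
            num, den = reduce(reduce_sums, (map_partition(p, w, j) for p in partitions))
            # Exact minimiser of the objective in w[j] with the other weights fixed.
            w[j] = num / (den + lam)
    return w


if __name__ == "__main__":
    # Two toy partitions; the targets follow y = 2 * x0 + 3 * x1.
    partitions = [
        [([1.0, 0.0], 2.0), ([0.0, 1.0], 3.0)],
        [([1.0, 1.0], 5.0), ([2.0, 1.0], 7.0)],
    ]
    print(ridge_coordinate_descent(partitions, n_features=2, lam=1.0))
```

Each coordinate update needs only two scalar sums over the data, which is why it maps naturally onto one MapReduce pass per coordinate. A real Hadoop job would likely batch or cache these passes to limit job-launch overhead.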

Lasso Regression trained by gradient descent
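
The L1 penalty in lasso is not differentiable at zero, so "gradient descent" here has to mean some variant such as subgradient or proximal gradient descent. Which one the repository uses is not stated. The sketch below assumes proximal gradient descent (ISTA, with a soft-thresholding step) and again uses plain Python lists as stand-ins for partitions. All names, the fixed step size, and the objective scaling (`1 / (2n)` times the squared error plus `lam` times the L1 norm) are assumptions for illustration only.

```python
from functools import reduce


def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def map_gradient(rows, w):
    """Map step (hypothetical): squared-error gradient and row count for one partition."""
    grad = [0.0] * len(w)
    for x, y in rows:
        err = sum(xk * wk for xk, wk in zip(x, w)) - y
        for k, xk in enumerate(x):
            grad[k] += err * xk
    return grad, len(rows)


def reduce_gradients(a, b):
    """Reduce step (hypothetical): add partial gradients and row counts."""
    return [ga + gb for ga, gb in zip(a[0], b[0])], a[1] + b[1]


def lasso_proximal_gradient(partitions, n_features, lam=0.1, step=0.1, n_iters=500):
    """Minimise (1 / (2n)) * sum((y - x.w)^2) + lam * |w|_1 with a fixed step size."""
    w = [0.0] * n_features
    for _ in range(n_iters):
        grad, n = reduce(reduce_gradients, (map_gradient(p, w) for p in partitions))
        # Gradient step on the smooth part, then soft-threshold for the L1 part.
        w = [soft_threshold(wk - step * gk / n, step * lam) for wk, gk in zip(w, grad)]
    return w


if __name__ == "__main__":
    # Two toy partitions; the targets follow y = 2 * x0 + 3 * x1.
    partitions = [
        [([1.0, 0.0], 2.0), ([0.0, 1.0], 3.0)],
        [([1.0, 1.0], 5.0), ([2.0, 1.0], 7.0)],
    ]
    print(lasso_proximal_gradient(partitions, n_features=2, lam=0.1))
```

The fixed `step` must be small enough for the data at hand (at most the reciprocal of the largest eigenvalue of the scaled Gram matrix), otherwise the iteration can diverge. The value used here suits the toy data only.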
