Distributed TensorFlow implementation of A3C from Google DeepMind.
The implementation targets TensorFlow 1.0.
http://arxiv.org/abs/1602.01783
"We present asynchronous variants of four standard reinforcement learning algorithms and show that parallel actor-learners have a stabilizing effect on training allowing all four methods to successfully train neural network controllers." Form Paper
The implementation is based on miyosuda's repository: https://github.com/miyosuda/async_deep_reinforce
The core implementation is almost the same as miyosuda's; only the parts needed to run with Distributed TensorFlow have been changed, following the pattern sketched below.
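The following is a minimal sketch (not the repository's exact code) of the Distributed TensorFlow pattern those changes revolve around: each process joins the same cluster as either a parameter server ("ps") or a worker, parameter servers only hold the shared variables, and workers build the graph with their variables placed on the ps tasks. Host names and ports here are placeholders; the real values are set in a3c.py.

```python
import tensorflow as tf

# Hypothetical host lists; the actual hostnames/ports are configured in a3c.py.
cluster = tf.train.ClusterSpec({
    "ps":     ["localhost:2222"],
    "worker": ["localhost:2223", "localhost:2224"],
})

def start(job_name, task_index):
    server = tf.train.Server(cluster, job_name=job_name, task_index=task_index)
    if job_name == "ps":
        server.join()  # parameter servers just serve the shared variables
        return
    # Workers build the graph; shared variables are placed on the ps tasks,
    # while compute ops stay on this worker.
    with tf.device(tf.train.replica_device_setter(
            worker_device="/job:worker/task:%d" % task_index,
            cluster=cluster)):
        global_step = tf.Variable(0, trainable=False, name="global_step")
        # ... build the A3C network and its update ops here ...
    with tf.Session(server.target) as sess:
        sess.run(tf.global_variables_initializer())
        # ... run the actor-learner loop ...
```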
./auto_run.sh
You need to set your hostname and port number in the a3c.py code. The numbers of parameter servers and workers can be set in the auto_run.sh script.
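Conceptually, a launcher like auto_run.sh amounts to starting one process per parameter server and one per worker, each told its job name and task index. The sketch below is only an illustration of that idea; the flag names (--job_name, --task_index) are assumptions, so check auto_run.sh and a3c.py for the actual interface.

```python
import subprocess

NUM_PS = 7        # number of parameter servers (set in auto_run.sh)
NUM_WORKERS = 14  # number of workers (set in auto_run.sh)

procs = []
for i in range(NUM_PS):
    procs.append(subprocess.Popen(
        ["python", "a3c.py", "--job_name=ps", "--task_index=%d" % i]))
for i in range(NUM_WORKERS):
    procs.append(subprocess.Popen(
        ["python", "a3c.py", "--job_name=worker", "--task_index=%d" % i]))

for p in procs:
    p.wait()
```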
Most settings are the same as miyosuda's, except for the number of processes: 14 workers and 7 parameter servers are used.
The experiment is the Pong game with the numbers of workers and parameter servers given above, running on CPU only. The server has 4 x Xeon E7-8880 v4 CPUs (176 threads with hyper-threading) and 1 TB of memory; neither is fully used (about 50% of the CPU and about 20 GB of memory). With this setup, about 800~900 steps run per second (better than the 980 Ti performance reported in miyosuda's repository), and the score saturates at about 35M steps.