Maintenance 0.5.2
Just some minor bug fixes and documentation improvements:
- Datetime compatibility for Windows #137 #142
- Continuous Integration fixes #138
- SoftDeterministicPolicy scaling fix #140
- Fix incorrect counting of test trials in parallel experiments #143
- Remove trailing commas #146
- First action was being selected using act() instead of eval() in test mode #150
- Documentation improvements #151