This task involves reinforcement learning and no longer uses the tags from Task 1. To simplify the problem, the position at each tick is limited to at most 5 hands long or 5 hands short, and long and short positions cannot be held at the same time. Only one action can be taken per tick. Positions are increased or decreased through buying and selling in units of one hand, so the position changes by at most one hand per action. An idle action keeps the current position unchanged. A buy action always succeeds, has no impact on the market, and executes at AskPrice1 of the current tick; a sell action likewise always succeeds, has no market impact, and executes at BidPrice1 of the current tick. Finally, your report should include the number of buys and sells on the test set, the average buy price, and the average sell price. In addition, attach the action selected at each tick of the test set for submission.
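As a concrete illustration of these rules, here is a minimal sketch in Python. The action encoding (0 = sell, 1 = idle, 2 = buy) and the function name step_position are assumptions made for illustration only, not the actual gym-stock implementation:

MAX_HANDS = 5  # at most 5 hands long or 5 hands short

def step_position(position, action, ask_price1, bid_price1):
    # position is a signed hand count: positive = long, negative = short,
    # so long and short positions can never be held at the same time.
    if action == 2 and position < MAX_HANDS:    # buy one hand at AskPrice1
        return position + 1, ('buy', ask_price1)
    if action == 0 and position > -MAX_HANDS:   # sell one hand at BidPrice1
        return position - 1, ('sell', bid_price1)
    return position, None                       # idle, or position limit reached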
The gym-stock environment differs slightly from stockenv: the returned observation is a single 138-d vector that includes the hand count, rather than a tuple of a 137-d observation and the hand count.
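For instance, assuming the hand count is appended as the last element of the vector (an assumption; check the gym-stock source for the exact layout), the observation can be split as follows (env creation is shown below):

obs = env.reset()        # obs is a 138-d vector
market_obs = obs[:137]   # the 137-d market observation
hands = obs[137]         # current position in hands (assumed last element)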
Install the environment with
pip install -e gym-stock
Then you can create the gym-stock environment with
import gym
import gym_stock
env = gym.make('stock-v0')
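Continuing from the snippet above, a quick sanity check that steps the environment with random actions, using the standard gym API:

obs = env.reset()
done = False
while not done:
    action = env.action_space.sample()          # random action, for testing only
    obs, reward, done, info = env.step(action)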
Add the following line at line 52 of run.py in OpenAI Baselines:
_game_envs['custom_type'] = {'stock-v0'}
Add the following lines to cmd_util.py:
from importlib import import_module
import_module("gym_stock")
Then you can train the policy with the Baselines algorithms:
python -m baselines.run --alg=deepq --env=stock-v0 --save_path=./stock_model.pkl --num_timesteps=1e5 --print_freq=10
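For the report you then need the trade statistics on the test set and the per-tick actions. A minimal sketch of that bookkeeping, assuming you roll out the saved model on the test set (for example via Baselines' --load_path and --play options, or your own evaluation script) and log each executed trade as a (side, price) pair in trades and every chosen action in actions (both names are illustrative):

buy_prices = [p for side, p in trades if side == 'buy']
sell_prices = [p for side, p in trades if side == 'sell']
print('buys:', len(buy_prices), 'avg buy price:', sum(buy_prices) / max(len(buy_prices), 1))
print('sells:', len(sell_prices), 'avg sell price:', sum(sell_prices) / max(len(sell_prices), 1))
with open('actions.txt', 'w') as f:  # per-tick action selection for submission
    f.write('\n'.join(str(a) for a in actions))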