Actor and critc
Actor and critic is one of the state of art algorithms in deep reinforcement learning.
We need two models for actor and critic. Fully connected layer, convolutional neural network and recurrent neural network can be candidates for each of them. Like AlphaGo Zero, we can combine actor and critic model into one same model.
1.2 Loss functions (Object functions)
2.1 Choice of action
3.1 Forward step
3.2 Provide observation and reward to agent