Critiquing Assignment - Junyuan Zheng
This page has a good coverage of the topic. but I have several suggestions:
1. Some place need to add clear references.
2. The final fundamental issue is generalization: given that we can only visit a subset of the exponential number of states, how can we know the value of all the states? The most common approach is to approximate the Q/V functions using, say, a neural net. A more promising approach (according to many experts) uses the factored structure of the model to allow safe state abstraction.
This page did not mention Q/V function before, therefore, maybe need some explain....
3. The "Value function approaches" part is not very clear to me, maybe you can use Q-learning function or SARSA Algorithm to explain this part.
JunyuanZheng (talk)