Critiquing Assignment - Junyuan Zheng

Critiquing Assignment - Junyuan Zheng

This page has a good coverage of the topic. but I have several suggestions:

1. Some place need to add clear references.

2. The final fundamental issue is generalization: given that we can only visit a subset of the exponential number of states, how can we know the value of all the states? The most common approach is to approximate the Q/V functions using, say, a neural net. A more promising approach (according to many experts) uses the factored structure of the model to allow safe state abstraction.

This page did not mention Q/V function before, therefore, maybe need some explain....

3. The "Value function approaches" part is not very clear to me, maybe you can use Q-learning function or SARSA Algorithm to explain this part.

JunyuanZheng (talk)05:24, 2 February 2016

Thank you for your feedback. I've added annotated references. I've also revised the generalization and value function approaches sections and kept it as simple as possible while maintaining the important features.

AdnanReza (talk)07:20, 9 February 2016