File:Q-Learning on Taxi-v2 with Gaussian Noise on Reward.png
Q-Learning_on_Taxi-v2_with_Gaussian_Noise_on_Reward.png (775 × 538 pixels, file size: 119 KB, MIME type: image/png)
Summary
Description | English: Varying levels of sigma (standard deviation) with the fixed mean of 0 are used to apply Gaussian noise to true reward returned by OpenAI Gym's Taxi-v2 environment. The resulting corrupt reward is used to compute Q values, while the true reward is collected to observe the agent's performance. |
Date | 14 April 2019 |
File source | Own Work |
Author | NamHeeKim |
Licensing
|
File history
Click on a date/time to view the file as it appeared at that time.
Date/Time | Thumbnail | Dimensions | User | Comment | |
---|---|---|---|---|---|
current | 00:37, 15 April 2019 | 775 × 538 (119 KB) | NamHeeKim (talk | contribs) | ||
00:28, 15 April 2019 | 800 × 628 (133 KB) | NamHeeKim (talk | contribs) | User created page with UploadWizard |
You cannot overwrite this file.
File usage
The following page uses this file: