File:Q-Learning on Tax-v2 with Scaled Reward.png
Size of this preview: 741 × 599 pixels. Other resolution: 858 × 694 pixels.
Original file (858 × 694 pixels, file size: 74 KB, MIME type: image/png)
Summary
Description | English: Cumulative true reward from the environment is plotted as a function of training episodes. Agents with positively scaled rewards converge to the global optimum. Negative rewards have an extremely adversarial effect on the agent's performance. |
Date | 15 April 2019 |
File source | Own Work |
Author | NamHeeKim |
Licensing
|
File history
Click on a date/time to view the file as it appeared at that time.
Date/Time | Thumbnail | Dimensions | User | Comment | |
---|---|---|---|---|---|
current | 05:41, 19 April 2019 | 858 × 694 (74 KB) | NamHeeKim (talk | contribs) | User created page with UploadWizard |
You cannot overwrite this file.
File usage
The following page uses this file: