Progress for week 16 (2018)
From Robin
(Difference between revisions)
(→Vetle Bu Solgård) |
|||
Line 1: | Line 1: | ||
== Vetle Bu Solgård == | == Vetle Bu Solgård == | ||
=== Budget === | === Budget === | ||
- | * | + | * Start DDPG and TRPO implementation |
+ | * Make a decision and implement on where to validate algorithms | ||
+ | * Extend REINFORCE with baseline | ||
+ | * Read up on typical state representations and reward signals for locomotion tasks | ||
+ | * Get an overview of potensial different tasks to explore with Dyret (Balance, movement speed, etc.) | ||
=== Accounting === | === Accounting === |
Revision as of 12:27, 13 April 2018
Contents |
Vetle Bu Solgård
Budget
- Start DDPG and TRPO implementation
- Make a decision and implement on where to validate algorithms
- Extend REINFORCE with baseline
- Read up on typical state representations and reward signals for locomotion tasks
- Get an overview of potensial different tasks to explore with Dyret (Balance, movement speed, etc.)
Accounting
Martin Hovin
Budget
- Forbedre tekst
- Skrive Introduksjon
- Skrive Acknowledgement
- Skrive Abstract