Progress for week 16 (2018)
Vetle Bu Solgård
Budget
- Start DDPG and TRPO implementation
- Decide where to validate the algorithms and implement that choice
- Extend REINFORCE with baseline
- Read up on typical state representations and reward signals for locomotion tasks
- Get an overview of potential tasks to explore with Dyret (balance, movement speed, etc.)
Accounting
Martin Hovin
Budget
- Improve the text
- Write the Introduction
- Write the Acknowledgements
- Write the Abstract