Progress for week 16 (2018)

Vetle Bu Solgård

Start DDPG and TRPO implementation
Make a decision and implement on where to validate algorithms
Extend REINFORCE with baseline
Read up on typical state representations and reward signals for locomotion tasks
Get an overview of potensial different tasks to explore with Dyret (Balance, movement speed, etc.)

Skrevet første utkast til Relearning
- For kort avsluttning?
- Henger første halvdel sammen med siste?
Skrevet kort intro
- Trengs det mer?
- Må problemstillingen konkteriseres mer?
Startet på avsluttningen
- Seksjonene er nå:

Result summary

Discussion summary

Thesis conclusion

Future work