From Robin

Revision as of 13:45, 8 November 2017 by Martijho (Talk | contribs)

(diff) ← Older revision | Current revision (diff) | Newer revision → (diff)

Jump to: navigation, search

Experiments

A separate page describing research questions and the experiments proposed to answer them.

Experiments

Key "words"

Terms: Super neural network; Evolved sub-models from a larger set of parameters; Multitask learning; No catastrophic forgetting; Embedded transfer learning

Terms to use in thesis

Plastic Neural Network

NN that change topology or connectivity according to learning algorithm

In-silico

performed by computer

Modular Super Neural Network

DNN consisting of modules of smaller NNs,

Meme

Smallest concise unit of knowledge

Thoughts on Thesis

- Search for the first path is unnecessary? The search is over good permutations of parameters from the network at the same time the parameters are trained for the first time. In other words: does the search provide a significant increase in transferability or any measurable increase in performance over just picking a random path and training it for a set amount of iterations?

- When training on a saturated PathNet, it might be quicker to preprocess the data for each path (view it as feature extraction) since there is no backpropagation except for in the final task-specific softmax layer

Thesis problem specification

Studying the behaviour of super neural networks when saturated with subtasks from the same domain such as in a curriculum learning scenario. Include research questions such as

Can we estimate the decline in needed capacity for each new sub-task learned from the curriculum?
Could a PathNet saturated with optimized paths for tasks from a curriculum provide one/few-shot learning?
- What would, in that case, constitute a "saturated PathNet"?
- Is there a learning advantage to be had from this kind of learning?
Is there a measurable increase in performance by searching over optimal "first paths" instead of just training a selected segment of the PathNet?

PathNet structure

Small structure to reduce computational requirements.

(3 layers 10-20 modules of small affine MLPs)

Test scenario

Must be fairly quick to provide one episode. Small input dimentionality to reduce necessary capacity of PathNet structure and computational time. The scenario must also be easy to divide into subtasks.

OpenAI gym?
- LunarLander:
  - Hover
  - Land safely
  - Land in goal
  - Land in goal quickly

Who cites PathNet?

Born to Learn EPANN - Evolved Plastic Artificial Neural Networks Mentions Pathnet as an example of where evolution where used to train a network on multiple tasks. "While these results were only possible through significant computational resources, they demonstrate the potential of combining evolution and deep learning approaches.

Learning time-efficient deep architectures with budgeted super networks Mentions PathNet as a predecessor in the super neural network family

Deep Learning for video game playing Reviewing recent deep learning advances in the context of how they have been applied to play different types of video games

Evolutive deep models for online learning on data streams with no storage Pathnet is proposed alongside PNNS as a way to deal with changing environments. It is mentioned that both PathNet and progressive networks show good results on sequences of tasks and are a good alternative to fine-tuning to accelerate learning.

Online multi-task learning using active sampling Cites Progressive Neural Networks for multitask learning

Hierarchical Task Generalization with Neural Programs Mentions PathNet as way of reusing weights

Multitask Evolution with Cartesian Genetic Programming Mentions PathNet in a list of systems that use evolution as tool in multitasking

User:Martijho

From Robin

Contents

Experiments

Key "words"

Terms to use in thesis

Thoughts on Thesis

Thesis problem specification

PathNet structure

Test scenario

Who cites PathNet?

Views

Personal tools

Front page

Navigation

Search

Toolbox