Deep Learning in a Bilateral Brain

Original RFR here

Preprint here

Overview

The brains of all bilaterally symmetric animals on Earth are divided into left and right hemispheres. It is a remarkably conserved feature across species, indicating its importance for intelligence. The anatomy and functionality of the hemispheres overlap to a large degree, but each hemisphere specialises to possess different attributes. The most likely explanation for this emergent specialisation is small differences in the parameterisation of the substrate: for example, a higher rate of synaptic plasticity in one hemisphere, different relative layer sizes, and different connectivity patterns within and across layers. This biological parameterisation could be equivalent to the hyperparameters of a Machine Learning algorithm.

There could be great benefits to understanding and mimicking this pervasive design feature of biological intelligence by building bilateral ML algorithms: two parallel systems with different specialities. To our knowledge, this has never been explored. It has the potential to confer significant advantages and is an exciting prospect. How to approach it is an open question, but we can be inspired and guided by neuroscience models.

Objective

The objective is to build a bilateral Machine Learning model with hemispheres resembling the biological counterparts. This will form the basis for further explorations of bilateralism, including interplay with other brain and neurotransmitter systems.

Getting started

Terminology

In this project, we use a backbone (e.g. ResNet or VGG type architectures) to create larger networks (with either one or two hemispheres) with one or two classifier heads.

Fine/Coarse: This project is designed for hierarchical datasets such as CIFAR100, where each image has a fine and a coarse label. Fine is the narrow/specific description, such as dolphin, and Coarse is the broad/general description, such as sea creature.
Architecture: The backbone used i.e. the architecture for a single hemisphere
Single/Dual Head: The number of heads. Each head is a classifier, and can be trained on either fine labels or coarse labels. Uses mode_heads in config files.
Macro-architecture: The whole network, including hemispheres and heads. The number of hemispheres is variable: Unilateral = 1 hemisphere, Bilateral = 2 hemispheres. Uses mode_hemis in config files. IMPORTANT: when mode_hemis is set to unilateral, the config parameters for 'fine' are used (e.g. farch to specify the architecture), even if you chose mode_heads = coarse.

The main idea

The main idea is to:

1. Train individual hemispheres in a way that makes them specialized: one hemisphere is trained on fine labels, the other on coarse labels.
2. Combine the pre-trained hemispheres into a bilateral architecture (i.e. two hemispheres) with two heads. One head is for fine labels, the other for coarse. The outputs of the two hemispheres are concatenated to create one set of features that is fed into each head. Because the hemispheres were pre-trained in step (1), they are frozen here (they will not be trained); only the heads are trained and tested.
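
The two steps can be illustrated with a minimal PyTorch sketch. It is not the repository's implementation: the class `BilateralNet`, the helper `tiny_backbone`, and the feature size are hypothetical names and values chosen for illustration. The class counts (100 fine, 20 coarse) are those of CIFAR100.

```python
import torch
from torch import nn


class BilateralNet(nn.Module):
    """Two frozen hemispheres; their features are concatenated and fed to two heads."""

    def __init__(self, left, right, feat_dim, n_fine=100, n_coarse=20):
        super().__init__()
        self.left, self.right = left, right
        # Freeze the pre-trained hemispheres so that only the heads are trained.
        for p in list(left.parameters()) + list(right.parameters()):
            p.requires_grad = False
        # Each head sees the concatenated features of both hemispheres.
        self.fine_head = nn.Linear(2 * feat_dim, n_fine)
        self.coarse_head = nn.Linear(2 * feat_dim, n_coarse)

    def forward(self, x):
        feats = torch.cat([self.left(x), self.right(x)], dim=1)
        return self.fine_head(feats), self.coarse_head(feats)


def tiny_backbone(feat_dim):
    # Stand-in for a real backbone such as a ResNet; purely illustrative.
    return nn.Sequential(
        nn.Conv2d(3, 8, kernel_size=3, padding=1),
        nn.ReLU(),
        nn.AdaptiveAvgPool2d(1),
        nn.Flatten(),
        nn.Linear(8, feat_dim),
    )


# In practice both hemispheres would be loaded from checkpoints trained in step (1).
model = BilateralNet(tiny_backbone(16), tiny_backbone(16), feat_dim=16)
fine_logits, coarse_logits = model(torch.randn(4, 3, 32, 32))
```

Passing only the parameters with `requires_grad=True` to the optimizer would then restrict training to the two heads.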

Preparing the data

Download CIFAR100 for python, and put it in your chosen folder.

mkdir ../datasets
cd ../datasets
wget https://www.cs.toronto.edu/~kriz/cifar-100-python.tar.gz
tar -xf cifar-100-python.tar.gz
cd ../bilateral-brain
python data_scripts/prepare_cifar.py

Modify the paths in data_scripts/prepare_cifar.py (at the top of the file), and run the script to create the fine and coarse dataset folders.
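
To inspect the raw files before running the script, the sketch below reads one split of the extracted archive and relates fine labels to coarse labels. It relies only on the published layout of the CIFAR-100 python version (a pickled dict with `fine_labels` and `coarse_labels` entries). The function names are hypothetical and are not part of prepare_cifar.py.

```python
import pickle
from collections import defaultdict


def load_split(path):
    """Load one file of the CIFAR-100 python version (the extracted `train` or `test`)."""
    with open(path, "rb") as f:
        # The archive was pickled under Python 2, hence the bytes keys.
        return pickle.load(f, encoding="bytes")


def fine_to_coarse(batch):
    """Map each fine label id to the sorted coarse label id(s) it occurs with."""
    mapping = defaultdict(set)
    for fine, coarse in zip(batch[b"fine_labels"], batch[b"coarse_labels"]):
        mapping[fine].add(coarse)
    return {fine: sorted(coarse) for fine, coarse in mapping.items()}
```

With the download steps above, `load_split("../datasets/cifar-100-python/train")` should return the training split, and `fine_to_coarse` should then list exactly one coarse id per fine id.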

Each folder has a config.yaml which you can use to configure the experiment. In particular, set the paths to the dataset and checkpoints there.

The main entry point is trainer.py, used to train, validate and test.

Run the system

The easiest way to run the system in the stereotypical way (as in the section 'The main idea' above) is to use the train_system.py script. It lets you run several seeds for each single hemisphere, and then several seeds for the whole bilateral architecture. It is also quite configurable, e.g. you can choose different backbones. The script depends on the base configs and modifies them.
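
The idea of deriving per-seed runs from a base config can be sketched as follows. This is not the code of train_system.py: the helper `make_variants`, the `seed` key, the flat config layout, and the backbone names are assumptions made for illustration.

```python
import copy


def make_variants(base_cfg, overrides, seeds):
    """Return one config dict per seed: a copy of base_cfg with overrides applied."""
    variants = []
    for seed in seeds:
        cfg = copy.deepcopy(base_cfg)  # never mutate the shared base config
        cfg.update(overrides)
        cfg["seed"] = seed  # hypothetical key name
        cfg["exp_name"] = f"{cfg.get('exp_name', 'run')}_seed{seed}"
        variants.append(cfg)
    return variants


# Placeholder values; the valid backbone names are defined by the repository.
base = {"exp_name": "left_fine", "mode_hemis": "unilateral", "mode_heads": "fine", "farch": "resnet9"}
runs = make_variants(base, {"farch": "vgg11"}, seeds=[0, 1, 2])
```

Each resulting dict could be written to its own config file and passed to trainer.py.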

You can have finer-grained control, and try different variations, by running trainer.py and creating new config files as required. A number of pre-set configs are also available in the config folder and can be modified.

e.g. python trainer.py --config configs/config_file.yaml

Examples of how to do that are given below, first in the context of the stereotypical scenario.

Example 1: bilateral architecture with specialization

As an example, here are step-by-step instructions to train Left on fine classes and Right on coarse classes, then put them together into a bilateral architecture and train on all class types.

This will specialize the left for fine and the right for coarse classes, using supervised training. As in the paper, you can then apply additional asymmetries to enhance specialization, such as sparsity or hemispheres with asymmetric layer widths.

1. Train Left on specific/fine labels

Configure the experiment by modifying the config.yaml file in the config/ folder, to:

use specific labels (config/dataset/), and
change the name of the experiments (config/exp_name)
update mode_heads appropriately: fine for fine labels, coarse for coarse labels
ensure mode_hemis is set to unilateral for single hemisphere
ensure mode_out is set to pred to get the classification output (as opposed to raw features)
update farch to set the backbone architecture.
set evaluate to True if you want to also test the accuracy during training
see config.yaml for explanations of other parameters

Then run: python trainer.py

2. Train Right on general/coarse labels

Then do it all again on coarse labels to train the Right hemisphere.

3. Create bilateral architecture

Configure to use the appropriate checkpoints to load the Left and Right hemispheres.

update mode_heads to both. In this case it determines where the output is taken from.
update mode_hemis to bilateral
set the farch and carch to the appropriate backbones of fine and coarse hemispheres respectively
set them to be frozen with ffreeze and cfreeze
set the dataset paths in the config to point to the fine folders and include the path to the raw files. The DataModule will then pick up fine labels from the image names and the coarse labels from the raw files. For example:

dataset:
  train_dir: datasets/CIFAR100/train/fine
  val_dir: datasets/CIFAR100/test/fine
  test_dir: datasets/CIFAR100/test/fine

see config.yaml for explanations of other parameters
set evaluate to True to get test accuracy

Then run: python trainer.py

Example 2: How to train a single hemisphere on two labels

set the config hparam mode_hemis to unilateral
set the config hparam mode_heads to both
it will use all the fine hparams e.g. farch and ffreeze, to specify this one hemisphere
set ffreeze to False

Then run: python trainer.py
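
Before launching a run, the settings from both examples can be sanity-checked with a small helper. This is a minimal sketch: `check_config` is hypothetical, it assumes the keys sit at the top level of the config, and the backbone name is a placeholder.

```python
def check_config(cfg):
    """Return a list of warnings for a flat dict of config values."""
    problems = []
    hemis = cfg.get("mode_hemis")
    heads = cfg.get("mode_heads")

    if hemis == "unilateral":
        # A single hemisphere is always specified by the 'fine' parameters
        # (farch, ffreeze), whatever mode_heads says.
        if cfg.get("ffreeze"):
            problems.append("ffreeze is set, so the single hemisphere would not be trained")
    elif hemis == "bilateral":
        if heads != "both":
            problems.append("bilateral runs in the examples use mode_heads = both")
        for key in ("ffreeze", "cfreeze"):
            if not cfg.get(key):
                problems.append(f"{key} is not set, so a pre-trained hemisphere would be updated")
    else:
        problems.append(f"unknown mode_hemis: {hemis!r}")
    return problems


example_2 = {"mode_hemis": "unilateral", "mode_heads": "both", "farch": "resnet9", "ffreeze": False}
print(check_config(example_2))  # prints []
```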
