my-yy / learnable_pins

Learnable Pins PyTorch implementation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PyTorch implementation of Learnable Pins

Learnable PINs: Cross-Modal Embeddings for Person Identity

This code is based on the Self-Lifting project which enables train a model just in minutes.

Note that the dataset splitting is VGG-Vox style, which is different from the original paper, but you can still know how the Curriculum-based Mining is implemented 😜 (utils/pair_selection_util.py).

Dataset

The dataset is the same as the Self-Lifting project. If you already have it, you can just create a soft link in the project root:

ln -s Your-Self-Lifting-Project-Root/dataset ./dataset

Or you need to download it by referring to Self-Lifting.

Training

Just Run: python 1_pins.py

You also can use wandb to view the training process:

  1. Create wb_config.json file in the ./configs folder, using the following content:

    {
      "WB_KEY": "Your wandb auth key"
    }
    
  2. add --dryrun=False to the training command, for example: python 1_pins.py --dryrun=False

Results

Because the Backbone structure and test script are different from the original paper, the scores behave much higher.

Paper Explanation (Chinese Language)

【音脸关系学习】:Learnable Pins 论文解读与代码复现

Other Resources

Voice Face Association Learning Papers & Codes

About

Learnable Pins PyTorch implementation


Languages

Language:Python 100.0%