wutong8023 / SpeechRE

SpeechRE

TTS contains the text-to-speech code.

IWSLT contains the SpeechRE code; the model is placed in fairseq_modules.

fairseq is a modified copy of the fairseq source code.

Dataset

conll04.tgz: https://drive.google.com/file/d/1Q5k3eM6WknfjA2DWo19CyTwZngYVXRUL/view?usp=sharing

re-tacred(dev&test_part).tgz: https://drive.google.com/file/d/1qctG-n_W51zp-hiPDS-XEl7jh_bI1l_-/view?usp=sharing

re-tacred(train_part).tgz: https://drive.google.com/file/d/1ainRqlx4h9_HDFtOq8xasN-OLJDNSbwD/view?usp=sharing

For example, the data of CoNLL04 is organized as:

├── conll04
│   ├── audio
│   │   ├── train
│   │   │   ├── train-0.wav
│   │   │   ├── train-1.wav
│   │   │   ├── train-2.wav
│   │   │   ├── ...
│   │   ├── dev
│   │   │   ├── dev-0.wav
│   │   │   ├── ...
│   │   ├── test
│   │   │   ├── test-0.wav
│   │   │   ├── ...
│   ├── train_conll04.tsv
│   ├── dev_conll04.tsv
│   ├── test_conll04.tsv
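
The layout above can be checked with a small script before training. This is a minimal sketch and not part of the repository: the helper names (`manifest_path`, `audio_path`, `missing_paths`) are hypothetical, and it assumes that each manifest is named `<split>_<dataset>.tsv`, where `<dataset>` is the name of the root folder, as in the CoNLL04 example.

```python
from pathlib import Path

SPLITS = ("train", "dev", "test")


def manifest_path(root: Path, split: str) -> Path:
    """Manifest for one split, e.g. conll04/train_conll04.tsv (assumed naming)."""
    return root / f"{split}_{root.name}.tsv"


def audio_path(root: Path, split: str, index: int) -> Path:
    """Audio clip for one example, e.g. conll04/audio/train/train-0.wav."""
    return root / "audio" / split / f"{split}-{index}.wav"


def missing_paths(root: Path) -> list:
    """Return the audio folders and manifests the layout expects but that do not exist."""
    expected = []
    for split in SPLITS:
        expected.append(root / "audio" / split)
        expected.append(manifest_path(root, split))
    return [p for p in expected if not p.exists()]


if __name__ == "__main__":
    # Placeholder root; replace with the folder where conll04.tgz was extracted.
    root = Path("/path/to/datasets/conll04")
    for path in missing_paths(root):
        print("missing:", path)
```

An empty result from `missing_paths` only means the folders and manifests exist; it does not verify that every `.wav` file listed in a manifest is present.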

The format of the TSV files:

id audio duration_ms n_frames tgt_text speaker tgt_lang
train-0 /path/to/datasets/conll04/audio/train/train-0.wav:0:239828 14989 239828 Radio Reloj Network Havana OrgBased_In 0 en
train-1 /path/to/datasets/conll04/audio/train/train-1.wav:0:64099 4006 64099 Bruno Pusterla Italian Agricultural Confederation Work_For 0 en
...
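
The manifest rows above can be read with Python's standard `csv` module. The sketch below is illustrative rather than the repository's own loader: `ManifestRow` and `parse_manifest` are hypothetical names, the columns are assumed to be tab-separated, and the `audio` field is assumed to have the form `path:offset:n_frames`, which matches the sample rows.

```python
import csv
import io
from dataclasses import dataclass

# One header line plus the first sample row, with tabs between columns (assumed).
SAMPLE = (
    "id\taudio\tduration_ms\tn_frames\ttgt_text\tspeaker\ttgt_lang\n"
    "train-0\t/path/to/datasets/conll04/audio/train/train-0.wav:0:239828\t"
    "14989\t239828\tRadio Reloj Network Havana OrgBased_In\t0\ten\n"
)


@dataclass
class ManifestRow:
    id: str
    audio_path: str
    offset: int
    duration_ms: int
    n_frames: int
    tgt_text: str
    speaker: str
    tgt_lang: str


def parse_manifest(text: str) -> list:
    """Parse manifest text into ManifestRow objects."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    rows = []
    for rec in reader:
        # Split from the right so colons inside the path itself are preserved.
        path, offset, _frames = rec["audio"].rsplit(":", 2)
        rows.append(
            ManifestRow(
                id=rec["id"],
                audio_path=path,
                offset=int(offset),
                duration_ms=int(rec["duration_ms"]),
                n_frames=int(rec["n_frames"]),
                tgt_text=rec["tgt_text"],
                speaker=rec["speaker"],
                tgt_lang=rec["tgt_lang"],
            )
        )
    return rows


def samples_per_ms(row: ManifestRow) -> float:
    """Ratio of n_frames to duration_ms for one row."""
    return row.n_frames / row.duration_ms


if __name__ == "__main__":
    for row in parse_manifest(SAMPLE):
        print(row.id, row.audio_path, row.tgt_text)
```

For both sample rows, `n_frames / duration_ms` is about 16, which would be consistent with 16 kHz audio, although the sample rate is not stated here.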
