slbert

SLBERT: A Novel Pre-training Framework for Joint Speech and Language Modeling

This is the multimodal task in which we are doing multi output classififcation using the information of both audio and text

For text-encoder we used the bert for Audio-encoder we used LSTM

with the both features considration we had get the single feature vector which is having infomation of audio as well as text

To run the project

firsly download the dataset from here Unzip the dataset

update the paths in config.py

then run: pip install -r requirements.txt to install all dependences

to train the model

python train.py

to get the f1 score on test set first download the model from here

python evaluate.py

On validattion set we get the f1 score of 0.97

all the tensorboard training logs can be found here

About

SLBERT: A Novel Pre-training Framework for Joint Speech and Language Modeling

Language:Python 100.0%