ddlBoJack / emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

OOM while processing IEMOCAP dataset

AirHorizons opened this issue · comments

I was trying to create iemocap embedding on my own, but my GPU with 8GB memory gave me OOM from cuda. How much size do I need to process this?

Hi, for feature extraction using emotion2vec, the maximum GPU memory usage is only 2-3GB. You can check again to see if there is anything wrong.

Good news I found my mistake when I put the audio file of the whole dialogue instead of each utterance. Thank you for assuring me to look for another way!