Diarization is taking 90+ minutes, is that normal?

Question

Diarization is taking 90+ minutes, is that normal?

josh-may opened this issue 2 years ago · comments

I'm went through the repo and I go to this part:

DEMO_FILE = {'uri': 'blabal', 'audio': 'audio.wav'}
dz = pipeline(DEMO_FILE)  

with open("diarization.txt", "w") as text_file:
    text_file.write(str(dz))

But it's now been running for 65+ minutes. And this is for the 20 min audio file mentioned in the repo.

How long should the diarization take?

andytyrer · Answer 1 · Fri Jan 13 2023 19:18:47 GMT+0800 (China Standard Time)

Hi, yes having the same problem. It's at 1hr 4mins on the runtime for a 20 min file

Neo · Answer 2 · Thu Mar 23 2023 07:00:51 GMT+0800 (China Standard Time)

I was able to finish the Diarization in 3 minutes using Google Collab with GPU execution for a 55 minute audio file

dbtreasure · Answer 3 · Mon Apr 17 2023 06:54:54 GMT+0800 (China Standard Time)

I plugged into a M60 nvidia on azure ML workspaces through VS code and was able to still utilize github copilot and stay in my IDE and it finished an hour episode of a podcast in 10m

fabi.s · Answer 4 · Mon Apr 17 2023 15:43:39 GMT+0800 (China Standard Time)

like most people here said, it depends on the length of the audio file, your hardware and on the size of the Whisper model you choose.

Patrick Wang · Answer 5 · Mon May 22 2023 18:55:26 GMT+0800 (China Standard Time)

I have RTX 3060, after downloading and installing CUDA, it finished the processing in around 17 minutes. Before that, it took forever and I gave up in the end.
If you have a CUDA-capable GPU, you can follow the guide below to install the CUDA version of PyTorch. It does make a lot of difference.
https://pytorch.org/get-started/locally/