Vaibhavs10 / insanely-fast-whisper

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Out Of memory & attempt to get argmin of an empty sequence

learner0333 opened this issue · comments

I am calling replicate model https://replicate.com/vaibhavs10/incredibly-fast-whisper using API. I am using it to get the transcript and diarization.

Initially I have used batch_size = 64 , The 1 Hour 13 minutes video worked fine.

I have tried another video that is 1 hour and 24 minutes. I got the Out of memory error. The exact error is as follows:

"Not enough memory available to process your request. Try reducing the size or number of any file inputs or outputs"

After this error I have tried to change the batch_size to 4,16,24 and 32 but I got the following error:
"attempt to get argmin of an empty sequence"

will you please guide me to fix these two problems? I want to use Replicate in production so need to figure out these before implementing in the production environment.

I have tried two more videos that are 1 hour and 46 minutes and other is 2 hours and 7 minutes. I got the following error:

"unsupported operand type(s) for -: 'NoneType' and 'float'"

The following line is copied from the replicate log:

Whisper did not predict an ending timestamp, which can happen if audio is cut off in the middle of a word. Also make sure WhisperTimeStampLogitsProcessor was used during generation.

I'm getting this too!

facing the same issue