emcf / thepipe

Extract clean data from anywhere, powered by vision-language models ⚡

Home Page: https://thepi.pe

Swap Whisper Version

skyler14 opened this issue

I was looking at your pipeline and thought you might be better served by using https://github.com/Vaibhavs10/insanely-fast-whisper, or by leaving a bit of wiggle room in the framework in the form of an optional parameter for passing in a separate processor for video transcription. This is over an order of magnitude improvement on vanilla Whisper and has CPU and GPU modes. You may want to just allow a whole pipeline to be passed in, to future-proof this particular endpoint against new tooling.
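To make the suggestion concrete, here is a rough sketch of what the optional parameter could look like. This is not thepipe's actual API: `transcribe_video`, the `Transcriber` type, and both backend functions are hypothetical names I made up for illustration. The only real library calls are `whisper.load_model(...).transcribe(...)` from openai-whisper and the Hugging Face `transformers` ASR `pipeline`, and both are imported lazily so neither is required unless that backend is used.

```python
from typing import Callable, Optional

# Hypothetical type: a transcriber is any callable that maps a media file path to text.
Transcriber = Callable[[str], str]


def vanilla_whisper_transcriber(path: str) -> str:
    """Default backend (assumption): plain openai-whisper, imported lazily."""
    import whisper  # pip install openai-whisper

    model = whisper.load_model("base")
    return model.transcribe(path)["text"]


def hf_pipeline_transcriber(path: str) -> str:
    """Alternative backend (assumption): a Hugging Face ASR pipeline."""
    from transformers import pipeline  # pip install transformers

    asr = pipeline(
        "automatic-speech-recognition",
        model="openai/whisper-large-v3",
        chunk_length_s=30,  # split long audio into 30 s chunks
    )
    return asr(path)["text"]


def transcribe_video(path: str, transcriber: Optional[Transcriber] = None) -> str:
    """Hypothetical entry point: use the caller's transcriber when one is given,
    otherwise fall back to vanilla Whisper."""
    backend = transcriber if transcriber is not None else vanilla_whisper_transcriber
    return backend(path).strip()
```

With something like this, a caller could pass `transcriber=hf_pipeline_transcriber`, or any other callable that wraps newer tooling, without the framework needing to know about it.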