raryelcostasouza / pyTranscriber

pyTranscriber can be used to generate automatic transcription / automatic subtitles for audio/video files through a friendly graphical user interface.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

possibility to vary the region-detection threshold?

aleks-mariusz opened this issue · comments

commented

Is your feature request related to a problem? Please describe.
I have some videos where the speakers tend to pause between parts of their sentence. pyTranscriber seems to treat each of these pause as a separate "region", and so the translations are created 'in isolation' and are often incorrect.

Is there a way to vary the region-detection threshold? It looks like the parameter is called min_region_size per this function but i'm not sure how to make it so that this can be varied by the gui?

Is your feature request related to a problem? Please describe.
yes, depending on speaker, the default region-detection threshold may need to be adjusted

Describe the solution you'd like
would be nice if this could be varied (or maybe even disabled so the entire file counts as a region, unless there's a reason that's undesireable)

Describe alternatives you've considered
i considered to see if the pyTranscriber.app macOS deployment can be editted but it looks like the code is encapsulated and non-modifiable there?

Additional context
Add any other context or screenshots about the feature request here.

Hi @aleks-mariusz,
In fact the binary distributions have code encapsulated... so unless you are skillful in compiling (instructions on docs folder) no easy way to change that.

If you are confortable with command line one suggestion is to try using autosub directly with that property edited. I confess I never tweaked that so I'm not sure how helpful will that be in your case.

If you try that in autosub and it is effective in solving your issue please let me know so we can add an option from pyTranscriber to edit that in future release.

Thanks

commented

Hi Raryel, thanks for the incredibly fast reply. I did peak at autosub however the repo says it's not currently maintained and that its last commit is from 4 years ago, so i wasn't sure if that'll work. I'll give it a try and let you know if it makes a difference!

commented

Ok, i see.. so i gave autosub a try.. after modifying the min_region_size parameter to 15 seconds.. i got, still pretty lousy results. I've uploaded both the resulting .SRT file as well as a .SRT file generated by uploading a sample video to youtube. The sample one is less than 20 minutes, while i have dozens of real working set of videos are almost identical, except 80-120 minutes long, so a bit impractical to upload to youtube just to get subtitles :-/

If you have a few minutes, can you take a look a these, and let me know why the one from youtube is so incredibly accurate, whereas the one autosub outputs is basically rubbish, even though they both use google?

I'm baffled how to make this app usable if it's not the smaller regions sizes :-/