jianchang512 / stt

Voice Recognition to Text Tool / an offline, locally run speech-to-text service that outputs JSON, SRT subtitles with timestamps, or plain text

Home Page: https://pyvideotrans.com


Big file can be recognized but shows no result.

WisdomLove opened this issue · comments

One mp4 file of about 100 MB, roughly 1 hour long, on CUDA with float32: the result displays nothing after recognition finishes.
The small sample mp4 file is recognized fine.

with model large-v3

What is the output in the CMD window?

Set cuda_com_type=int8 and retry.

CMD output: nothing except the onnxruntime 1983 warning I mentioned in the last issue.

I will try int8 later. Thanks a lot.

The problem happened again; I will switch to CPU.

cuda_com_type = int8_float16

If you deployed from source code, change line 107 from

segments, info = modelobj.transcribe(wav_file, beam_size=1, best_of=1, temperature=0, vad_filter=True, vad_parameters=dict(min_silence_duration_ms=500), language=language)

to

segments, info = modelobj.transcribe(wav_file, beam_size=5, best_of=5, vad_filter=True, vad_parameters=dict(min_silence_duration_ms=500), language=language)
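For reference, a minimal, self-contained sketch of how such a call usually sits in a faster-whisper script; the model size, compute type, and file path below are placeholders, not this project's actual values:

from faster_whisper import WhisperModel

# Placeholder setup: adjust model size, device and compute type to your hardware.
modelobj = WhisperModel("large-v3", device="cuda", compute_type="int8_float16")

segments, info = modelobj.transcribe(
    "sample.wav",                # placeholder path
    beam_size=5,
    best_of=5,
    vad_filter=True,
    vad_parameters=dict(min_silence_duration_ms=500),
    language="en",
)

# segments is a generator: nothing is transcribed until it is iterated.
for seg in segments:
    print(f"{seg.start:.2f} --> {seg.end:.2f} {seg.text}")

Because segments is lazy, an empty output can also mean the iteration never ran to completion, not only that VAD removed all the audio.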

The CPU test passed with a normal SRT result for about 5500 s of audio; I will try cuda_com_type = int8_float16 later.

One mp4 file of about 100 MB, roughly 68 minutes, on CUDA with int8_float16: the result displays nothing after recognition.
Another mp4 file of about 100 MB, roughly 62 minutes, on CUDA with int8_float16: the result is displayed.

The result still cannot be shown.

Update to 0.91, open set.ini, and try adjusting the parameters at the bottom; each one has a comment. You can tune them between maximum and minimum GPU consumption.

Wow, it's hard to adjust... I don't know how to adjust them; the comments are not very clear.

web_address=127.0.0.1:9977
lang=en
devtype=cuda
cuda_com_type=float32
beam_size=5
best_of=5
vad=true
temperature=1
condition_on_previous_text=true

This gives the best results, but it also consumes the most GPU.


web_address=127.0.0.1:9977
lang=en
devtype=cuda
cuda_com_type=int8
beam_size=1
best_of=1
vad=false
temperature=0
condition_on_previous_text=false

This is the most GPU-efficient configuration, but the results are relatively poor.
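For orientation, a hedged sketch of how these set.ini keys presumably map onto the underlying faster-whisper call; the variable names, model size, and file path are illustrative, not this project's code:

from faster_whisper import WhisperModel

# Illustrative mapping of set.ini keys to faster-whisper options.
cfg = {
    "devtype": "cuda",             # -> device
    "cuda_com_type": "int8",       # -> compute_type (float32: best quality, int8: least VRAM)
    "beam_size": 1,                # larger beams improve quality but cost memory and time
    "best_of": 1,
    "vad": False,                  # -> vad_filter
    "temperature": 0,
    "condition_on_previous_text": False,
    "lang": "en",
}

model = WhisperModel("large-v3", device=cfg["devtype"], compute_type=cfg["cuda_com_type"])
segments, info = model.transcribe(
    "input.wav",                   # placeholder path
    beam_size=cfg["beam_size"],
    best_of=cfg["best_of"],
    vad_filter=cfg["vad"],
    temperature=cfg["temperature"],
    condition_on_previous_text=cfg["condition_on_previous_text"],
    language=cfg["lang"],
)

Broadly, compute_type and beam_size/best_of are the settings that most affect GPU memory use, while vad, temperature, and condition_on_previous_text mainly affect output quality.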

The speech-recognition-to-subtitles feature in that project is the same as this project's speech recognition; both use faster-whisper. Perhaps you can download it and give it a try.

https://github.com/jianchang512/pyvideotrans

web_address=127.0.0.1:9977
lang=en
devtype=cuda
cuda_com_type=float32
beam_size=5
best_of=5
vad=true
temperature=1
condition_on_previous_text=true
(screenshot SNAG-0278 attached)

It consumes all my GPU, which is nice.
But is the difference mainly speed, or does it change the final result?

{'web_address': '127.0.0.1:9977', 'lang': 'en', 'devtype': 'cuda', 'cuda_com_type': 'float32', 'beam_size': 5, 'best_of': 5, 'vad': True, 'temperature': 1, 'condition_on_previous_text': True}

The browser is open. If it does not open automatically, please open the URL manually http://127.0.0.1:9977
res.status_code=200
d={'version': 'v0.0.91', 'version_num': 91}
2024-01-29 21:28:53.8140166 [W:onnxruntime:Default, onnxruntime_pybind_state.cc:1983 onnxruntime::python::CreateInferencePybindStateModule] Init provider bridge failed.
CUDA failed with error out of memory
What options do I have to prevent out of memory?

pyvideotrans is a perfect tool.

web_address=127.0.0.1:9977
lang=en
devtype=cuda
cuda_com_type=int8
beam_size=1
best_of=1
vad=false
temperature=0
condition_on_previous_text=false

This is the most GPU-efficient configuration, but the results are relatively poor.
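If out-of-memory errors keep occurring, one option is to check free VRAM before loading the model and fall back to the cheaper settings automatically. This sketch assumes PyTorch with CUDA support is installed (not necessarily a dependency of this project), and the 10 GB threshold is only a rough guess:

import torch
from faster_whisper import WhisperModel

# Free/total GPU memory in bytes on device 0 (requires a CUDA-enabled PyTorch build).
free_bytes, total_bytes = torch.cuda.mem_get_info(0)
free_gb = free_bytes / 1024**3

# Rough heuristic: large-v3 in float32 with beam_size=5 needs far more VRAM than
# int8 with beam_size=1, so drop to the cheap settings when memory is tight.
compute_type = "float32" if free_gb > 10 else "int8"
beam_size = 5 if compute_type == "float32" else 1

model = WhisperModel("large-v3", device="cuda", compute_type=compute_type)
segments, info = model.transcribe("input.wav", beam_size=beam_size, vad_filter=True, language="en")
for seg in segments:
    print(seg.text)

Alternatively, simply keeping cuda_com_type=int8 and beam_size=1 in set.ini, as in the configuration above, is the most reliable way to stay within limited GPU memory.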