What result should be obtained under normal circumstances after preprocessing?

Question

What result should be obtained under normal circumstances after preprocessing?

Xuzhejia opened this issue 6 months ago · comments

What result should be obtained under normal circumstances after preprocessing?
Currently I only have a 0KB file called lrs3_train_transcript_lengths_seg24s.csv. Is this correct?

Harshit Prajapati · Answer 1 · Tue Jan 09 2024 13:03:38 GMT+0800 (China Standard Time)

After preprocessing, you should be able to see training files and its tokens in "lrs3_train_transcript_lengths_seg24s.csv". It should not 0KB.

Xuzhejia · Answer 2 · Tue Jan 09 2024 14:06:26 GMT+0800 (China Standard Time)

After preprocessing, you should be able to see training files and its tokens in "lrs3_train_transcript_lengths_seg24s.csv". It should not 0KB.

Not sure if I entered the wrong command
Can you give me your command for reference?
Thank you.

malu01 · Answer 3 · Tue Feb 20 2024 10:33:57 GMT+0800 (China Standard Time)

My problem was in the data_module.py file, which was trying to read an mp4 file and got an error,
waveform, sample_rate = torchaudio.load(data_filename, normalize=True)
causing subsequent steps to be skipped. Is this normal and how can it be resolved?

Xuzhejia · Answer 4 · Tue Feb 20 2024 16:45:34 GMT+0800 (China Standard Time)

My problem was in the data_module.py file, which was trying to read an mp4 file and got an error, waveform, sample_rate = torchaudio.load(data_filename, normalize=True) causing subsequent steps to be skipped. Is this normal and how can it be resolved?
I also found this problem and I'm still looking for a solution.

malu01 · Answer 5 · Tue Feb 20 2024 18:05:09 GMT+0800 (China Standard Time)

My problem was in the data_module.py file, which was trying to read an mp4 file and got an error, waveform, sample_rate = torchaudio.load(data_filename, normalize=True) causing subsequent steps to be skipped. Is this normal and how can it be resolved?
I also found this problem and I'm still looking for a solution.

    def load_audio(self, data_filename):
        video = VideoFileClip(data_filename)
        video.audio.write_audiofile(data_filename[:-1]+"3",logger=None)
        waveform, sample_rate = torchaudio.load(data_filename[:-1]+"3", normalize=True)
        return waveform, sample_rate

This is my solution.

Xuzhejia · Answer 6 · Tue Feb 20 2024 18:43:30 GMT+0800 (China Standard Time)

My problem was in the data_module.py file, which was trying to read an mp4 file and got an error, waveform, sample_rate = torchaudio.load(data_filename, normalize=True) causing subsequent steps to be skipped. Is this normal and how can it be resolved?
I also found this problem and I'm still looking for a solution.
    def load_audio(self, data_filename):
        video = VideoFileClip(data_filename)
        video.audio.write_audiofile(data_filename[:-1]+"3",logger=None)
        waveform, sample_rate = torchaudio.load(data_filename[:-1]+"3", normalize=True)
        return waveform, sample_rate
This is my solution.

def load_audio(self, data_filename):
    probe = ffmpeg.probe(data_filename)
    process = (
    ffmpeg.input(data_filename)
    .output('-', format='wav', acodec='pcm_s16le', ar='16000')
    .run_async(pipe_stdout=True, quiet=True)
    )
    stdout, _ = process.communicate()

    waveform, sample_rate = torchaudio.load(io.BytesIO(stdout), normalize=True)
    return waveform, sample_rate

This also works