No fast pitch-shift ratios could be computed for the given sample rate and transpose range.

Question

Crazylov3 opened this issue 2 years ago · comments

I use PitchShift with min=-0.9 and max=1.1, and it raises "No fast pitch-shift ratios could be computed for the given sample rate and transpose range." I don't get this error when I use the CPU version on GitHub. Do you have any plan to fix it?

Iver Jordal · Answer 1 · Sat Jul 02 2022 16:56:48 GMT+0800 (China Standard Time)

Hey

Does it work if you use 0.9 instead of -0.9?

Iver Jordal · Answer 2 · Sat Jul 02 2022 16:57:32 GMT+0800 (China Standard Time)

And what do you mean by "cpu version github"? Are you referring to https://github.com/iver56/audiomentations?

Iver Jordal · Answer 3 · Sat Jul 02 2022 16:58:16 GMT+0800 (China Standard Time)

What's the sample rate?

Crazylov3 · Answer 4 · Sat Jul 02 2022 17:02:14 GMT+0800 (China Standard Time)

> Hey
>
> Does it work if you use 0.9 instead of -0.9?

No, it doesn't.

Crazylov3 · Answer 5 · Sat Jul 02 2022 17:02:35 GMT+0800 (China Standard Time)

> What's the sample rate?

It is 8000.

Crazylov3 · Answer 6 · Sat Jul 02 2022 17:03:23 GMT+0800 (China Standard Time)

> And what do you mean by "cpu version github"? Are you referring to https://github.com/iver56/audiomentations?

Yep, that's what I mean.

Iver Jordal · Answer 7 · Sat Jul 02 2022 20:44:02 GMT+0800 (China Standard Time)

Oh, that is indeed a low sample rate. Maybe you can use audiomentations for now then, since it works there

@KentoNishi Have you tried torch-pitch-shift with sr=8000?

Crazylov3 · Answer 8 · Sat Jul 02 2022 20:52:12 GMT+0800 (China Standard Time)

Torch-pitch-shift works fine, thank you!

Iver Jordal · Answer 9 · Sat Jul 02 2022 20:59:32 GMT+0800 (China Standard Time)

Good :) torch-audiomentations actually depends on torch-pitch-shift, but uses its get_fast_shifts feature. So yeah, if you use torch-pitch-shift directly without get_fast_shifts, it'll probably work.
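To illustrate why a low sample rate can leave no candidates, here is a minimal sketch of a fast-ratio search. It assumes, without reproducing the library's actual code, that fast ratios are fractions whose numerator and denominator are products of the sample rate's prime factors, and that the transpose range is given in semitones. `prime_factors` and `fast_ratios_in_range` are hypothetical helper names, not part of torch-pitch-shift.

```python
from fractions import Fraction
from itertools import combinations
from math import prod


def prime_factors(n: int) -> list[int]:
    """Return the prime factors of n with multiplicity, e.g. 12 -> [2, 2, 3]."""
    factors = []
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors.append(d)
            n //= d
        d += 1
    if n > 1:
        factors.append(n)
    return factors


def fast_ratios_in_range(sample_rate: int, min_semitones: float, max_semitones: float):
    """Hypothetical helper: list candidate ratios inside a semitone range.

    Assumption: a "fast" ratio is i / j where i and j are products of
    non-empty subsets of the sample rate's prime factors.
    """
    lo = 2.0 ** (min_semitones / 12.0)  # semitones -> frequency ratio
    hi = 2.0 ** (max_semitones / 12.0)
    factors = prime_factors(sample_rate)
    products = set()
    for k in range(1, len(factors) + 1):
        for combo in combinations(factors, k):
            products.add(prod(combo))
    ratios = {Fraction(i, j) for i in products for j in products}
    # A ratio of exactly 1 would be no shift at all, so it is excluded.
    return sorted(r for r in ratios if lo <= r <= hi and r != 1)


# The case from this thread: 8000 Hz with a range of -0.9 to 1.1 semitones.
print(fast_ratios_in_range(8000, -0.9, 1.1))  # prints []
```

Under these assumptions, 8000 = 2^6 · 5^3 yields no ratio other than 1 inside that range, which is consistent with the error reported above.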

Kento Nishi · Answer 10 · Sun Jul 03 2022 00:53:48 GMT+0800 (China Standard Time)

Good to hear that it works!

AliKarimi95 · Answer 11 · Wed Aug 24 2022 00:13:26 GMT+0800 (China Standard Time)

I have the same issue (version 0.11.0) with sample_rate = 16000 and (min, max) = (-0.2, 0.2). However, when I changed the range to (-0.5, 0.5), the error didn't appear. In my case the dataset consists of piano sounds, so shifting the pitch too much can produce invalid data. Therefore, widening the range is not a solution.
I have also tested torch_pitch_shift. It works sometimes, but due to excessive memory consumption the program crashed randomly for some shift values (for example, -0.16093115439744646).
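One possible explanation for the difference between the two ranges, sketched under assumptions: if the fast ratios closest to 1 for 16000 Hz are 125/128 and 128/125 (an assumption based on factoring 16000 = 2^7 · 5^3, not taken from the library's output), they correspond to roughly ±0.41 semitones. The helpers below are local definitions using the standard semitone formula, and `usable_candidates` is a hypothetical name.

```python
import math
from fractions import Fraction


def semitones_to_ratio(semitones: float) -> float:
    """Convert a shift in semitones to a frequency ratio (12 semitones = 2x)."""
    return 2.0 ** (semitones / 12.0)


def ratio_to_semitones(ratio) -> float:
    """Convert a frequency ratio back to semitones."""
    return 12.0 * math.log2(float(ratio))


# Assumed closest-to-1 candidates for a 16000 Hz sample rate.
CANDIDATES = [Fraction(125, 128), Fraction(128, 125)]


def usable_candidates(min_semitones: float, max_semitones: float):
    """Hypothetical helper: keep the assumed candidates that fall in the range."""
    lo = semitones_to_ratio(min_semitones)
    hi = semitones_to_ratio(max_semitones)
    return [c for c in CANDIDATES if lo <= c <= hi]


for lo, hi in [(-0.2, 0.2), (-0.5, 0.5)]:
    found = usable_candidates(lo, hi)
    shifts = [round(ratio_to_semitones(c), 3) for c in found]
    print(f"range ({lo}, {hi}) semitones -> {len(found)} candidate(s): {shifts}")
```

With these assumed candidates, the (-0.2, 0.2) range contains none, while (-0.5, 0.5) contains both, which matches the behavior described above.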

Also, the running time increased dramatically after adding the PitchShift transform. On my laptop, with six other transforms applied, the average augmentation time per 10 s example is:

| Setting | Average augmentation time per example (s) |
| --- | --- |
| without `PitchShift` | `0.026` |
| with `PitchShift` | `0.232` |
| with `pitch_shift` function of `torch_pitch_shift` | `3.951` |

Kento Nishi · Answer 12 · Wed Aug 24 2022 02:33:54 GMT+0800 (China Standard Time)

Pitch shifts are generally pretty intensive operations, so I'm not too surprised about the increase in execution time. Regarding speed issues with some pitch-shift factors: unfortunately, some factors aren't ideal for speed, so the rate of data transfer to the GPU becomes the bottleneck. In your specific case, it may be better to apply a CPU-based pitch shift instead.

Iver Jordal · Answer 13 · Wed Aug 24 2022 02:45:03 GMT+0800 (China Standard Time)

That's an interesting use case, AliKarimi95

Note: Although the pitch shift transform in torch-audiomentations can be comparatively fast on GPU, it is slow on CPU. When running pitch shift on CPU, the one in audiomentations is roughly 3x as fast.