GriffinLim. RuntimeError: Given transposed=1, weight of size [1026, 1, 1024], expected input[1, 160, 181] to have 1026 channels, but got 160 channels instead

Question

GriffinLim. RuntimeError: Given transposed=1, weight of size [1026, 1, 1024], expected input[1, 160, 181] to have 1026 channels, but got 160 channels instead

psydok opened this issue 4 years ago · comments

Run "python inference.py"

2021-05-25 11:41:42.188662: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudart.so.11.0
2021-05-25 11:41:44.038910: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcuda.so.1
2021-05-25 11:41:44.067997: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:937] successful NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA node zero
2021-05-25 11:41:44.068584: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1733] Found device 0 with properties: 
pciBusID: 0000:00:04.0 name: Tesla T4 computeCapability: 7.5
coreClock: 1.59GHz coreCount: 40 deviceMemorySize: 14.75GiB deviceMemoryBandwidth: 298.08GiB/s
2021-05-25 11:41:44.068651: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudart.so.11.0
2021-05-25 11:41:44.071265: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcublas.so.11
2021-05-25 11:41:44.071371: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcublasLt.so.11
2021-05-25 11:41:44.073321: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcufft.so.10
2021-05-25 11:41:44.073801: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcurand.so.10
2021-05-25 11:41:44.073918: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcusolver.so.11'; dlerror: libcusolver.so.11: cannot open shared object file: No such file or directory; LD_LIBRARY_PATH: /usr/lib64-nvidia
2021-05-25 11:41:44.074408: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcusparse.so.11
2021-05-25 11:41:44.074614: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudnn.so.8
2021-05-25 11:41:44.074639: W tensorflow/core/common_runtime/gpu/gpu_device.cc:1766] Cannot dlopen some GPU libraries. Please make sure the missing libraries mentioned above are installed properly if you would like to use GPU. Follow the guide at https://www.tensorflow.org/install/gpu for how to download and setup the required libraries for your platform.
Skipping registering GPU devices...
2021-05-25 11:41:44.074929: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations:  AVX2 FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2021-05-25 11:41:44.075066: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1258] Device interconnect StreamExecutor with strength 1 edge matrix:
2021-05-25 11:41:44.075084: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1264]      
Starting
Text :  С трево+жным чу+вством беру+сь я+ за+ перо+
Checkpoint :  checkpoints/sova_fix/sova_fix_fastspeech_7788502_61k_steps.pyt
2021-05-25 11:42:00.754573: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:176] None of the MLIR Optimization Passes are enabled (registered 2)
2021-05-25 11:42:00.754979: I tensorflow/core/platform/profile_utils/cpu_utils.cc:114] CPU Frequency: 2199995000 Hz
TTS synthesis
predicting
Traceback (most recent call last):
  File "inference.py", line 259, in <module>
    main(sys.argv[1:])
  File "inference.py", line 217, in main
    wav = griffin_lim(m, stft, 30)
  File "/content/drive/MyDrive/FastSpeech2-1/dataset/audio_processing.py", line 239, in griffin_lim
    signal = stft_fn.inverse(magnitudes, angles).squeeze(1)
  File "/content/drive/MyDrive/FastSpeech2-1/utils/stft.py", line 122, in inverse
    padding=0
RuntimeError: Given transposed=1, weight of size [1026, 1, 1024], expected input[1, 160, 181] to have 1026 channels, but got 160 channels instead

I want to synthesize speech using the griffin-lim algorithm, but an error comes out.
I have set in default.yaml "melgan_vocoder: True".

I don’t understand what’s wrong. What is the error?
How to fix?