ABC0408 / WaveImage

vocoder

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

in the wave-image.ipynb file, It provides a way to build a bridge between pictures and voice.

Plan

After convert audio to wave-image, now my task also change to image to image translation.

  • Part1: read some papers about image-to-image translation
  • Part2: thinking about the difference between wave-image and others real world image

Part 1:

Papers

    • Pix2Pix Image-to-Image Translation with Conditional Adversarial Networks [paper] [code] [notes]
    • Pix2PixHD High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs [paper] [code] [notes]
    • CycleGAN Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks [paper] [code] [notes]
    • BicycleGAN Toward Multimodal Image-to-Image Translation [paper] [code] [notes]

Part 2:

  1. the object in the image has their own shape, color ,,,. In the wave-image, it consists of different words. Each word has their own shape, color.

About

vocoder


Languages

Language:Jupyter Notebook 100.0%