Aviv Shamsian's repositories
SRGAN-Keras-Implementation
"Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network", implemented in Keras
Pytorch-MNIST-colab
MNIST image classification using PyTorch
ContinuousParetoMTL
[ICML 2020] PyTorch Code for "Efficient Continuous Pareto Exploration in Multi-Task Learning"
deep-weight-space-augmentations
Official implementation of "Improved Generalization of Weight Space Networks via Augmentations", ICML 2024
VAE_pytorch
Simple variational autoencoder implementation
AVEC
[WACV 2023] Audio-Visual Efficient Conformer (AVEC) for Robust Speech Recognition
faster-whisper
Faster Whisper transcription with CTranslate2
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
insp
[NeurIPS 2022] "Signal Processing for Implicit Neural Representations" by Dejia Xu*, Peihao Wang*, Yifan Jiang, Zhiwen Fan, Zhangyang Wang
kaolin-wisp
NVIDIA Kaolin Wisp is a PyTorch library powered by NVIDIA Kaolin Core to work with neural fields (including NeRFs, NGLOD, instant-ngp and VQAD).
Lip-to-Speech-Synthesis-in-the-Wild
PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP 2023)
rd-blender-docker
A collection of Docker containers for running Blender headless or distributed ✨
Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
SemSegPipeline
A simpler way of reading and augmenting image segmentation data into TensorFlow
SkeletonGroupActivityRecognition
Learning Group Activities from Skeletons without Individual Action Labels
syncnet_python
Out of time: automated lip sync in the wild
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Visual-Context-Attentional-GAN
PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS 2021)