phurich's repositories
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:PythonMIT000
dl-tutorial
a quick tutorial of deep learning
Language:Jupyter NotebookMIT000
DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.
Language:PythonCC-BY-SA-4.0000
fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
Language:Jupyter NotebookApache-2.0000
Language:Jupyter NotebookApache-2.0000
TVLT
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
Language:Jupyter NotebookMIT000
word-discovery
Word Discovery in Visually Grounded, Self-Supervised Speech Models
Language:Jupyter NotebookBSD-3-Clause000