zeyu li's repositories
douyin_auto-reply
使用selenium
ccf-deadlines
⏰ CCF recommendation conference Deadline Countdowns / Please star this project, thanks~
AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
automatic-recolorization
Uses ML colorization methods to restore image color (RGB -> IR (Gray+extra data) -> RGB)
deit
Official DeiT repository
flamingo-mini
Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training
Human-Falling-Detect-Tracks
AlphaPose + ST-GCN + SORT.
lizeyujack
Config files for my GitHub profile.
model
new
MOSS
An open-source tool-augmented conversational language model from Fudan University
NS-Dial
An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation
psla
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
S-Prompts
Code for NeurIPS 2022 paper “S-Prompts Learning with Pre-trained Transformers: An Occam’s Razor for Domain Incremental Learning“
simple-icons
SVG icons for popular brands
ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Zero_Shot_Audio_Source_Separation
The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022