You Zhang (yzyouzhang)

yzyouzhang

Geek Repo

Company:University of Rochester

Location:NY, US

Home Page:https://yzyouzhang.com

Twitter:@yzyouzhang

Github PK Tool:Github PK Tool


Organizations
AirLabUR

You Zhang's starred repositories

openpilot

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system in 275+ supported cars.

Language:PythonLicense:MITStargazers:48813Issues:1302Issues:2759

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:34773Issues:360Issues:65

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language:PythonLicense:NOASSERTIONStargazers:2566Issues:37Issues:50

ai-audio-startups

Community list of startups working with AI in audio and music technology

s2cnn

Spherical CNNs

Language:PythonLicense:MITStargazers:940Issues:28Issues:48

Awesome-Deepfakes-Detection

A list of tools, papers and code related to Deepfake Detection.

dla

Deep learning for audio processing

Language:Jupyter NotebookLicense:MITStargazers:548Issues:23Issues:3

inspect_ai

Inspect: A framework for large language model evaluations

Language:PythonLicense:MITStargazers:442Issues:10Issues:71

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:406Issues:16Issues:40

spherical-cnn

Demo code for the paper "Learning SO(3) Equivariant Representations with Spherical CNNs"

Language:PythonLicense:MITStargazers:289Issues:16Issues:13

FBPINNs

Solve forward and inverse problems related to partial differential equations using finite basis physics-informed neural networks (FBPINNs)

Language:PythonLicense:MITStargazers:277Issues:11Issues:17

Pengi

An Audio Language model for Audio Tasks

Language:PythonLicense:MITStargazers:269Issues:14Issues:13

MyVLM

Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)

Language:PythonLicense:NOASSERTIONStargazers:135Issues:14Issues:4

ml-spatial-librispeech

A large synthetic dataset of spatial audio with multiple labels

License:NOASSERTIONStargazers:81Issues:17Issues:0

StyleTalk

Official release of StyleTalk dataset.

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:29Issues:3Issues:7

brian2hears

"Brian Hears" auditory modelling toolbox for the brian2 simulator

Language:PythonLicense:NOASSERTIONStargazers:25Issues:7Issues:12

Child-ASR-Paper

A list of papers for child ASR

License:MITStargazers:24Issues:2Issues:0
Language:PythonStargazers:22Issues:0Issues:0

audio_diarization_annotation

Audio Diarization Annotation tool

Language:JavaScriptLicense:Apache-2.0Stargazers:21Issues:4Issues:2

MultiOOD

Scaling Out-of-Distribution Detection for Multiple Modalities

Language:PythonStargazers:19Issues:1Issues:0

1d-spectral-optimal-transport

An 1D optimal transport inspired loss function in the spectral domain. Can be used for improving frequency localization/estimation in differentiable digital signal processing. Experiments from paper: "Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport".

Language:PythonLicense:Apache-2.0Stargazers:15Issues:2Issues:0

conformer-based-classifier-for-anti-spoofing

Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.

Language:PythonLicense:BSD-3-ClauseStargazers:13Issues:2Issues:1

CtrSVDD2024_Baseline

Baseline system for SVDD 2024 Challenge CtrSVDD track

LAPChallenge

The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.

License:NOASSERTIONStargazers:6Issues:2Issues:0

DeepFake-Detection

DeepFake Detection using Siamese Neural Networks

Language:PythonLicense:NOASSERTIONStargazers:3Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0