Xiangyu Zhao (hsiangyuzhao)

Company: Shanghai Jiao Tong University

Location: Shanghai

Home Page: hsiangyuzhao.github.io

Xiangyu Zhao's starred repositories

Combined_Dataset_for_Speech_Emotion_Recognition

A collection combining a total of 8 English speech datasets for speech emotion recognition (SER)

Language: Jupyter Notebook | License: MIT | Stargazers: 6 | Issues: 0

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language: Python | License: MIT | Stargazers: 1829 | Issues: 0

depression-detect

Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Language: Python | Stargazers: 287 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8687 | Issues: 0

openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Language: Python | License: MIT | Stargazers: 677 | Issues: 0

zouxian

Permanent Apple Intelligence + Xcode Predictive Code Completion for Chinese-market Mac computers

Language: Shell | License: MIT | Stargazers: 681 | Issues: 0

iRingo

Unlock the full set of Apple features and integrated services

Language: Vim Snippet | License: GPL-3.0 | Stargazers: 9374 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 30291 | Issues: 0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language: Python | License: MIT | Stargazers: 2399 | Issues: 0

EmoLLM

Large language model for mental health, LLM, The Big Model of Mental Health, Finetune, InternLM2, InternLM2.5, Qwen, ChatGLM, Baichuan, DeepSeek, Mixtral, LLama3, GLM4, Qwen2, LLama3.1

Language: Python | License: MIT | Stargazers: 790 | Issues: 0

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language: Python | License: NOASSERTION | Stargazers: 569 | Issues: 0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | License: Apache-2.0 | Stargazers: 1717 | Issues: 0

Language: MATLAB | License: GPL-3.0 | Stargazers: 10369 | Issues: 0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 3235 | Issues: 0

AnyGPT

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Language: Python | Stargazers: 752 | Issues: 0

Awesome-Multimodal-Large-Language-Models

Latest Advances on Multimodal Large Language Models

Stargazers: 12089 | Issues: 0

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

Stargazers: 629 | Issues: 0

downkyi

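哔哩下载姬 (downkyi), a video download tool for the Bilibili website. Supports batch downloads as well as 8K, HDR, and Dolby Vision, and provides a toolbox (audio/video extraction, watermark removal, etc.).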

Language: C# | License: GPL-3.0 | Stargazers: 20830 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 2576 | Issues: 0

CVPR2024-Papers-with-Code

A collection of CVPR 2024 papers and open-source projects

Stargazers: 17901 | Issues: 0

connected-components-3d

Connected components on discrete and continuous multilabel 3D & 2D images. Handles 26, 18, and 6 connected variants; periodic boundaries (4, 8, & 6)

Language: C++ | License: LGPL-3.0 | Stargazers: 361 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 5688 | Issues: 0

learning_research

My personal research experience

Stargazers: 5622 | Issues: 0

CLIP-Driven-Universal-Model

[ICCV 2023] CLIP-Driven Universal Model; ranked first in the MSD Competition.

Language: Python | License: NOASSERTION | Stargazers: 566 | Issues: 0

AbdomenAtlas

[NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes + 9 annotated classes)

Language: Python | License: NOASSERTION | Stargazers: 207 | Issues: 0

open_lm

A repository for research on medium-sized language models.

Language: Python | License: MIT | Stargazers: 473 | Issues: 0

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python | License: AGPL-3.0 | Stargazers: 39956 | Issues: 0

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Language: Go | License: MIT | Stargazers: 92676 | Issues: 0

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models for text-to-image generation, image/video restoration/enhancement, etc.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6896 | Issues: 0

mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Language: Python | License: Apache-2.0 | Stargazers: 8051 | Issues: 0