Wenshuo Chen (winshot-thu)

winshot-thu

Geek Repo

Company:Tsinghua University

Github PK Tool:Github PK Tool

Wenshuo Chen's starred repositories

sail

Library for streaming data and incremental learning algorithms.

Language:PythonLicense:MITStargazers:21Issues:0Issues:0

FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

Stargazers:847Issues:0Issues:0

FoxbatDB

轻量级K-V数据库:支持事务ACID、兼容Redis命令

Language:C++Stargazers:6Issues:0Issues:0

c2f-seg

Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).

Language:PythonLicense:Apache-2.0Stargazers:42Issues:0Issues:0
Language:PythonLicense:MITStargazers:313Issues:0Issues:0

fiftyone

Refine high-quality datasets and visual AI models

Language:PythonLicense:Apache-2.0Stargazers:8787Issues:0Issues:0

SignAvatars

SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark

Stargazers:61Issues:0Issues:0

Firefly

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:PythonStargazers:5751Issues:0Issues:0

thuthesis

LaTeX Thesis Template for Tsinghua University

Language:TeXLicense:LPPL-1.3cStargazers:4558Issues:0Issues:0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1810Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:56070Issues:0Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:93695Issues:0Issues:0

Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.

Language:PythonStargazers:202Issues:0Issues:0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40521Issues:0Issues:0

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7006Issues:0Issues:0

Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Language:PythonLicense:Apache-2.0Stargazers:4012Issues:0Issues:0

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language:PythonLicense:Apache-2.0Stargazers:6144Issues:0Issues:0

priorMDM

The official implementation of the paper "Human Motion Diffusion as a Generative Prior"

Language:PythonLicense:MITStargazers:426Issues:0Issues:0

MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Language:PythonLicense:MITStargazers:1479Issues:0Issues:0

GPT4Tools

GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.

Language:PythonLicense:NOASSERTIONStargazers:757Issues:0Issues:0

MultiAct_RELEASE

Official PyTorch implementation of "MultiAct: Long-Term 3D Human Motion Generation from Multiple Action Labels", in AAAI 2023 (Oral presentation).

Language:PythonLicense:MITStargazers:58Issues:0Issues:0

T2M-GPT

(CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”

Language:PythonLicense:Apache-2.0Stargazers:587Issues:0Issues:0

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

Stargazers:3305Issues:0Issues:0

motion-latent-diffusion

[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model

Language:PythonLicense:MITStargazers:580Issues:0Issues:0

MotionDiffuse

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Language:PythonLicense:NOASSERTIONStargazers:846Issues:0Issues:0

D3DP

[ICCV2023] The PyTorch implementation for "Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation"

Language:PythonLicense:MITStargazers:155Issues:0Issues:0

prolificdreamer

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)

Language:PythonLicense:Apache-2.0Stargazers:1481Issues:0Issues:0

Diffpose

[CVPR 2023] DiffPose: Toward More Reliable 3D Pose Estimation

Language:PythonLicense:MITStargazers:147Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:8492Issues:0Issues:0

Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Language:PythonLicense:MITStargazers:1677Issues:0Issues:0