winshot-thu

Wenshuo Chen's starred repositories

sail

Library for streaming data and incremental learning algorithms.

Language: Python | License: MIT | 2100

FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

84700

FoxbatDB

Lightweight key-value database: supports ACID transactions and is compatible with Redis commands

Language: C++ | 600

c2f-seg

Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).

Language: Python | License: Apache-2.0 | 4200

fiftyone

Refine high-quality datasets and visual AI models

Language: Python | License: Apache-2.0 | 878700

SignAvatars

SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark

6100

Firefly

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Language: Python | 575100

thuthesis

LaTeX Thesis Template for Tsinghua University

Language: TeX | License: LPPL-1.3c | 455800

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | 181000

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | 5607000

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | 9369500

Phoneme recognition using the pre-trained models Wav2vec2, HuBERT, and WavLM. This project compares three self-supervised models, Wav2vec (2019, 2020), HuBERT (2021), and WavLM (2022), each pretrained on a corpus of English speech, and uses them in various ways to perform phoneme recognition for different languages with a network trained using the Connectionist Temporal Classification (CTC) algorithm.

Language: Python | 20200

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python | License: Apache-2.0 | 4052100

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | 700600

Fengshenbang-LM

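Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.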

Language: Python | License: Apache-2.0 | 401200

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language: Python | License: Apache-2.0 | 614400

priorMDM

The official implementation of the paper "Human Motion Diffusion as a Generative Prior"

Language: Python | License: MIT | 42600

MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Language: Python | License: MIT | 147900

GPT4Tools

GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.

Language: Python | License: NOASSERTION | 75700