Yang Tan (tanyang1231)

Company: Tsinghua University

Location: China

Home Page: https://tanyang1231.github.io/

Yang Tan's starred repositories

Language: Python | License: Apache-2.0 | Stargazers: 83 | Issues: 0

MMDU

Official repository of MMDU dataset

Language: Python | License: Apache-2.0 | Stargazers: 49 | Issues: 0

threestudio

A unified framework for 3D content generation.

Language: Python | License: Apache-2.0 | Stargazers: 5949 | Issues: 0

Awesome-Text-to-3D

A growing curation of Text-to-3D, Diffusion-to-3D works.

Language: TeX | Stargazers: 441 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 66732 | Issues: 0

FeatUp

Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" (ICLR 2024)

Language: Jupyter Notebook | License: MIT | Stargazers: 1296 | Issues: 0

DALLE-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Language: Python | License: MIT | Stargazers: 5537 | Issues: 0

Awesome-Multimodal-Large-Language-Models

Latest Advances on Multimodal Large Language Models

Stargazers: 10687 | Issues: 0

CogVideo

Text-to-video generation. The repo for the ICLR 2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

Language: Python | License: Apache-2.0 | Stargazers: 3547 | Issues: 0

Qwen2

Qwen2 is the large language model series developed by the Qwen team, Alibaba Cloud.

Language: Shell | Stargazers: 6264 | Issues: 0

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | License: Apache-2.0 | Stargazers: 3751 | Issues: 0

LLMs_interview_notes

LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers.

License: MIT | Stargazers: 162 | Issues: 0

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language: Python | License: Apache-2.0 | Stargazers: 1587 | Issues: 0

SegmentationTransferability

Code for the ICIP 2023 paper "Efficient Prediction of Model Transferability in Semantic Segmentation Tasks"

Language: Python | Stargazers: 2 | Issues: 0

Meteor

Official PyTorch implementation of Mamba-based traversal of rationale (Meteor), which aims to improve vision-language performance across diverse capabilities. (Under Review)

Language: Python | License: MIT | Stargazers: 92 | Issues: 0

sentencepiece_chinese_bpe

Train a Chinese vocabulary with BPE in sentencepiece and use it in transformers.

Language: Python | Stargazers: 98 | Issues: 0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 11261 | Issues: 0

LLMs_interview_notes

This repository mainly collects interview questions for large language model (LLM) algorithm engineers.

License: Apache-2.0 | Stargazers: 1180 | Issues: 0

License: Apache-2.0 | Stargazers: 18 | Issues: 0

docvqa

Document Visual Question Answering

Language: Python | License: MIT | Stargazers: 110 | Issues: 0

swift

ms-swift: Use PEFT or full-parameter training to fine-tune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 2405 | Issues: 0

PLLaVA

Official repository for the paper PLLaVA

Language: Python | Stargazers: 486 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 75 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance.

Language: Python | License: MIT | Stargazers: 4181 | Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 7977 | Issues: 0

GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Language: Python | License: MIT | Stargazers: 1344 | Issues: 0

llama3-Chinese-chat

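Llama3 Chinese repository (aggregated resources: interesting fine-tuned and heavily modified weights from community members and vendors, plus tutorial videos and documentation for training, inference, evaluation, and deployment)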

Language: Python | Stargazers: 3324 | Issues: 0

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | Stargazers: 21930 | Issues: 0

Language: Python | Stargazers: 18 | Issues: 0

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language: Python | License: Apache-2.0 | Stargazers: 1153 | Issues: 0