THUDM

Location: FIT Building, Tsinghua University

Home Page: https://huggingface.co/THUDM

Twitter: @thukeg

THUDM's repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python | License: Apache-2.0 | Stargazers: 39908 | Issues: 394 | Issues: 1288

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

Language: Python | License: NOASSERTION | Stargazers: 15602 | Issues: 135 | Issues: 615

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Language: Python | License: Apache-2.0 | Stargazers: 12967 | Issues: 99 | Issues: 751

CodeGeeX2

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Language: Python | License: Apache-2.0 | Stargazers: 7542 | Issues: 62 | Issues: 240

CogVLM

A state-of-the-art open visual language model | Multimodal pre-trained model

Language: Python | License: Apache-2.0 | Stargazers: 5567 | Issues: 65 | Issues: 397

VisualGLM-6B

A Chinese-English bilingual multimodal conversational language model

Language: Python | License: Apache-2.0 | Stargazers: 4024 | Issues: 40 | Issues: 346

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | License: Apache-2.0 | Stargazers: 3256 | Issues: 22 | Issues: 208

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language: Python | License: Apache-2.0 | Stargazers: 1980 | Issues: 29 | Issues: 124

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language: Python | License: Apache-2.0 | Stargazers: 1412 | Issues: 24 | Issues: 111

SwissArmyTransformer

SwissArmyTransformer is a flexible library for developing your own Transformer variants.

Language: Python | License: Apache-2.0 | Stargazers: 865 | Issues: 30 | Issues: 72

AutoWebGLM

An LLM-based Web Navigating Agent (KDD'24)

Language: Python | License: Apache-2.0 | Stargazers: 521 | Issues: 24 | Issues: 8

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Language: Python | License: Apache-2.0 | Stargazers: 272 | Issues: 22 | Issues: 16

AlignBench

A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)

RelayDiffusion

The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]

Language: Python | License: Apache-2.0 | Stargazers: 244 | Issues: 11 | Issues: 9

LongAlign

LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation

Language: Python | License: Apache-2.0 | Stargazers: 149 | Issues: 8 | Issues: 9

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 138 | Issues: 10 | Issues: 20

kgTransformer

kgTransformer: Pre-training for reasoning over complex KG queries (KDD'22)

Language: Python | License: Apache-2.0 | Stargazers: 80 | Issues: 12 | Issues: 3

ScenarioMeta

Source code and dataset for KDD 2019 paper "Sequential Scenario-Specific Meta Learner for Online Recommendation"

Language: Python | License: MIT | Stargazers: 80 | Issues: 9 | Issues: 3

ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Language: Python | Stargazers: 31 | Issues: 0 | Issues: 5

LVBench

LVBench: An Extreme Long Video Understanding Benchmark

MSAGPT

MSAGPT

Language: Python | License: Apache-2.0 | Stargazers: 13 | Issues: 8 | Issues: 0

RecDCL

RecDCL: Dual Contrastive Learning for Recommendation (WWW'24, Oral)

Language: Python | License: MIT | Stargazers: 13 | Issues: 3 | Issues: 1

Language: Python | License: NOASSERTION | Stargazers: 6 | Issues: 3 | Issues: 3

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0