唐国梁Tommy's starred repositories

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonLicense:MITStargazers:17751Issues:203Issues:364

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonLicense:Apache-2.0Stargazers:4190Issues:62Issues:163

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3727Issues:110Issues:68

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter NotebookStargazers:1401Issues:19Issues:32

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language:PythonLicense:Apache-2.0Stargazers:1058Issues:40Issues:10

dataverse

The Universe of Data. All about data, data science, and data engineering

Language:PythonLicense:Apache-2.0Stargazers:417Issues:8Issues:22
Language:PythonLicense:Apache-2.0Stargazers:314Issues:9Issues:11

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language:PythonLicense:MITStargazers:296Issues:11Issues:14

CosmicMan

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

sammo

A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)

Language:PythonLicense:MITStargazers:240Issues:8Issues:13
Language:PythonLicense:Apache-2.0Stargazers:236Issues:11Issues:9

GaussianCube

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

Language:PythonLicense:NOASSERTIONStargazers:115Issues:8Issues:3

DiJiang

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.

SPRIGHT

Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"

Language:PythonLicense:Apache-2.0Stargazers:76Issues:3Issues:2

LongICLBench

Code and Data for "Long-context LLMs Struggle with Long In-context Learning"

Language:PythonLicense:MITStargazers:74Issues:3Issues:4

ST-LLM

Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Language:PythonLicense:Apache-2.0Stargazers:71Issues:7Issues:15

FlexiDreamer

An official implementation of FlexiDreamer: Single Image-to-3D Generation with FlexiCubes.

Language:PythonLicense:MITStargazers:63Issues:4Issues:5

UPD

[arXiv2024] Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Language:PythonLicense:Apache-2.0Stargazers:49Issues:4Issues:0

MineLand

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Language:PythonLicense:MITStargazers:38Issues:5Issues:2

CCoT

[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"

Language:PythonLicense:MITStargazers:25Issues:0Issues:0

cmulab

CMU Linguistic Annotation Backend

LLM-Attributor

LLM Attributor: Attribute LLM's Generated Text to Training Data

Language:Jupyter NotebookLicense:MITStargazers:12Issues:7Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:11Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

QAGCN

This repository includes the code for our paper QAGCN: Answering Multi-Relation Questions via Single-Step Implicit Reasoning over Knowledge Graphs.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0