唐国梁Tommy's starred repositories

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language: Python | License: MIT | Stargazers: 18027 | Issues: 207 | Issues: 372

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 4362 | Issues: 61 | Issues: 176

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3895 | Issues: 114 | Issues: 73

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by the Qwen team at Alibaba Cloud.

MiniGemini

Official implementation for Mini-Gemini

Language: Python | License: Apache-2.0 | Stargazers: 2711 | Issues: 23 | Issues: 75

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language: Jupyter Notebook | Stargazers: 1507 | Issues: 20 | Issues: 36

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language: Python | License: Apache-2.0 | Stargazers: 1123 | Issues: 40 | Issues: 11

dataverse

The Universe of Data. All about data, data science, and data engineering

Language: Python | License: Apache-2.0 | Stargazers: 470 | Issues: 11 | Issues: 23
Language: Python | License: Apache-2.0 | Stargazers: 358 | Issues: 9 | Issues: 13

CosmicMan

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

GaussianCube

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

Language: Python | License: Apache-2.0 | Stargazers: 252 | Issues: 11 | Issues: 9
Language: Python | License: NOASSERTION | Stargazers: 123 | Issues: 8 | Issues: 4

ST-LLM

[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Language: Python | License: Apache-2.0 | Stargazers: 91 | Issues: 8 | Issues: 19

DiJiang

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.

Language: Python | License: MIT | Stargazers: 86 | Issues: 4 | Issues: 6

SPRIGHT

[ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"

Language: Python | License: Apache-2.0 | Stargazers: 85 | Issues: 3 | Issues: 3

LongICLBench

Code and Data for "Long-context LLMs Struggle with Long In-context Learning"

Language: Python | License: MIT | Stargazers: 83 | Issues: 3 | Issues: 4

Agent-Pro

The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization

FlexiDreamer

An official implementation of FlexiDreamer: Single Image-to-3D Generation with FlexiCubes.

UPD

[arXiv2024] Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Language: Python | License: Apache-2.0 | Stargazers: 60 | Issues: 4 | Issues: 1

CCoT

[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"

Language: Python | License: MIT | Stargazers: 47 | Issues: 1 | Issues: 3

MineLand

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

Language: Python | License: MIT | Stargazers: 42 | Issues: 6 | Issues: 2

LLM-Attributor

LLM Attributor: Attribute LLM's Generated Text to Training Data

Language: Jupyter Notebook | License: MIT | Stargazers: 15 | Issues: 7 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 14 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0

QAGCN

This repository includes the code for our paper "QAGCN: Answering Multi-Relation Questions via Single-Step Implicit Reasoning over Knowledge Graphs".

Language: Python | License: MIT | Stargazers: 1 | Issues: 2 | Issues: 0