Zirui Song's starred repositories

License: Apache-2.0 | Stargazers: 198 | Issues: 0

CausalVLR

CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning (an open-source framework for visual-linguistic causal reasoning)

Language: Python | License: Apache-2.0 | Stargazers: 118 | Issues: 0

DS-Agent

Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24

Language: Python | Stargazers: 73 | Issues: 0

Awesome-Embodied-AI

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

License: MIT | Stargazers: 179 | Issues: 0

streamv2v

Official PyTorch implementation of StreamV2V.

Language: Python | License: NOASSERTION | Stargazers: 399 | Issues: 0

matryoshka-mm

Matryoshka Multimodal Models

Language: Python | License: Apache-2.0 | Stargazers: 59 | Issues: 0

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language: Python | License: Apache-2.0 | Stargazers: 744 | Issues: 0

DeMamba

This repository is the code of paper 'DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark'.

Language: Python | License: Apache-2.0 | Stargazers: 35 | Issues: 0

MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Language: Python | License: NOASSERTION | Stargazers: 483 | Issues: 0

MotionLLM

[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos

Language: Python | License: NOASSERTION | Stargazers: 186 | Issues: 0

TRACE

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models

Language: Python | License: Apache-2.0 | Stargazers: 52 | Issues: 0
Language: Python | Stargazers: 24 | Issues: 0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 26927 | Issues: 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 23370 | Issues: 0

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language: JavaScript | License: MIT | Stargazers: 5384 | Issues: 0

llm-continual-learning-survey

Continual Learning of Large Language Models: A Comprehensive Survey

Stargazers: 168 | Issues: 0

sdft

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Language: Shell | Stargazers: 58 | Issues: 0

llmblueprint

[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"

Language: Jupyter Notebook | Stargazers: 61 | Issues: 0

Awesome-Multimodal-Chain-of-Thought

Collection of papers and repos for multimodal chain-of-thought

Stargazers: 3 | Issues: 0

MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Language: Jupyter Notebook | Stargazers: 298 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29177 | Issues: 0

LLM4Chem

Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset"

Language: Python | License: MIT | Stargazers: 51 | Issues: 0

crystal-text-llm

Large language models to generate stable crystals.

Language: Python | License: NOASSERTION | Stargazers: 60 | Issues: 0

MP5

[CVPR 2024] This is the official implementation of MP5

Language: Python | Stargazers: 66 | Issues: 0
Language: Python | License: MIT | Stargazers: 3 | Issues: 0
Language: Python | Stargazers: 8 | Issues: 0

Awesome-LLM-3D

Awesome-LLM-3D: a curated list of resources on Multi-modal Large Language Models in the 3D world

License: MIT | Stargazers: 857 | Issues: 0

gitbook

The open source frontend for GitBook doc sites

Language: TypeScript | License: GPL-3.0 | Stargazers: 26731 | Issues: 0

Mora

Mora: More like Sora for Generalist Video Generation

Language: Python | Stargazers: 1441 | Issues: 0

GeoChat

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Language: Python | Stargazers: 372 | Issues: 0