TGLTommy

唐国梁Tommy's starred repositories

StructEval

This is the official repository for the ACL 2024 paper "StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation"

Language: Python · Apache-2.0 · 300

omages

We present Object Images (Omages): An homage to the classic Geometry Images.

6800

MMIU

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Language: Python · 1700

ExoViP

[COLM 2024] ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning

Language: Python · MIT · 400

fantastic-data-engineering

Fantastic Data Engineering for Large Language Models

Apache-2.0 · 1500

Hallu-PI

The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs".

MIT · 400

flux

Official inference repo for FLUX.1 models

Language: Python · Apache-2.0 · 541700

nano-llama31

nanoGPT style version of Llama 3.1

Language: Python · 81700

redel

ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems.

Language: Python · MIT · 500

UnifiedMLLM

UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model

Apache-2.0 · 900

RAGFoundry

Framework for specializing LLMs for retrieval-augmented-generation tasks using fine-tuning.

Language: Python · Apache-2.0 · 13800

POA

Official implementation of the ECCV 2024 paper: POA

Apache-2.0 · 1800

RagLLaVA

Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training.

Language: Python · MIT · 800

TEVAD

Official implementation for the paper "TEVAD: Improved video anomaly detection with captions"

Language: Python · 1900

SwinBERT

Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"

Language: Python · MIT · 23500

CoCap

[ICCV 2023] Accurate and Fast Compressed Video Captioning

Language: Python · MIT · 3200

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook · Apache-2.0 · 885600