Shen Meng's starred repositories
Multi-Modality-Arena
Chatbot Arena meets multi-modality! Multi-Modality Arena lets you benchmark vision-language models side by side with images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
ft-pali-gemma
Notebooks for fine-tuning PaliGemma
data_management_LLM
Collection of training data management explorations for large language models
fast-detect-gpt
Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".
HallE_Control
HallE-Control: Controlling Object Hallucination in LMMs
Multimodal-AND-Large-Language-Models
Paper list on multimodal and large language models, used only to record papers I read from the daily arXiv for personal reference.
prismatic-vlms
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
vlm-evaluation
VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
GPT-SoVITS
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
sas-data-efficient-contrastive-learning
Official repository for SAS: Data-Efficient Contrastive Learning (ICML '23)