Shen Meng (MengShen0709)

MengShen0709

Geek Repo

Company:Nanyang Technological University

Location:Singapore

Home Page:https://mengshen0709.github.io/

Github PK Tool:Github PK Tool

Shen Meng's starred repositories

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language:PythonLicense:Apache-2.0Stargazers:5153Issues:0Issues:0

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

Language:PythonStargazers:414Issues:0Issues:0

ft-pali-gemma

Notebooks for fine tuning pali gemma

Language:Jupyter NotebookLicense:MITStargazers:30Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:8045Issues:0Issues:0

data_management_LLM

Collection of training data management explorations for large language models

Stargazers:235Issues:0Issues:0

fast-detect-gpt

Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".

Language:PythonLicense:MITStargazers:162Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:28114Issues:0Issues:0
Language:PythonLicense:AGPL-3.0Stargazers:16Issues:0Issues:0

HallE_Control

HallE-Control: Controlling Object Hallucination in LMMs

Language:PythonStargazers:19Issues:0Issues:0

OPERA

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Language:PythonLicense:MITStargazers:216Issues:0Issues:0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25184Issues:0Issues:0

dlrm

An implementation of a deep learning recommendation model (DLRM)

Language:PythonLicense:MITStargazers:3673Issues:0Issues:0

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Stargazers:478Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1107Issues:0Issues:0

LESS

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Language:Jupyter NotebookLicense:MITStargazers:289Issues:0Issues:0

prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Language:PythonLicense:MITStargazers:355Issues:0Issues:0

vlm-evaluation

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Language:PythonLicense:NOASSERTIONStargazers:67Issues:0Issues:0

Dromedary

Dromedary: towards helpful, ethical and reliable LLMs.

Language:PythonLicense:GPL-3.0Stargazers:1102Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18304Issues:0Issues:0

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language:PythonLicense:Apache-2.0Stargazers:2274Issues:0Issues:0

DataOptim

A collection of visual instruction tuning datasets.

Language:PythonLicense:MITStargazers:72Issues:0Issues:0

Bunny

A family of lightweight multimodal models.

Language:PythonLicense:Apache-2.0Stargazers:805Issues:0Issues:0

bmmal

[ACMMM 2023] BMMAL: Towards Balanced Active Learning for Multimodal Classification

Language:PythonLicense:CC-BY-4.0Stargazers:8Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++License:MITStargazers:29645Issues:0Issues:0

ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

Language:PythonLicense:NOASSERTIONStargazers:2759Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9300Issues:0Issues:0

TrustLLM

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

Language:PythonLicense:MITStargazers:364Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29615Issues:0Issues:0
Language:PythonStargazers:379Issues:0Issues:0

sas-data-efficient-contrastive-learning

Official repository for SAS Data Efficient Contrastive Learning ICML '23

Language:PythonStargazers:8Issues:0Issues:0