ZWCui (againcui)

ZWCui's starred repositories

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Language: Python | License: NOASSERTION | Stargazers: 2122 | Issues: 0

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language: Python | License: Apache-2.0 | Stargazers: 2351 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model approaching GPT-4o performance.

Language: Python | License: MIT | Stargazers: 4669 | Issues: 0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 36034 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 54906 | Issues: 0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | Stargazers: 11178 | Issues: 0

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Language: Python | License: MIT | Stargazers: 486 | Issues: 0

algorithm-visualizer

Interactive Online Platform that Visualizes Algorithms from Code

Language: JavaScript | License: MIT | Stargazers: 46486 | Issues: 0

Language: Python | License: NOASSERTION | Stargazers: 716 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19323 | Issues: 0

recognize-anything

Open-source and strong foundation image recognition models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2659 | Issues: 0

fMRI-reconstruction-NSD

fMRI-to-image reconstruction on the NSD dataset.

Language: Jupyter Notebook | License: MIT | Stargazers: 288 | Issues: 0

lvis-api

Python API for LVIS Dataset

Language: Python | License: NOASSERTION | Stargazers: 402 | Issues: 0

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language: Python | License: NOASSERTION | Stargazers: 1367 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9342 | Issues: 0

paco

This repo contains the documentation and code needed to use the PACO dataset: data loaders, training and evaluation scripts for object, part, and attribute prediction models, query evaluation scripts, and visualization notebooks.

Language: Python | License: MIT | Stargazers: 263 | Issues: 0

MedSAM

Segment Anything in Medical Images

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2543 | Issues: 0

VLPart

[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation

Language: Python | License: MIT | Stargazers: 343 | Issues: 0

OpenPSG

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Language: Python | License: MIT | Stargazers: 404 | Issues: 0

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 14438 | Issues: 0

grounded-segment-any-parts

Grounded Segment Anything: From Objects to Parts

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 378 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 18571 | Issues: 0

RelateAnything

Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.

Language: Python | License: Apache-2.0 | Stargazers: 438 | Issues: 0

XrayGLM

🩺 The first Chinese multimodal medical large model that can read chest X-rays and summarize chest radiographs.

Language: Python | License: NOASSERTION | Stargazers: 861 | Issues: 0

MedCLIP

EMNLP'22 | MedCLIP: Contrastive Learning from Unpaired Medical Images and Texts

Language: Python | Stargazers: 405 | Issues: 0

MiniGPT-4

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | License: BSD-3-Clause | Stargazers: 25209 | Issues: 0

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python | License: Apache-2.0 | Stargazers: 4051 | Issues: 0

XrayGPT

[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.

Language: Python | Stargazers: 450 | Issues: 0