JEONG JIHYEOK's starred repositories

MiniGPT-4

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python, License: BSD-3-Clause, Stargazers: 25328, Issues: 218, Issues: 459

open_clip

An open source implementation of CLIP.

Language: Python, License: NOASSERTION, Stargazers: 9926, Issues: 77, Issues: 478

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook, License: BSD-3-Clause, Stargazers: 9720, Issues: 97, Issues: 653

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 8907, Issues: 95, Issues: 394

oboe

Oboe is a C++ library that makes it easy to build high-performance audio apps on Android.

Language: C++, License: Apache-2.0, Stargazers: 3683, Issues: 139, Issues: 1161

Language: Python, License: Apache-2.0, Stargazers: 2529, Issues: 32, Issues: 244

mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Language: Python, License: MIT, Stargazers: 2254, Issues: 30, Issues: 229

NeuS

Code release for NeuS

Language: Python, License: MIT, Stargazers: 1564, Issues: 25, Issues: 133

Language: Python, License: MIT, Stargazers: 698, Issues: 15, Issues: 45

Language: Java, License: Apache-2.0, Stargazers: 388, Issues: 43, Issues: 18

purchases-android

Android in-app purchases and subscriptions made easy.

Language: Kotlin, License: MIT, Stargazers: 247, Issues: 12, Issues: 133

llama2-fine-tune

Scripts for fine-tuning Llama2 via SFT and DPO.

Visual-Adversarial-Examples-Jailbreak-Large-Language-Models

Repository for the paper (AAAI 2024, Oral): Visual Adversarial Examples Jailbreak Large Language Models

Language: Python, License: BSD-3-Clause, Stargazers: 164, Issues: 3, Issues: 30

CR-GAN

Yu Tian et al. "CR-GAN: Learning Complete Representations for Multi-view Generation", IJCAI 2018

AMBER

An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation

Language: Python, License: Apache-2.0, Stargazers: 89, Issues: 1, Issues: 3

FigStep

Jailbreaking Large Vision-language Models via Typographic Visual Prompts

Language: Python, License: MIT, Stargazers: 76, Issues: 3, Issues: 7

WildLight

Official implementation of the CVPR 2023 paper "In-the-wild Inverse Rendering with a Flashlight"

Language: Python, License: MIT, Stargazers: 74, Issues: 7, Issues: 9

HA-DPO

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Language: Python, License: Apache-2.0, Stargazers: 59, Issues: 4, Issues: 8

CFM-HRI-RGB-D-action-database

UESTC RGB-D Varying-view action database. This multi-view action database was captured with Kinect v2.0 and includes RGB video, 3D skeleton sequences, and depth map sequences.

LLaVA-NeXT-Image-Llama3-Lora

LLaVA-NeXT-Image-Llama3-Lora, modified from https://github.com/arielnlee/LLaVA-1.6-ft

Language: Python, License: Apache-2.0, Stargazers: 37, Issues: 2, Issues: 1

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 27, Issues: 1, Issues: 4

Language: Python, Stargazers: 22, Issues: 0, Issues: 0

Gemini

Google Gemini AI model with speech recognition and voice.

Language: Python, License: MIT, Stargazers: 20, Issues: 4, Issues: 1

diversity-eval

Official GitHub repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"

Language: Python, License: MIT, Stargazers: 19, Issues: 3, Issues: 2

mllm-dpo

[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language models

Language: Jupyter Notebook, Stargazers: 18, Issues: 1, Issues: 3

Language: Python, Stargazers: 4, Issues: 0, Issues: 0

Language: Jupyter Notebook, Stargazers: 2, Issues: 0, Issues: 0