jhCOR

JEONG JIHYEOK's starred repositories

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | BSD-3-Clause | 25328 218 459

open_clip

An open source implementation of CLIP.

Language: Python | NOASSERTION | 9926 77 478

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | BSD-3-Clause | 9720 97 653

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook | Apache-2.0 | 8907 95 394

oboe

Oboe is a C++ library that makes it easy to build high-performance audio apps on Android.

Language: C++ | Apache-2.0 | 3683 139 1161

LLaVA-NeXT

Language: Python | Apache-2.0 | 2529 32 244

mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Language: Python | MIT | 2254 30 229

NeuS

Code release for NeuS

Language: Python | MIT | 1564 25 133

idr

Language: Python | MIT | 698 15 45

android-Camera2Raw

Migrated:

Language: Java | Apache-2.0 | 388 43 18

MMVP

Language: Python | 282 10 26

purchases-android

Android in-app purchases and subscriptions made easy.

Language: Kotlin | MIT | 247 12 133

llama2-fine-tune

Scripts for fine-tuning Llama2 via SFT and DPO.

Language: Python | 178 4 4

Visual-Adversarial-Examples-Jailbreak-Large-Language-Models

Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models

Language: Python | BSD-3-Clause | 164 3 30

CR-GAN

Yu Tian et al. "CR-GAN: Learning Complete Representations for Multi-view Generation", IJCAI 2018

Language: Python | 123 8 19

AMBER

An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation

Language: Python | Apache-2.0 | 89 1 3

VL-Instruction-Tuning

FigStep

Jailbreaking Large Vision-language Models via Typographic Visual Prompts

Language: Python | MIT | 76 3 7

WildLight

Official implementation of our CVPR 2023 paper "In-the-wild Inverse Rendering with a Flashlight"

Language: Python | MIT | 74 7 9

HA-DPO

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Language: Python | Apache-2.0 | 59 4 8

CFM-HRI-RGB-D-action-database

UESTC RGB-D Varying-view action database. This multi-view action database is captured by Kinect v2.0 with modality of RGB video, 3D skeleton sequences and depth map sequences.

Language: Python | 48 1 2

Faster-LLM-Survey

Language: Python | 39 2 3

LLaVA-NeXT-Image-Llama3-Lora

LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft

Language: Python | Apache-2.0 | 37 2 1

InstructBLIP_PEFT

Language: Jupyter Notebook | Apache-2.0 | 27 1 4

eyenerf

Language: Python | 22 0 0

Gemini

Google Gemini AI model w/speech recognition and voice.

Language: Python | MIT | 20 4 1

diversity-eval

Official GitHub repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"

Language: Python | MIT | 19 3 2

mllm-dpo

[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model

Language: Jupyter Notebook | 18 1 3

Korean_DCS_2024

Language: Python | 4 0 0

-Dacon-Multimodal-vqa

Language: Jupyter Notebook | 2 0 0