Junghwan Park's starred repositories
professional-programming
A collection of learning resources for curious software engineers
ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
llama_parse
Parse files for optimal RAG
Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
ml-mobileclip
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024
GroundingGPT
[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
Video-LLaVA
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
Beyond-INet
Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"
obsidian-web-clipper
Obsidian Web Clipper is a simple Browser extension for Obsidian, a popular note-taking application. With this extension, you can quickly capture notes directly from your web browser and save them to your Obsidian vaults.
Free-GPT-Actions
A listing of Free GPT actions available for public use