Jun Zhang's repositories
NJU-Big-Data
Course Repo for Big Data Processing: Comprehensive Experiments
The-Phoenix-Proiect
The Phoenix Project: a legendary story of IT operations
awesome-detection-transformer
A collection of papers on transformers for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
deeplearning_ai_books
deeplearning.ai (notes and resources for Andrew Ng's deep learning courses)
awesome-multiple-object-tracking
Resources for Multiple Object Tracking (MOT)
CDN
Code for "Mining the Benefits of Two-stage and One-stage HOI Detection"
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.
detr
End-to-End Object Detection with Transformers
HOI-Detection
Some Useful Links for HOI Detection
NJUCS-Courses
Course Materials from NJUCS
trackerslist
Updated list of public BitTorrent trackers
cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
copilot-gpt4-service
Convert GitHub Copilot to ChatGPT; use the GPT-4 model for free
FastV
Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
HOI-Learning-List
A list of resources on Human-Object Interaction learning.
HOI-Transformer
HOI Detection Transformer Architecture, Based on CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"
HQM
ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection
InternVideo
Video Foundation Models & Data for Multimodal Understanding
Linux-Config
My Linux Configuration Scripts, Oh-My-Zsh, etc.
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
NJU-DisSys-Go-RPC
RPC distributed system implemented in Go
Open-LLaVA-NeXT
An open-source implementation of LLaVA-NeXT.
vstar
PyTorch Implementation of "V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs"
webvid
Large-scale text-video dataset. 10 million captioned short videos.
zotero-bridge
Obsidian plugin to integrate with Zotero through ZotServer