jungle-gym-ac

followers

following

stars

Nanjing University

Jun Zhang's repositories

NJU-Big-Data

Course Repo for Big Data Processing: Comprehensive Experiments

Language:Java200

The-Phoenix-Proiect

凤凰项目：一个 IT运维的传奇故事

200

awesome-detection-transformer

Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)

100

deeplearning_ai_books

deeplearning.ai（吴恩达老师的深度学习课程笔记及资源）

Language:HTML100

awesome-multiple-object-tracking

Resources for Multiple Object Tracking (MOT)

000

awesome-open-vocabulary-object-detection

000

CDN

Code for "Mining the Benefits of Two-stage and One-stage HOI Detection"

Language:PythonApache-2.0000

ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

Language:TypeScriptMIT000

detr

End-to-End Object Detection with Transformers

Language:PythonApache-2.0000

HOI-Detection

Some Useful Links for HOI Detection

000

NJUCS-Courses

Course Materials from NJUCS

GPL-3.0000

trackerslist

Updated list of public BitTorrent trackers

GPL-2.0000

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Apache-2.0000

copilot-gpt4-service

Convert Github Copilot to ChatGPT, free to use the GPT-4 model

Language:GoMIT000

DeepStack-VL

Apache-2.0000

FastV

Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

000

HOI-Learning-List

A list of Human-Object Interaction Learning.

000

HOI-Transformer

HOI Detection Transformer Architecture, Based on CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"

Language:PythonApache-2.0000

HQM

ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection

000

InternVideo

Video Foundation Models & Data for Multimodal Understanding

Apache-2.0000

Linux-Config

My Linux Configuration Scripts, Oh-My-Zsh, etc.

Language:ShellApache-2.0000

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.0000

LLaVA-NeXT

000

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonNOASSERTION000

NJU-DisSys-Go-RPC

RPC Distributed System implemented in GO

Language:Go000

Open-LLaVA-NeXT

An open-source implementation of LLaVA-NeXT.

000

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

MIT000

webvid

Large-scale text-video dataset. 10 million captioned short videos.

000

zotero-bridge

Obsidian plugin to integrate with Zotero through ZotServer

Language:TypeScriptMIT000