姬忠鹏 (Jizhongpeng)

Jizhongpeng

Geek Repo

Location:Shanghai

Home Page:jizhongpeng.xyz

Github PK Tool:Github PK Tool

姬忠鹏's starred repositories

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language:PythonLicense:MITStargazers:2757Issues:37Issues:167

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Language:PythonLicense:Apache-2.0Stargazers:1992Issues:27Issues:141

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1724Issues:18Issues:76

4K4D

[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution

Language:PythonLicense:NOASSERTIONStargazers:1459Issues:95Issues:35

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1235Issues:17Issues:117

InternVideo

Video Foundation Models & Data for Multimodal Understanding

Language:PythonLicense:Apache-2.0Stargazers:1017Issues:28Issues:116

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language:PythonLicense:MITStargazers:997Issues:20Issues:19

xgen

Salesforce open-source LLMs with 8k sequence length.

Language:PythonLicense:Apache-2.0Stargazers:713Issues:12Issues:14

MergeLM

Codebase for Merging Language Models (ICML 2024)

fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:460Issues:12Issues:36

VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Language:PythonLicense:MITStargazers:427Issues:6Issues:47

DenseDiffusion

Official Pytorch Implementation of DenseDiffusion (ICCV 2023)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:413Issues:11Issues:18

ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Language:PythonLicense:MITStargazers:338Issues:7Issues:21

DINOv

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

llm_multiagent_debate

Code for Improving Factuality and Reasoning in Language Models through Multiagent Debate

ddpo

Code for the paper "Training Diffusion Models with Reinforcement Learning"

Language:PythonLicense:MITStargazers:266Issues:7Issues:11

ArXivQA

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Language:PythonLicense:MITStargazers:124Issues:7Issues:8

MixFormerV2

[NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking

Language:PythonLicense:MITStargazers:122Issues:10Issues:37

Rephrase-and-Respond

Official repo of Respond-and-Respond: data, code, and evaluation

Language:PythonLicense:MITStargazers:87Issues:3Issues:1

DNC

Official Pytorch implementation of 'Visual Recognition with Deep Nearest Centroids'. (ICLR2023 Spotlight)

Language:PythonLicense:MITStargazers:63Issues:7Issues:3
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:52Issues:3Issues:1

STMixer

[CVPR 2023] STMixer: A One-Stage Sparse Action Detector

multimodal_cognitive_ai

research work on multimodal cognitive ai

DVAR

Official implementation of "Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics" (NeurIPS 2023)

Language:PythonLicense:Apache-2.0Stargazers:33Issues:3Issues:0

MSLTNet

WACV 2024 (Official implementation of "4K-Resolution Photo Exposure Correction at 125 FPS with ~ 8K Parameters")

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:15Issues:1Issues:1

I2IQA

PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images

Stargazers:3Issues:0Issues:0