Yuchen Duan (duanduanduanyuchen)

Company: The Chinese University of Hong Kong

Location: Hong Kong

Home Page: https://scholar.google.com/citations?user=trkSLFoAAAAJ&hl=en

Yuchen Duan's starred repositories

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python | License: Apache-2.0 | Stargazers: 2574 | Issues: 0

ToMe

A method to increase the speed and lower the memory footprint of existing vision transformers.

Language: Python | License: NOASSERTION | Stargazers: 922 | Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

Stargazers: 11200 | Issues: 0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | Stargazers: 1140 | Issues: 0

MM-NIAH

This is the official implementation of the paper "Needle In A Multimodal Haystack"

Language: Python | Stargazers: 70 | Issues: 0

Amazing-Python-Scripts

🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.

Language: Jupyter Notebook | License: MIT | Stargazers: 2374 | Issues: 0

BLINK_Benchmark

This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12390 [ECCV 2024]

Language: Python | License: Apache-2.0 | Stargazers: 96 | Issues: 0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1395 | Issues: 0

Language: Python | Stargazers: 1 | Issues: 0

VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks

Language: Python | License: Apache-2.0 | Stargazers: 863 | Issues: 0

CoMat

Official code for 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Language: Python | Stargazers: 116 | Issues: 0

PlainMamba

[BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition

Language: Python | License: Apache-2.0 | Stargazers: 58 | Issues: 0

DDPS

Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"

Language: Python | Stargazers: 64 | Issues: 0

Vision-RWKV

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Language: Python | License: Apache-2.0 | Stargazers: 311 | Issues: 0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | Stargazers: 31153 | Issues: 0

InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language: Python | License: Apache-2.0 | Stargazers: 3178 | Issues: 0

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language: Python | License: MIT | Stargazers: 2452 | Issues: 0

TsinghuaBookCrawler

Crawler for the Tsinghua University course reference book platform

Language: Python | Stargazers: 169 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 263 | Issues: 0

ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Language: Python | License: Apache-2.0 | Stargazers: 1197 | Issues: 0

PVT

Official implementation of PVT series

Language: Python | License: Apache-2.0 | Stargazers: 1704 | Issues: 0

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Language: Python | License: Apache-2.0 | Stargazers: 9248 | Issues: 0

Language: HTML | Stargazers: 2 | Issues: 0

CVPR20_CLVision_challenge

1st Place approach for CVPR 2020 Continual Learning Challenge

Language: Python | License: MIT | Stargazers: 46 | Issues: 0

autojs

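Android Auto.js scripts for registration, login, and check-in. You only need to edit the JSON configuration file to customize the operation flow. Already implemented: automatic Weibo registration, remote content fetching, automatic Weibo posting, and more. Newly added: NetEase 163 mailbox registration, Douyin registration and likes. In progress: [Baidu Maps check-in, Dianping check-in, Dingdong Maicai check-in, Pinduoduo check-in, SMZDM check-in, Suning check-in, Taobao check-in for Taojinbi coins, WeChat Read (TODO), Xiaomi Mall flash-sale web purchase (TODO), UnionPay QuickPass check-in points, Alipay check-in points, Alipay daily Huabei red envelope, Alipay sports service early check-in] https://github.com/bayson/autojs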

Language: JavaScript | Stargazers: 459 | Issues: 0

FBDQA-2020S

Financial Big Data and Quantitative Analytics, Spring 2020

Stargazers: 36 | Issues: 0

Python-100-Days

Python - 100 Days from Novice to Master

Language: Python | Stargazers: 153944 | Issues: 0