Aaron Han's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:136619Issues:1055Issues:7549

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:50895Issues:499Issues:872

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25180Issues:222Issues:452

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Language:PythonLicense:Apache-2.0Stargazers:3721Issues:53Issues:50

ChatGPT_JCM

OpenAI管理界面,聚合了OpenAI的所有接口进行界面操作(所有模型、图片、音频、微调、文件)等,支持Markdown格式(公式、图表,表格)等,后期会一点一点的将OpenAI接口进行接入大家支持一下,右上角点个Star。

Language:VueLicense:BSD-3-ClauseStargazers:2945Issues:30Issues:58

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language:Jupyter NotebookLicense:MITStargazers:2452Issues:36Issues:34

Chain-of-ThoughtsPapers

A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".

machine-learning-interview

算法工程师-机器学习面试题总结

Generative_Deep_Learning_2nd_Edition

The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:983Issues:21Issues:26

MovieChat

[CVPR 2024] 🎬💭 chat with over 10K frames of video!

Language:PythonLicense:BSD-3-ClauseStargazers:467Issues:10Issues:68

Awesome-Multimodal-LLM

Research Trends in LLM-guided Multimodal Learning.

prophet

Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

Language:PythonLicense:Apache-2.0Stargazers:262Issues:3Issues:40

Awesome-Multimodal-Reasoning

Collection of papers and resources on Multimodal Reasoning, including Vision-Language Models, Multimodal Chain-of-Thought, Visual Inference, and others.

License:MITStargazers:219Issues:4Issues:0

Yolov5-Flask-VUE

基于Flask+VUE前后端,在阿里云公网WEB端部署YOLOv5目标检测模型

Language:PythonLicense:MITStargazers:176Issues:3Issues:12

SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Language:PythonLicense:BSD-3-ClauseStargazers:172Issues:3Issues:24

FrozenBiLM

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Language:PythonLicense:Apache-2.0Stargazers:151Issues:4Issues:15

fluent-python-notes

《流畅的 Python》阅读笔记

Language:Jupyter NotebookLicense:CC-BY-SA-4.0Stargazers:80Issues:3Issues:0

CVPR2023Summary

CVPR2023所有论文免费打包下载+ ChatPaper所有论文总结免费下载

Language:PythonLicense:MITStargazers:60Issues:3Issues:1

Face-recognition-for-classroom-sign-in

基于FaceNet的人脸检测+识别的课堂学生签到系统

Language:PythonLicense:Apache-2.0Stargazers:44Issues:3Issues:0

StatisticalLearning_USTC

Statistical Learning course in USTC. 中科大统计学习(刘东)课程复习资料。

Language:TeXStargazers:42Issues:0Issues:0

NExT-GQA

Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)

Language:PythonLicense:MITStargazers:41Issues:1Issues:6

Paddle-face-detection-and-expression-recognition

基于Paddle框架的TinyYOLO人脸检测和ResNet表情识别

Language:Jupyter NotebookStargazers:29Issues:2Issues:0

StatisticalLearningCheatsheet

**科学技术大学研究生课程 INFO6407P 统计学习(刘东)之半开卷小抄

Language:TeXStargazers:13Issues:2Issues:0

InvReg

Invariant Feature Regularization for Fair Face Recognition (ICCV'23)

Language:PythonLicense:MITStargazers:12Issues:0Issues:1

OOD-VSSL

[NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts

Language:PythonLicense:NOASSERTIONStargazers:12Issues:4Issues:1

videoqa_dataset_visualization

Load and visualize different datasets in video question answering

Language:PythonLicense:MITStargazers:9Issues:0Issues:0

FAVOR

Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models

Stargazers:7Issues:0Issues:0

dsp-homework

A backup of my homework. 现代数字信号处理 DSP II

Language:PythonLicense:GPL-3.0Stargazers:3Issues:1Issues:1