Jiahao Wang (techmonsterwang)

techmonsterwang

Geek Repo

Company:The University of HongKong

Location:Pokfulam, Hong Kong, PRC

Home Page:https://www.zhihu.com/people/wang-jia-hao-53-3

Github PK Tool:Github PK Tool

Jiahao Wang's starred repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:37544Issues:391Issues:67

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonLicense:Apache-2.0Stargazers:9502Issues:78Issues:116

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7687Issues:109Issues:153

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonLicense:MITStargazers:5617Issues:50Issues:559

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language:PythonLicense:MITStargazers:4635Issues:122Issues:54

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:3782Issues:33Issues:513

ReAct

[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models

Language:Jupyter NotebookLicense:MITStargazers:1883Issues:16Issues:29
Language:PythonLicense:Apache-2.0Stargazers:1745Issues:121Issues:22

MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Language:PythonLicense:Apache-2.0Stargazers:1280Issues:50Issues:31

LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Language:PythonLicense:Apache-2.0Stargazers:1110Issues:11Issues:55

VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks

Language:PythonLicense:Apache-2.0Stargazers:1081Issues:10Issues:166

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language:PythonLicense:Apache-2.0Stargazers:990Issues:21Issues:57

awesome-mixture-of-experts

A collection of AWESOME things about mixture-of-experts

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:907Issues:17Issues:33

UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Language:PythonLicense:Apache-2.0Stargazers:904Issues:12Issues:19

rcg

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Language:PythonLicense:MITStargazers:792Issues:7Issues:33

MixtralKit

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

Language:PythonLicense:Apache-2.0Stargazers:762Issues:8Issues:16

LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Language:PythonLicense:Apache-2.0Stargazers:693Issues:14Issues:104

ml-aim

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Language:PythonLicense:NOASSERTIONStargazers:681Issues:18Issues:5

DeCLIP

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonLicense:NOASSERTIONStargazers:561Issues:11Issues:24

DINOv

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

mirasol-pytorch

Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch

Language:PythonLicense:MITStargazers:87Issues:7Issues:4

Aurora

[NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model

S2-BNN

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

UniLSeg

Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

xtuner

A toolkit for efficiently fine-tuning LLM (InternLM, Llama, Baichuan, QWen, ChatGLM)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0