Wenbo Hu's repositories
artifact-directory-template
Template for specifying locations for all capstone project components
Awesome-Multimodal-Large-Language-Models
Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
cvpr-latex-template
Extended LaTeX template for CVPR/ICCV papers
Data-Visualization-DSC106-
Data visualization coursework using JavaScript and HTML
Deep-Learning-Projects-CSE151B-
Coursework for CSE151B. I strongly suggest reading the reports.
peft_llama
Peft_BLIP_LLaMA
DecryptPrompt
A summary of Prompt & LLM papers, open-source data & models, and AIGC applications
llama-recipes
Examples and recipes for Llama 2 model
LLaVA-UHD-Better
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (Evaluation Pipeline)
Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
pytorch_resnet_cifar10
Proper implementation of ResNet-s for CIFAR10/100 in PyTorch that matches the description in the original paper.
Scalable_Analytic_System_DSC102
Coursework for DSC102
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
transformers_llava
Connects the vision tower and projection to the LLM
VALOR
Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models
yolov5
YOLOv5 in PyTorch > ONNX > CoreML > TFLite