Bo Zhang's starred repositories
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Megatron-LM
Ongoing research training transformer models at scale
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
LeanCopilot
LLMs as Copilots for Theorem Proving in Lean
Awesome-LLM4AD
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
awesome-knowledge-driven-AD
A curated list of awesome knowledge-driven autonomous driving (continually updated)
DriveLikeAHuman
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models
Expert_Sparsity
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
Once-for-Both
[CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression