Daehee Kim's starred repositories
matmulfreellm
Implementation for MatMul-free LM.
Efficient-Multimodal-LLMs-Survey
Efficient Multimodal Large Language Models: A Survey
ChartReader
[ICCV 2023] ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
MM-Interleaved
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Awesome-CLIP
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).