Chongyu-Liu's starred repositories
geektime-books
:books: 极客时间电子书
imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
Awesome-LLMs-Datasets
Summarize existing representative LLMs text datasets.
groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
EEG-Transformer
i. A practical application of Transformer (ViT) on 2-D physiological signal (EEG) classification tasks. Also could be tried with EMG, EOG, ECG, etc. ii. Including the attention of spatial dimension (channel attention) and *temporal dimension*. iii. Common spatial pattern (CSP), an efficient feature enhancement method, realized with Python.
FontDiffuser
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
MegaHan97K
MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories