zhen's starred repositories
big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
coyo-dataset
COYO-700M: Large-scale Image-Text Pair Dataset
llm-embedding
Finetune Malaysian LLM for Malaysian context embedding task.
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ControlNet
Let us control diffusion models!
stable-diffusion-webui
Stable Diffusion web UI
awesome-emdl
Embedded and mobile deep learning research resources
annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Awesome-SLU-Survey
Tracking the progress in SLU (resources, code, and new frontiers etc.)
Capsule-NLU
Joint Slot Filling and Intent Detection via Capsule Neural Networks (ACL'19) https://arxiv.org/abs/1812.09471
task-specific-datasets
A collection of task-specific NLU datasets
QA-Survey-CN
北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA),基于表格的问答系统(TableQA)、基于视觉的问答系统(VisualQA)和机器阅读理解(MRC)等,每类任务分别对学术界和工业界进行了相关总结。