Hon-Wong's starred repositories
awesome-nlp
:book: A curated list of resources dedicated to Natural Language Processing (NLP)
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
wechat-chatgpt
Use ChatGPT On Wechat via wechaty
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Awesome-Video-Datasets
Video datasets
Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
text-dedup
All-in-one text de-duplication
Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
GroundingGPT
[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.
FreestyleNet
[CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis
orange3-text
🍊 :page_facing_up: Text Mining add-on for Orange3
PTSEFormer
[ECCV2022] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection