facebookresearch / MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to implement English Language IDentification (LID)?

hbchen121 opened this issue · comments

How do you implement the function o "English Language IDentification (LID)" in the paper?

Thanks~ Looking forward to your reply

In the paper we use a production LID model. In this open-source version you can try FastText:

def lid(text):