Umar Butler's repositories
open-australian-legal-corpus-creator
The code used to create and update the Open Australian Legal Corpus, the first and only multijurisdictional open corpus of Australian legislative and judicial documents.
open-australian-legal-embeddings-creator
The code used to create and update the Open Australian Legal Embeddings, the first open-source embeddings of Australian legislative and judicial documents.
emubert-creator
The training code behind EmuBert, the largest open-source masked language model for Australian law.
persist-cache
An easy-to-use Python library for lightning-fast persistent function caching.
epel-release-latest-7.noarch.rpm
A repository intended to preserve epel-release-latest-7.noarch.rpm.
Legal-Text-Analytics
A list of selected resources, methods, and tools dedicated to Legal Text Analytics.
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.