Luke A's repositories
awesome-privacy-engineering
A curated list of resources related to privacy engineering
ADS-599B
Capstone
autogluon
AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data
awesome-artificial-intelligence
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.
Course-Knowledge-Graphs
Test data and example source code for the Knowledge Graphs lecture 2022
dedupe
A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
embiggen
Embiggen is the Python Graph Representation learning, Prediction and Evaluation submodule of the GRAPE library.
Functional-Python-Programming-3rd-Edition
Code Repository for Functional Python Programming 3rd Edition, Published by Packt
goodreads-scraper
A Python scraper for Goodreads books and reviews.
Graph-Machine-Learning
Graph Machine Learning, published by Packt
Hands-On-Data-Analysis-with-Pandas-2nd-edition
Materials for following along with Hands-On Data Analysis with Pandas - Second Edition
HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
mastering-large-datasets
Repository for Mastering Large Datasets with Python
mmda
multimodal document analysis
nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
notebooks-1
Notebooks using the Hugging Face libraries
OpenNRE
An Open-Source Package for Neural Relation Extraction (NRE)
PaddleNLP
Easy-to-use and powerful NLP and LLM library with Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including Text Classification, Neural Search, Question Answering, Information Extraction, Document Intelligence, Sentiment Analysis etc.
PAMI
PAMI is a Python library containing 100+ algorithms to discover useful patterns in various databases across multiple computing platforms. (Active)
Practical-Data-Analysis-using-Jupyter-Notebook
Practical Data Analysis using Jupyter Notebook, published by Packt Publishing
practicing_trustworthy_machine_learning
GitHub Repo associated with the O'Reilly book "Practicing Trustworthy Machine Learning"
Prompt-Engineering-Guide
Guides, papers, lecture, notebooks and resources for prompt engineering
recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
shared
Code and shared files
The-Pandas-Workshop
The Pandas Workshop, published by Packt
trl
Train transformer language models with reinforcement learning.
vert-papers
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
web_scraping_example
Simple RSS feed reader for HackerNews.