rdhanurkar's repositories
org-chart
Highly customizable D3 org chart. Integrations available for Angular, React, and Vue.
hitchhikers-guide
The Hitchhiker's Guide to Data Science for Social Good
splink_demos
Interactive notebooks containing demonstration code of the splink library
splink
Fast, accurate and scalable probabilistic data linkage using your choice of SQL backend
ai-job-title-area-classification
Classification of job titles into categories, using different ML techniques
kgtk
Knowledge Graph Toolkit
skills-ml
Data processing and machine learning methods for the Open Skills Project
kgtk-notebooks
Tutorial and hands-on notebook on using the Knowledge Graph Toolkit (KGTK)
SkillNER-HS
A (smart) rule-based NLP module to extract job skills from text
BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
duckling
Language, engine, and tooling for expressing, testing, and evaluating composable language rules on input strings.
aima-python
Python implementation of algorithms from Russell and Norvig's "Artificial Intelligence - A Modern Approach"
mFLICA
Given a set of multivariate time series of individual activities, our goal is to identify periods of coordinated activity, find factions of coordination if more than one exists, and identify the leaders of each faction.
rltk
Record Linkage ToolKit (Find and link entities)
JobStack
Code for reproducing the results in the paper: De-identification of Privacy-related Entities in Job Postings
intro-to-data-linking
Tutorial notebooks and associated artifacts for my Introduction to Data Linking talk/workshop.
wordVectors
An R package for creating and exploring word2vec and other word embedding models
dsbox-ta2
The DSBox TA2 component
deploying-ml-model-using-flask
Deploying a machine learning project, a YouTube spam comment detector, using Flask
BayesianRecordLinkage.jl
Perform Bayesian record linkage with a one-to-one matching assumption.
SkillNER
A Named Entity Recognition system that extracts soft skills from text
deep-siamese-text-similarity
TensorFlow-based implementation of a deep Siamese LSTM network to capture phrase/sentence similarity using character/word embeddings
Knowledge-Graph-Analysis-Programming-Exercises
Exercises for the Analysis of Knowledge Graphs
pbprdf
Generate linked data for advanced basketball analytics. Reads basketball play-by-play files and generates RDF to import into a semantic graph database like RDF4J.
atyimo
atyimo: probabilistic record linkage for massive administrative datasets
dig-etl-engine
Download DIG to run on your laptop or server.