Claudiu Branzan's repositories
NLP-demo-2017
Semantic natural language understanding at scale using Spark, machine-learned annotators and deep-learned ontologies
adblockparser
Python parser for Adblock Plus filters
Covid19Study
Simulation of measures to prevent spread of Covid19
datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++
DeepLearn
Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow and scikit-learn.
eli5
A library for debugging/inspecting machine learning classifiers and explaining their predictions
extruct
Extract embedded metadata from HTML markup
frontera
A scalable frontier for web crawlers
fuzzyset
A simple fuzzy matching set for Python strings
ga-beacon
Google Analytics collector-as-a-service (using GA measurement protocol).
imageSimilarity
Given a new image, determine if it is likely derived from a known image.
langid.py
Stand-alone language identification system
mcmc-randomwalk
Simple MCMC
mdr
A Python library to detect and extract listing data from HTML pages.
models
Models built with TensorFlow
mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
page_clustering
A simple algorithm for clustering web pages, suitable for crawlers
pyllms
Minimal Python library to connect to LLMs (OpenAI, Anthropic, AI21, Cohere, Aleph Alpha, HuggingfaceHub, Google PaLM2), with a built-in model performance benchmark.
ReP_AL-3D-Lawn-Mower
Code and Other for the ReP_AL Lawn Mower
Robotcorder
A chrome extension that generates test scripts
scrapely
A pure-python HTML screen-scraping library
seirsplus
Models of SEIRS epidemic dynamics with extensions, including network-structured populations, testing, contact tracing, and social distancing.
social-analyzer
API and Web App for analyzing & finding a person's profile across 300+ social media websites (Detections are updated regularly)
sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
webpageclassifier
Categorizes a website, given its URL, as one of blog|wiki|news|forum|classified|shopping|undecided.
webstruct
Learning the structure of the web