Andriy Mulyar's repositories
semantic-text-similarity
an easy-to-use interface to fine-tuned BERT models for computing semantic similarity in clinical and web text. that's it.
bert_document_classification
architectures and pre-trained models for long document classification.
sklearn-oblique-tree
a python interface to OC1 and other oblique decision tree implementations
reddit_visualizer
visualize reddit threads with machine learning and nlp
2020-603-Project-Mulyar
Benchmarking of Transformer Multihead Attention Kernel CUDA Implementations
huggingface.js
Utilities to use the Hugging Face Hub API
clus-predictive-clustering-docker
an example Dockerfile to build a reproducible environment for experiments in the Clus predictive clustering tree framework.
graphbrain
graphbrain
NeuralConceptLinking
linking mentions in text to structured ontologies
2020-603-A2-Mulyar
cuda knn
alpaca-lora
Finetuning InstructLLaMA on consumer hardware
bayesian_finetuning
Code repository for the course project in Andrew Wilson's Fall 2021 Bayesian Machine Learning
blog
Public repo for HF blog posts
dspy
DSPy: The framework for programming—not prompting—foundation models
fast-transformers
Pytorch library for fast transformer implementations
graph_theory_conjecturing
automatic graph theory conjecturing code for use with conjecturing. requires a sage math local install (best of luck)
Presentations
a collection of past talks and presentations
PyLearningBenchmarks
A python package for pre-processing canon machine learning benchmarks from UCI and KEEL.
recipes
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!
scikit-learn
scikit-learn: machine learning in Python
sklearn-hellinger
contains hellinger distance directly implemented into sklearn
sklearn-reproducibility-docker
a docker container template to reproduce experiments run utilizing sci-kit learn
vectordb
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
yellowbrick
Visual analysis and diagnostic tools to facilitate machine learning model selection.