Vipula Dissanayake's repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
pubtator_loader
A Python 🐍 package to load PubTator Documents 🧾, tokenize and convert them to BILUO Format.
amazon-bedrock-workshop
This is a workshop designed for Amazon Bedrock a foundational model service.
course
The Hugging Face course on Transformers
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
flink
Apache Flink
haystack
:mag: Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex question answering, semantic search, text generation applications, and more.
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
lightning
Build high-performance AI models with PyTorch Lightning (organized PyTorch). Deploy models with Lightning Apps (organized Python to build end-to-end ML systems).
lit-gpt
Implementation of Falcon, StableLM, Pythia, INCITE language models based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
mlflow
Open source platform for the machine learning lifecycle
mojo
The Mojo Programming Language
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
openvino
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
pinot
Apache Pinot - A realtime distributed OLAP datastore
quantulum3
Library for unit extraction - fork of quantulum for python3
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
ReFinED
ReFinED is an efficient and accurate entity linking (EL) system.
rust-bert
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
snk
🟩⬜ Generates a snake game from a github user contributions graph and output a screen capture as animated svg or gif
spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
vipulasd.github.io
My personal website built with Jekyll