Vinayak Arannil's starred repositories

ControlNet

Let us control diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 28517 | Issues: 213 | Issues: 519

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language: Python | License: Apache-2.0 | Stargazers: 7234 | Issues: 79 | Issues: 696

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5691 | Issues: 49 | Issues: 139

nlpaug

Data augmentation for NLP

Language: Jupyter Notebook | License: MIT | Stargazers: 4332 | Issues: 41 | Issues: 221

data-science-on-aws

AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3308 | Issues: 120 | Issues: 222

presidio

Context aware, pluggable and customizable data protection and de-identification SDK for text and images

Language: Python | License: MIT | Stargazers: 3231 | Issues: 64 | Issues: 386

machine-learning

🌎 machine learning tutorials (mainly in Python3)

Language: HTML | License: MIT | Stargazers: 3132 | Issues: 129 | Issues: 6

NAB

The Numenta Anomaly Benchmark

Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 1896 | Issues: 117 | Issues: 179

LLM-Finetuning

LLM Finetuning with peft

Language: Jupyter Notebook | Stargazers: 1708 | Issues: 25 | Issues: 3

Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Language: Jupyter Notebook | License: MIT | Stargazers: 1693 | Issues: 21 | Issues: 61

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT | Stargazers: 1468 | Issues: 20 | Issues: 0

OpenOOD

Benchmarking Generalized Out-of-Distribution Detection

Language: Python | License: MIT | Stargazers: 778 | Issues: 8 | Issues: 100

jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Language: Python | License: Apache-2.0 | Stargazers: 555 | Issues: 15 | Issues: 43

scrubadub

Clean personally identifiable information from dirty dirty text.

Language: Python | License: Apache-2.0 | Stargazers: 388 | Issues: 11 | Issues: 63

Glow-PyTorch

Simple, extendable, easy to understand Glow implementation in PyTorch

Language: Python | License: NOASSERTION | Stargazers: 367 | Issues: 3 | Issues: 0

VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Language: Python | License: MIT | Stargazers: 354 | Issues: 10 | Issues: 34

Indic-BERT-v1

Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT

Language: Python | License: MIT | Stargazers: 272 | Issues: 18 | Issues: 25

media-insights-on-aws

A serverless framework to accelerate the development of applications that discover next-generation insights in your video, audio, text, and image resources by utilizing AWS Machine Learning and Media services.

Language: Python | License: Apache-2.0 | Stargazers: 244 | Issues: 35 | Issues: 383

VL_adapter

PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)

Language: Python | License: MIT | Stargazers: 197 | Issues: 6 | Issues: 17

pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

Language: Python | License: MIT | Stargazers: 167 | Issues: 6 | Issues: 8

sagemaker-studio-custom-image-samples

This repository contains examples of Docker images that can be used as custom images for KernelGateway Apps in SageMaker Studio

eraserbenchmark

A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/

Language: Python | License: Apache-2.0 | Stargazers: 97 | Issues: 10 | Issues: 9

content-analysis-on-aws

As of August 30, 2023, this AWS Solution is no longer available. Existing deployments will continue to run. The functionality provided by Content Analysis on AWS will be superseded with functionality in Media2Cloud on AWS and Content Localization on AWS. We encourage you to explore these solutions.

Language: Vue | License: Apache-2.0 | Stargazers: 95 | Issues: 26 | Issues: 50

Named-Entity-Recognition

Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.

Language: Python | License: GPL-3.0 | Stargazers: 44 | Issues: 5 | Issues: 5

text-anonymization-benchmark

Annotated corpus + evaluation metrics for text anonymisation

Language: Python | License: MIT | Stargazers: 43 | Issues: 6 | Issues: 3

united-nations-ner

Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task.

Language: Jupyter Notebook | Stargazers: 31 | Issues: 3 | Issues: 0

indic-wx-converter

Python library for converting UTF to WX and vice-versa for Indian languages.

Language: Python | License: MIT | Stargazers: 11 | Issues: 0 | Issues: 0
Language: Python | License: MIT-0 | Stargazers: 9 | Issues: 0 | Issues: 0

Text_Detection_Synthetic_DataGenerator

Synthetic data generator for text detection/localisation tasks.

Language: Python | Stargazers: 8 | Issues: 1 | Issues: 0