KV Aditya Srivatsa's starred repositories

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:38041Issues:999Issues:1143

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:32708Issues:204Issues:5029

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30345Issues:426Issues:4194

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language:PythonLicense:NOASSERTIONStargazers:13883Issues:203Issues:2324

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:13151Issues:325Issues:321

llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:12153Issues:174Issues:359

lollms-webui

Lord of Large Language Models Web User Interface

Language:VueLicense:Apache-2.0Stargazers:4296Issues:64Issues:283

ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

graph4nlp

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html) for various learning resources!

Language:PythonLicense:Apache-2.0Stargazers:1670Issues:30Issues:171

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1283Issues:21Issues:77

SMAC3

SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization

Language:PythonLicense:NOASSERTIONStargazers:1076Issues:42Issues:540

NL-Augmenter

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

Language:PythonLicense:MITStargazers:775Issues:23Issues:52

simpleT5

simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.

Language:PythonLicense:MITStargazers:386Issues:7Issues:49

supervenn

supervenn: precise and easy-to-read multiple sets visualization in Python

Language:PythonLicense:MITStargazers:314Issues:10Issues:32

style-transfer-paraphrase

Official code and data repository for our EMNLP 2020 long paper "Reformulating Unsupervised Style Transfer as Paraphrase Generation" (https://arxiv.org/abs/2010.05700).

Language:HTMLLicense:MITStargazers:228Issues:11Issues:36

GSMN

Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching

DeepCAVE

An interactive framework to visualize and analyze your AutoML process in real-time.

Language:PythonLicense:Apache-2.0Stargazers:70Issues:7Issues:108

text-preprocessing

A python package for text preprocessing task in natural language processing.

Language:PythonLicense:BSD-3-ClauseStargazers:63Issues:1Issues:9

TheNumericsOfGANs

This repository contains the code to reproduce the core results from the paper "The Numerics of GANs".

Language:PythonLicense:MITStargazers:46Issues:5Issues:2

bridge

NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes"

Language:PythonLicense:MITStargazers:25Issues:3Issues:0

devanagari-to-roman-script-transliteration

Python scipt to convert <text written in devnagri script> TO <text in roman/english script>

Language:PythonLicense:GPL-3.0Stargazers:20Issues:1Issues:2

E2E-dialo-disentanglement

Source code and dataset for paper "End-to-End Transition-Based Online Dialogue Disentanglement"

language_modeling_lstm

Language modeling based on Penn Treebank (RNN/LSTM, Pytorch)

Language:PythonStargazers:14Issues:2Issues:0

StructureCharacterization4DD

https://openreview.net/forum?id=OC1o4_OI6Jw

SemEval2022-Task-5-Multimedia-Automatic-Misogyny-Identification-MAMI-

SemEval 2022 Task 5: Multimedia Automatic Misogyny Identification - baseline models and dataset

Language:PythonStargazers:10Issues:3Issues:0
Language:PythonStargazers:10Issues:1Issues:0

BiSECT

Data and code for BiSECT project.

cache_em_all

A simple decorator to cache the results of function calls

Language:PythonLicense:GPL-3.0Stargazers:8Issues:0Issues:0

RL-Sepsis-Prediction

Final project for Introduction to Reinforcement Learning for MSDS at University of San Francisco

Language:Jupyter NotebookStargazers:8Issues:0Issues:0