kvadityasrivatsa

KV Aditya Srivatsa's starred repositories

bert

TensorFlow code and pre-trained models for BERT

Language: Python | Apache-2.0 | 38041 999 1143

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | Apache-2.0 | 32708 204 5029

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | MIT | 30345 426 4194

flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Language: Python | NOASSERTION | 13883 203 2324

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language: Jupyter Notebook | Apache-2.0 | 13151 325 321

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.

Language: Jupyter Notebook | NOASSERTION | 12153 174 359

lollms-webui

Lord of Large Language Models Web User Interface

Language: Vue | Apache-2.0 | 4296 64 283

ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

Language: C | MIT | 1805 19 201

graph4nlp

Graph4nlp is a library for the easy use of Graph Neural Networks for NLP. Visit our DLG4NLP website (https://dlg4nlp.github.io/index.html) for various learning resources!

Language: Python | Apache-2.0 | 1670 30 171

transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

Language: Jupyter Notebook | Apache-2.0 | 1283 21 77

SMAC3

SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization

Language: Python | NOASSERTION | 1076 42 540

NL-Augmenter

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

Language: Python | MIT | 775 23 52

simpleT5

simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 and lets you quickly train your T5 models.

Language: Python | MIT | 386 7 49

supervenn

supervenn: precise and easy-to-read multiple sets visualization in Python

Language: Python | MIT | 314 10 32

style-transfer-paraphrase

Official code and data repository for our EMNLP 2020 long paper "Reformulating Unsupervised Style Transfer as Paraphrase Generation" (https://arxiv.org/abs/2010.05700).

Language: HTML | MIT | 228 11 36

GSMN

Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching

Language: Python | 164 7 26

DeepCAVE

An interactive framework to visualize and analyze your AutoML process in real-time.

Language: Python | Apache-2.0 | 70 7 108

text-preprocessing

A Python package for text preprocessing tasks in natural language processing.

Language: Python | BSD-3-Clause | 63 1 9

TheNumericsOfGANs

This repository contains the code to reproduce the core results from the paper "The Numerics of GANs".

Language: Python | MIT | 46 5 2

bridge

NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes"

Language: Python | MIT | 25 30

devanagari-to-roman-script-transliteration

Python script to convert <text written in Devanagari script> to <text in Roman/English script>

Language: Python | GPL-3.0 | 20 1 2

E2E-dialo-disentanglement

Source code and dataset for paper "End-to-End Transition-Based Online Dialogue Disentanglement"

Language: Python | 17 2 5

language_modeling_lstm

Language modeling based on Penn Treebank (RNN/LSTM, Pytorch)

Language: Python | 14 20

onlinePtrNet_disentanglement

Language: Python | 13 3 4

StructureCharacterization4DD

https://openreview.net/forum?id=OC1o4_OI6Jw

Language: Python | 13 1 2

SemEval2022-Task-5-Multimedia-Automatic-Misogyny-Identification-MAMI-

SemEval 2022 Task 5: Multimedia Automatic Misogyny Identification - baseline models and dataset

Language: Python | 10 30

Disentangle

Language: Python | 10 10

BiSECT

Data and code for BiSECT project.

Language: Python | 9 1 3

cache_em_all

A simple decorator to cache the results of function calls

Language: Python | GPL-3.0 | 800

RL-Sepsis-Prediction

Final project for Introduction to Reinforcement Learning for MSDS at University of San Francisco

Language: Jupyter Notebook | 800