Yash Kumar Atri's starred repositories
mslr-shared-task
Multidocument Summarization for Literature Review Shared Task 2022
openhathi_instruct
This repository contains the code for dataset curation and fine-tuning of the instruct variant of the bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.
pubmed_parser
A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
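A minimal usage sketch, assuming the pubmed_parser package is installed; the file path is hypothetical and not taken from the repo's examples.

```python
import pubmed_parser as pp

# Parse a MEDLINE XML file into a list of dictionaries
# (each with fields such as title, abstract, pmid, mesh_terms, ...)
articles = pp.parse_medline_xml("pubmed21n0001.xml")
print(articles[0]["title"])
print(articles[0]["abstract"][:200])
```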
awesome-fairness-papers
Papers on fairness in NLP
awesome-topological-deep-learning
A curated list of topological deep learning (TDL) resources and links.
flash-attention
Fast and memory-efficient exact attention
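A minimal sketch of calling the fused attention kernel, assuming flash-attn is installed with CUDA support; shapes and sizes are illustrative, following the (batch, seqlen, nheads, headdim) convention with half-precision inputs.

```python
import torch
from flash_attn import flash_attn_func

q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")

# Exact attention computed without materializing the full seqlen x seqlen score matrix
out = flash_attn_func(q, k, v, causal=True)  # -> (2, 1024, 8, 64)
```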
TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
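A minimal sketch of the "2 lines of code" claim, assuming transformers-interpret and 🤗 transformers are installed; the checkpoint name is illustrative.

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from transformers_interpret import SequenceClassificationExplainer

model_name = "distilbert-base-uncased-finetuned-sst-2-english"
model = AutoModelForSequenceClassification.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# The two lines: build the explainer, then call it on a sentence
explainer = SequenceClassificationExplainer(model, tokenizer)
word_attributions = explainer("The movie was surprisingly good.")
print(word_attributions)  # list of (token, attribution score) pairs
```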
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
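A minimal sketch of the core training-loop changes accelerate asks for, assuming the package is installed; the toy model, optimizer, and dataloader stand in for the user's own objects.

```python
import torch
from accelerate import Accelerator

accelerator = Accelerator()  # picks up device, mixed-precision, and distributed config

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
dataloader = torch.utils.data.DataLoader(
    torch.utils.data.TensorDataset(torch.randn(64, 10), torch.randint(0, 2, (64,))),
    batch_size=8,
)

# prepare() moves everything to the right device(s) and wraps for DDP/FSDP/DeepSpeed
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for inputs, targets in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(inputs), targets)
    accelerator.backward(loss)  # replaces loss.backward()
    optimizer.step()
```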
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation from Google Research, in PyTorch
MLfromscratch
Machine Learning algorithm implementations from scratch.
tensorly-notebooks
Tensor methods in Python with TensorLy
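A minimal sketch of one tensor method covered in the notebooks, a rank-3 CP (PARAFAC) decomposition with TensorLy; the random tensor is only illustrative and not taken from the notebooks themselves.

```python
import numpy as np
import tensorly as tl
from tensorly.decomposition import parafac
from tensorly.cp_tensor import cp_to_tensor

tensor = tl.tensor(np.random.rand(10, 10, 10))

# CP decomposition: approximate the tensor as a sum of 3 rank-one terms
cp = parafac(tensor, rank=3)

reconstruction = cp_to_tensor(cp)
error = tl.norm(tensor - reconstruction) / tl.norm(tensor)
print(f"relative reconstruction error: {error:.3f}")
```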
wae-rnf-lm
PyTorch implementation of our NAACL 2019 paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" https://arxiv.org/abs/1904.02399
long-summarization
Resources for the NAACL 2018 paper "A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents"
multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
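A minimal sketch of building a decoder-only model with x-transformers, assuming the package is installed; vocabulary size, sequence length, and layer sizes are illustrative.

```python
import torch
from x_transformers import TransformerWrapper, Decoder

model = TransformerWrapper(
    num_tokens=20000,          # vocabulary size
    max_seq_len=1024,
    attn_layers=Decoder(dim=512, depth=6, heads=8),
)

tokens = torch.randint(0, 20000, (1, 1024))
logits = model(tokens)         # -> (1, 1024, 20000)
```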
ODE-Transformer
Code repository for the ACL 2022 paper "ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation", which redesigns the Transformer architecture from the ODE perspective, using high-order ODE solvers to enhance the residual connections.