Stephen Fernandes's repositories
attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
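The core trick behind attention sinks (the StreamingLLM recipe) is to keep the first few "sink" tokens plus a sliding window of recent tokens in the KV cache, so cache size stays constant however long generation runs. A minimal sketch of that eviction policy; the function name and sizes here are illustrative, not the repository's API:

```python
def evict_kv_cache(cache, num_sinks=4, window=12):
    """Attention-sinks style eviction: always keep the first `num_sinks`
    positions (the "sink" tokens) plus the most recent `window` positions,
    so the cache never grows beyond num_sinks + window entries."""
    if len(cache) <= num_sinks + window:
        return list(cache)
    return cache[:num_sinks] + cache[-window:]

# Simulate a growing cache of token positions across 100 decoding steps.
cache = []
for pos in range(100):
    cache.append(pos)
    cache = evict_kv_cache(cache, num_sinks=4, window=12)
# The sinks (positions 0-3) survive; everything else is the recent window.
```

In the real implementation the cache entries are key/value tensors rather than integers, but the retention rule is the same.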
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models and Score-based Models, a dark horse in the field of Generative Models
conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
deepspeed-test
Testing the DeepSpeed integration in Trainer and Accelerate
dora-from-scratch
LoRA and DoRA from Scratch Implementations
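LoRA, the starting point for DoRA, freezes the base weight W and trains only two low-rank factors, giving an effective weight W' = W + (alpha / r) * B @ A. A pure-Python sketch of that update under illustrative shapes (the helper names are mine, not the repository's):

```python
def matmul(X, Y):
    """Naive matrix multiply for lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def lora_adapt(W, A, B, alpha):
    """LoRA: W' = W + (alpha / r) * B @ A, where only the low-rank factors
    A (r x d_in) and B (d_out x r) are trained; W stays frozen."""
    r = len(A)
    BA = matmul(B, A)
    return [[w + (alpha / r) * d for w, d in zip(wr, dr)] for wr, dr in zip(W, BA)]

# Tiny example: d_out=2, d_in=3, rank r=1. B starts at zero (the usual LoRA
# init), so the adapted weight initially equals the frozen base weight.
W = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
A = [[1.0, 0.0, 1.0]]           # r x d_in
B = [[0.0], [0.0]]              # d_out x r, zero-initialized
W0 = lora_adapt(W, A, B, alpha=2.0)   # identical to W at init
B = [[1.0], [2.0]]              # hypothetical values after some training
W1 = lora_adapt(W, A, B, alpha=2.0)
```

DoRA builds on this by decomposing W' into a magnitude and a direction component and training the magnitude separately.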
FastAPI-template
A feature-rich, robust FastAPI template.
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
instruction-datasets
All available datasets for Instruction Tuning of Large Language Models
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for MuJoCo control tasks in OpenAI Gym
Modified-DETR
A PyTorch re-implementation of the official DETR.
NanoPeft
A minimal, neat implementation of different LoRA methods for training/fine-tuning Transformer-based models (e.g., BERT, GPTs). [Research purpose]
openai_trtllm
OpenAI compatible API for TensorRT LLM triton backend
phi-1
Plug-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation
Promptify
Prompt engineering | Use GPT or other prompt-based models to get structured output. Join our Discord for prompt engineering, LLMs, and other recent research
python-bpe
Byte Pair Encoding for Python!
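Byte Pair Encoding iteratively finds the most frequent adjacent symbol pair in the corpus and merges it into a new symbol. A self-contained sketch of one merge step, assuming a word-frequency vocabulary; function names are illustrative, not this package's API:

```python
from collections import Counter

def most_frequent_pair(words):
    """Count adjacent symbol pairs across a {word_tuple: freq} vocabulary
    and return the most frequent one."""
    pairs = Counter()
    for word, freq in words.items():
        for a, b in zip(word, word[1:]):
            pairs[(a, b)] += freq
    return max(pairs, key=pairs.get)

def merge_pair(words, pair):
    """Replace every occurrence of `pair` with the concatenated symbol."""
    merged = pair[0] + pair[1]
    out = {}
    for word, freq in words.items():
        new_word, i = [], 0
        while i < len(word):
            if i < len(word) - 1 and (word[i], word[i + 1]) == pair:
                new_word.append(merged)
                i += 2
            else:
                new_word.append(word[i])
                i += 1
        out[tuple(new_word)] = out.get(tuple(new_word), 0) + freq
    return out

words = {("l", "o", "w"): 5, ("l", "o", "w", "e", "r"): 2, ("n", "e", "w"): 3}
pair = most_frequent_pair(words)   # ("l", "o"), seen 7 times
words = merge_pair(words, pair)
```

Training repeats this loop until a target vocabulary size is reached; the learned merge list is then replayed in order to tokenize new text.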
PyTorch-Elmo-BiLSTMCRF
PyTorch BiLSTM-CRF with ELMo
RealtimeTTS
Converts text to speech in real time
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
unify-learning-paradigms
Data collator for UL2 and U-PaLM; surprisingly, no one had written or open-sourced this before