Stephen Fernandes (StephennFernandes)

Location: Goa, India

Home Page: www.stephenfernandes.com

Stephen Fernandes's repositories

License: CC-BY-4.0 | Stargazers: 1 | Issues: 0

t5x_cuda

A working t5x repo that runs on NVIDIA GPUs. Can pretrain models on two A6000 GPUs. Note: designed for personal use; use at your own risk.

Stargazers: 0 | Issues: 1

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models and Score-based Models, a dark horse in the field of Generative Models

License: MIT | Stargazers: 0 | Issues: 0

conformer

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

deepspeed-test

Testing the DeepSpeed integration in Trainer and Accelerate

Language: Python | Stargazers: 0 | Issues: 0

dora-from-scratch

LoRA and DoRA from Scratch Implementations

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

FastAPI-template

Feature-rich, robust FastAPI template.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

instruction-datasets

All available datasets for Instruction Tuning of Large Language Models

Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

min-decision-transformer

Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Modified-DETR

A PyTorch re-implementation of the official DETR.

Language: Python | Stargazers: 0 | Issues: 0

NanoPeft

A simple repository with a neat implementation of different LoRA methods for training/fine-tuning Transformer-based models (e.g., BERT, GPTs). [Research purposes]

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

openai_trtllm

OpenAI compatible API for TensorRT LLM triton backend

License: MIT | Stargazers: 0 | Issues: 0

phi-1

Plug-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Promptify

Prompt Engineering | Use GPT or other prompt-based models to get structured output. Join our Discord for Prompt-Engineering, LLMs and other latest research

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

python-bpe

Byte Pair Encoding for Python!

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

PyTorch-Elmo-BiLSTMCRF

PyTorch BiLSTM-CRF with ELMo

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

RealtimeTTS

Converts text to speech in real time

Stargazers: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

unify-learning-paradigms

Data collator for UL2 and U-PaLM; surprisingly, no one had written or tried this as open source

Language: Python | Stargazers: 0 | Issues: 0