StephennFernandes

Stephen Fernandes's starred repositories

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language: Python · License: MIT · 25437 269 680

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook · License: NOASSERTION · 10922 88 300

ml-engineering

Machine Learning Engineering Open Book

Language: Python · License: CC-BY-SA-4.0 · 10311 107 18

Red-Teaming-Toolkit

This repository contains cutting-edge open-source security tools (OST) for a red teamer and threat hunter.

License: GPL-3.0 · 8763 423 15

chat-ui

Open source codebase powering the HuggingChat app

Language: TypeScript · License: Apache-2.0 · 6908 82 527

rags

Build ChatGPT over your data, all with natural language

Language: Python · License: MIT · 6123 55 38

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python · License: GPL-3.0 · 5626 78 142

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python · License: MIT · 4444 121 54

CTranslate2

Fast inference engine for Transformer models

Language: C++ · License: MIT · 3098 57 663

maxtext

A simple, performant and scalable Jax LLM!

Language: Python · License: Apache-2.0 · 1388 26 74

functorch

functorch is JAX-like composable function transforms for PyTorch.

Language: Jupyter Notebook · License: BSD-3-Clause · 1381 28 520

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language: Python · License: Apache-2.0 · 651 12 29

fairseq2

FAIR Sequence Modeling Toolkit 2

Language: Python · License: MIT · 638 18 97

tpu-starter

Everything you want to know about Google Cloud TPU

Language: Python · License: CC-BY-4.0 · 476 8 3

LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

Language: Python · License: MIT · 458 22 7

pytorch_memonger

Experimental ground for optimizing memory of pytorch models

Language: Python · License: GPL-3.0 · 353 11 10

llm-seminar

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

307 180

Squeezeformer

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Language: Python · License: Apache-2.0 · 241 15 4

electra-pytorch

A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch

Language: Python · License: MIT · 221 9 11

git-theta

git extension for {collaborative, communal, continual} model development

Language: Python · License: Apache-2.0 · 198 8 135

IndicTrans2

Translation models for 22 scheduled languages of India

Language: Python · License: MIT · 197 9 77

FEP_Active_Inference_Papers

A repository for major/influential FEP and active inference papers.

Language: TeX · License: MIT · 170 23 1

openai_trtllm

OpenAI compatible API for TensorRT LLM triton backend

Language: Rust · License: MIT · 123 6 14

flacuna

Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is already an excellent writing assistant, and the intention behind Flacuna was to enhance Vicuna's problem-solving capabilities. To achieve this, we curated a dedicated instruction dataset called Flan-mini.

Language: Python · 109 3 4