Stephen Fernandes (StephennFernandes)

Location: Goa, India

Home Page: www.stephenfernandes.com

Stephen Fernandes's starred repositories

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language: Python | License: MIT | Stargazers: 25437 | Issues: 269 | Issues: 680

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10922 | Issues: 88 | Issues: 300

ml-engineering

Machine Learning Engineering Open Book

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 10311 | Issues: 107 | Issues: 18

Red-Teaming-Toolkit

This repository contains cutting-edge open-source security tools (OST) for a red teamer and threat hunter.

chat-ui

Open source codebase powering the HuggingChat app

Language: TypeScript | License: Apache-2.0 | Stargazers: 6908 | Issues: 82 | Issues: 527

rags

Build ChatGPT over your data, all with natural language

Language: Python | License: MIT | Stargazers: 6123 | Issues: 55 | Issues: 38

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python | License: GPL-3.0 | Stargazers: 5626 | Issues: 78 | Issues: 142

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python | License: MIT | Stargazers: 4444 | Issues: 121 | Issues: 54

CTranslate2

Fast inference engine for Transformer models

Language: C++ | License: MIT | Stargazers: 3098 | Issues: 57 | Issues: 663

maxtext

A simple, performant and scalable Jax LLM!

Language: Python | License: Apache-2.0 | Stargazers: 1388 | Issues: 26 | Issues: 74

functorch

functorch is JAX-like composable function transforms for PyTorch.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 1381 | Issues: 28 | Issues: 520

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language: Python | License: Apache-2.0 | Stargazers: 651 | Issues: 12 | Issues: 29

fairseq2

FAIR Sequence Modeling Toolkit 2

Language: Python | License: MIT | Stargazers: 638 | Issues: 18 | Issues: 97

tpu-starter

Everything you want to know about Google Cloud TPU

Language: Python | License: CC-BY-4.0 | Stargazers: 476 | Issues: 8 | Issues: 3

LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

Language: Python | License: MIT | Stargazers: 458 | Issues: 22 | Issues: 7

pytorch_memonger

Experimental ground for optimizing the memory usage of PyTorch models

Language: Python | License: GPL-3.0 | Stargazers: 353 | Issues: 11 | Issues: 10

llm-seminar

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

Squeezeformer

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Language: Python | License: Apache-2.0 | Stargazers: 241 | Issues: 15 | Issues: 4

electra-pytorch

A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in PyTorch

Language: Python | License: MIT | Stargazers: 221 | Issues: 9 | Issues: 11

git-theta

git extension for {collaborative, communal, continual} model development

Language: Python | License: Apache-2.0 | Stargazers: 198 | Issues: 8 | Issues: 135

IndicTrans2

Translation models for 22 scheduled languages of India

Language: Python | License: MIT | Stargazers: 197 | Issues: 9 | Issues: 77

FEP_Active_Inference_Papers

A repository for major/influential FEP and active inference papers.

Language: TeX | License: MIT | Stargazers: 170 | Issues: 23 | Issues: 1

openai_trtllm

OpenAI compatible API for TensorRT LLM triton backend

Language: Rust | License: MIT | Stargazers: 123 | Issues: 6 | Issues: 14

flacuna

Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is already an excellent writing assistant, and the intention behind Flacuna was to enhance Vicuna's problem-solving capabilities. To achieve this, we curated a dedicated instruction dataset called Flan-mini.

Distributed-Multi-Video-Streaming-and-Processing-with-Kafka

Stream and process multiple videos in near real time using Kafka. A machine learning model runs inference on the video frames, and the results are stored in a MongoDB database.

PyTorch-Elmo-BiLSTMCRF

PyTorch BiLSTM-CRF with ELMo

Language: Python | License: MIT | Stargazers: 54 | Issues: 2 | Issues: 2

unify-learning-paradigms

data collator for UL2 and U-PaLM

swissbert

The multilingual language model for Switzerland

Language: Jupyter Notebook | License: MIT | Stargazers: 25 | Issues: 1 | Issues: 1

openhathi_instruct

This repository contains the code for dataset curation and fine-tuning of the instruct variant of the bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and Hinglish.

Language: Jupyter Notebook | Stargazers: 23 | Issues: 4 | Issues: 4

slam_with_vit

Visual SLAM for Mobile Robots with Vision Transformer (ViT)

Language: Python | License: MIT | Stargazers: 12 | Issues: 2 | Issues: 1