ohsuz

Suzie Oh's starred repositories

Perplexica

Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

Language: TypeScript | License: MIT | 12411 93 211

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | 11357 93 310

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Language: Python | License: MIT | 9867 69 76

WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Language: Python | 9162 111 189

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | 7936 47 533

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | License: MIT | 6506 39 932

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language: Python | License: Apache-2.0 | 1858 44 107

Phi-3CookBook

This is a cookbook for getting started with Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and the next size up across a variety of language, reasoning, coding, and math benchmarks.

Language: Jupyter Notebook | License: MIT | 1578 12 47

clean-text

🧹 Python package for text cleaning

Language: Python | License: NOASSERTION | 941 14 29

augmentoolkit

Convert Compute And Books Into Instruct-Tuning Datasets (or classifiers)!

Language: Python | License: MIT | 743 17 32

MergeLM

Codebase for Merging Language Models (ICML 2024)

Language: Python | 723 7 32

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python | License: Apache-2.0 | 579 9 39

awesome-generative-information-retrieval

577 22 7

spRAG

Retrieval engine for unstructured data

Language: Python | License: MIT | 518 6 8

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language: Python | License: MIT | 470 8 6

kss

KSS: Korean String processing Suite

Language: Python | License: BSD-3-Clause | 403 4 57

cosmopedia

Language: Python | License: Apache-2.0 | 395 12 10

AutoCrawler

Official implementation of the paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"

Language: Python | License: Apache-2.0 | 388 10 6

Autonomous-Agents

Autonomous Agents (LLMs) research papers. Updated Daily.

License: MIT | 320 260

llm-continual-learning-survey

Continual Learning of Large Language Models: A Comprehensive Survey

190 50

llamaduo

This project showcases an LLMOps pipeline that fine-tunes a small LLM to prepare for an outage of the service LLM. For this project, we initially chose Gemini 1.0 Pro as the service LLM and Gemma 2B/7B as the small LLM. It now supports other service LLMs such as GPT4 and Claude3.

Language: Jupyter Notebook | License: Apache-2.0 | 177 5 7

PruneMe

CALM-pytorch

Vodalus-Expert-LLM-Forge

library-of-phi

nlp-datasets

muse

Ko-Fine-tuning_DataGen

AVeriTeC

KtrlF