Suzie Oh (ohsuz)

ohsuz

Geek Repo

Home Page:ohsuz.dev

Github PK Tool:Github PK Tool


Organizations
bcaitech1
DSBA-Lab
Fashion-Reader
HAE-RAE
MINIONS-KR
TEAM-IKYO
team-vvave
wisdomify

Suzie Oh's starred repositories

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Language:PythonLicense:MITStargazers:334Issues:0Issues:0

Sensei

Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI

Language:PythonStargazers:219Issues:0Issues:0

SeaLLMs

[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia

Language:JavaScriptStargazers:139Issues:0Issues:0

synthesizer

A multi-purpose LLM framework for RAG and data creation.

Language:PythonLicense:Apache-2.0Stargazers:612Issues:0Issues:0

quickvid

Summarize, Verify & Chat with any YouTube video in seconds.

Language:TypeScriptLicense:GPL-3.0Stargazers:157Issues:0Issues:0

Skywork-MoE

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

Stargazers:120Issues:0Issues:0

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Language:PythonLicense:MITStargazers:768Issues:0Issues:0

fastRAG

Efficient Retrieval Augmentation and Generation Framework

Language:PythonLicense:Apache-2.0Stargazers:1223Issues:0Issues:0

awesome-japanese-llm

日本語LLMまとめ - Overview of Japanese LLMs

Language:TypeScriptLicense:Apache-2.0Stargazers:914Issues:0Issues:0

anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Language:Jupyter NotebookLicense:MITStargazers:4085Issues:0Issues:0

courses

Anthropic's educational courses

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1008Issues:0Issues:0

frontend

구름톤 8기 대상 '마을엔'

Language:TypeScriptStargazers:3Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:17Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:396Issues:0Issues:0

CALM-pytorch

Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind

Language:PythonLicense:MITStargazers:153Issues:0Issues:0

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language:PythonLicense:Apache-2.0Stargazers:580Issues:0Issues:0

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptLicense:MITStargazers:12412Issues:0Issues:0

augmentoolkit

Convert Compute And Books Into Instruct-Tuning Datasets (or classifiers)!

Language:PythonLicense:MITStargazers:745Issues:0Issues:0

llamaduo

This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM. For this project, we have initially chosen Gemini 1.0 Pro for service type LLM and Gemma 2B/7B for small sized LLM model. It now supports other service LLMs such as GPT4 and Claude3.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:186Issues:0Issues:0

Phi-3CookBook

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.

Language:Jupyter NotebookLicense:MITStargazers:1579Issues:0Issues:0

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonLicense:MITStargazers:470Issues:0Issues:0

Vodalus-Expert-LLM-Forge

Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation editor Gradio UI.

Language:Jupyter NotebookStargazers:144Issues:0Issues:0

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:PythonStargazers:9161Issues:0Issues:0

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Language:PythonLicense:MITStargazers:9870Issues:0Issues:0

AutoCrawler

Official implement of paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"

Language:PythonLicense:Apache-2.0Stargazers:389Issues:0Issues:0

dsRAG

High-performance retrieval engine for unstructured data

Language:PythonLicense:MITStargazers:709Issues:0Issues:0

muse

Let's create synthetic textbooks together :)

Language:PythonLicense:MITStargazers:69Issues:0Issues:0

nlp-datasets

Curation note of NLP datasets

Stargazers:91Issues:0Issues:0