Doohae Jung (wavy.ocean) (Doohae)

Doohae

Geek Repo

Company:@kakaobrain

Location:Seoul, Korea

Github PK Tool:Github PK Tool

Doohae Jung (wavy.ocean)'s starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49217Issues:561Issues:202

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:24867Issues:208Issues:210

Bend

A massively parallel, high-level programming language

Language:RustLicense:Apache-2.0Stargazers:16930Issues:91Issues:203

candle

Minimalist ML framework for Rust

Language:RustLicense:Apache-2.0Stargazers:14715Issues:147Issues:634

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language:PythonLicense:MITStargazers:11482Issues:72Issues:260

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11036Issues:163Issues:224

gorilla

Gorilla: An API store for LLMs

Language:PythonLicense:Apache-2.0Stargazers:10939Issues:102Issues:198

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10917Issues:88Issues:300

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9756Issues:84Issues:247

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:3414Issues:36Issues:309

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language:PythonLicense:NOASSERTIONStargazers:2776Issues:28Issues:267

scancode-toolkit

:mag: ScanCode detects licenses, copyrights, dependencies by "scanning code" ... to discover and inventory open source and third-party packages used in your code. Sponsored by NLnet project https://nlnet.nl/project/vulnerabilitydatabase, the Google Summer of Code, Azure credits, nexB and others generous sponsors!

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1911Issues:19Issues:77

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:1826Issues:16Issues:149

starcoder2

Home of StarCoder2!

Language:PythonLicense:Apache-2.0Stargazers:1637Issues:17Issues:17

SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Language:PythonLicense:MITStargazers:1536Issues:22Issues:115

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language:PythonLicense:Apache-2.0Stargazers:1287Issues:17Issues:48

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonLicense:Apache-2.0Stargazers:1204Issues:14Issues:355

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language:PythonLicense:Apache-2.0Stargazers:1122Issues:40Issues:11

lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Language:PythonLicense:Apache-2.0Stargazers:1093Issues:33Issues:348

resource-stream

CUDA related news and material links

dolma

Data and tools for generating and inspecting OLMo pre-training data.

Language:PythonLicense:Apache-2.0Stargazers:859Issues:18Issues:67

DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Language:PythonLicense:MITStargazers:729Issues:13Issues:24

NeMo-Curator

Scalable toolkit for data curation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:369Issues:16Issues:65

Local-Code-Interpreter

A local implementation of OpenAI's ChatGPT Code Interpreter.

Language:PythonLicense:Apache-2.0Stargazers:270Issues:4Issues:30

gsm8k-ScRel

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

LiveCodeBench

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

Language:PythonLicense:MITStargazers:141Issues:4Issues:22

LogicKor

한국어 언어모델 다분야 사고력 벤치마크

the-stack-v2

Code for the curation of The Stack v2 and StarCoder2 training data

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:79Issues:5Issues:5