SJYang (wns823)

wns823

Geek Repo

Company:KRAFTON Inc.

Location:Seoul, Republic of Korea

Github PK Tool:Github PK Tool

SJYang's starred repositories

llm-sp

Papers and resources related to the security and privacy of LLMs 🤖

Language:PythonLicense:Apache-2.0Stargazers:402Issues:0Issues:0

ADAS

Automated Design of Agentic Systems

Language:PythonLicense:Apache-2.0Stargazers:919Issues:0Issues:0

llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Language:PythonLicense:Apache-2.0Stargazers:515Issues:0Issues:0
Language:PythonStargazers:14Issues:0Issues:0

LLM-QAT

Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"

Language:PythonLicense:NOASSERTIONStargazers:242Issues:0Issues:0

torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Language:PythonLicense:BSD-3-ClauseStargazers:3241Issues:0Issues:0

hqq

Official implementation of Half-Quadratic Quantization (HQQ)

Language:PythonLicense:Apache-2.0Stargazers:670Issues:0Issues:0

qllm-eval

Code Repository of Evaluating Quantized Large Language Models

Language:PythonLicense:MITStargazers:95Issues:0Issues:0

llama-stack

Model components of the Llama Stack APIs

Language:PythonLicense:MITStargazers:3061Issues:0Issues:0

llama-stack-apps

Agentic components of the Llama Stack APIs

Language:PythonLicense:MITStargazers:3635Issues:0Issues:0

llmtools

Finetuning Large Language Models on One Consumer GPU in Under 4 Bits

Language:PythonStargazers:697Issues:0Issues:0

awesome-model-quantization

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

Stargazers:1826Issues:0Issues:0

kompute

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.

Language:C++License:Apache-2.0Stargazers:1965Issues:0Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:3832Issues:0Issues:0

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Language:PythonStargazers:806Issues:0Issues:0

pyreft

ReFT: Representation Finetuning for Language Models

Language:PythonLicense:Apache-2.0Stargazers:1116Issues:0Issues:0

redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:58Issues:0Issues:0

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptLicense:MITStargazers:13765Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:886Issues:0Issues:0
Language:PythonStargazers:166Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:16Issues:0Issues:0

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:16385Issues:0Issues:0

mambaformer-icl

MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248

Language:PythonLicense:Apache-2.0Stargazers:33Issues:0Issues:0

OSWorld

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Language:PythonLicense:Apache-2.0Stargazers:1137Issues:0Issues:0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

Language:PythonLicense:MITStargazers:13395Issues:0Issues:0

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1015Issues:0Issues:0

lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Language:PythonLicense:Apache-2.0Stargazers:1149Issues:0Issues:0

modules

🧩 Official registry of Rivet Modules.

Language:TypeScriptLicense:Apache-2.0Stargazers:110Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21763Issues:0Issues:0
Language:PythonLicense:MITStargazers:4017Issues:0Issues:0