KenAKAFrosty

followers

following

stars

Tampa, FL

Ken aka Frosty's repositories

example-movies

Language:TypeScript100

awesome-llm-interpretability

A curated list of Large Language Model (LLM) Interpretability resources.

000

axolotl

Go ahead and axolotl questions

Apache-2.0000

bun-simple-server

Language:TypeScript000

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

NOASSERTION000

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Apache-2.0000

gateway

A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API.

MIT000

KenAKAFrosty

000

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

000

llama-tokenizer-js

JS tokenizer for LLaMA

MIT000

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Apache-2.0000

LLM-Shearing

Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

MIT000

mergekit

Tools for merging pretrained large language models.

LGPL-3.0000

meta-llama

Inference code for LLaMA models

Language:PythonNOASSERTION000

MNIST

Language:PythonMIT000

NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

MIT000

onnx-model-zoo

A collection of pre-trained, state-of-the-art models in the ONNX format

Apache-2.0000

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

MIT000

Orion

Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model. Orion-14B 系列模型包括一个具有140亿参数的多语言基座大模型以及一系列相关的衍生模型，包括对话模型，长文本模型，量化模型，RAG微调模型，Agent微调模型等。

Apache-2.0000

ppsnexttest

Language:TypeScriptMIT000

qwik-auth-question

Language:TypeScript000

qwik-static-plus-live

Language:TypeScript000

skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Apache-2.0000

sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Apache-2.0000

sparsify

ML model optimization product to accelerate inference.

Apache-2.0000

ssr-benchmark

000

table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

MIT000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

MPL-2.0000

unredacter

Never ever ever use pixelation as a redaction technique

GPL-3.0000

unsloth

5X faster 60% less memory QLoRA finetuning

Apache-2.0000