Ken aka Frosty (KenAKAFrosty)

KenAKAFrosty

Geek Repo

Location:Tampa, FL

Twitter:@KenAKAFrosty

Github PK Tool:Github PK Tool

Ken aka Frosty's repositories

Language:TypeScriptStargazers:1Issues:0Issues:0

awesome-llm-interpretability

A curated list of Large Language Model (LLM) Interpretability resources.

Stargazers:0Issues:0Issues:0

axolotl

Go ahead and axolotl questions

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:TypeScriptStargazers:0Issues:0Issues:0

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

License:NOASSERTIONStargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

License:Apache-2.0Stargazers:0Issues:0Issues:0

gateway

A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API.

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Stargazers:0Issues:0Issues:0

llama-tokenizer-js

JS tokenizer for LLaMA

License:MITStargazers:0Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

License:Apache-2.0Stargazers:0Issues:0Issues:0

LLM-Shearing

Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

License:MITStargazers:0Issues:0Issues:0

mergekit

Tools for merging pretrained large language models.

License:LGPL-3.0Stargazers:0Issues:0Issues:0

meta-llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

License:MITStargazers:0Issues:0Issues:0

onnx-model-zoo

A collection of pre-trained, state-of-the-art models in the ONNX format

License:Apache-2.0Stargazers:0Issues:0Issues:0

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

License:MITStargazers:0Issues:0Issues:0

Orion

Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model. Orion-14B 系列模型包括一个具有140亿参数的多语言基座大模型以及一系列相关的衍生模型,包括对话模型,长文本模型,量化模型,RAG微调模型,Agent微调模型等。

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0
Language:TypeScriptStargazers:0Issues:0Issues:0
Language:TypeScriptStargazers:0Issues:0Issues:0

skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

License:Apache-2.0Stargazers:0Issues:0Issues:0

sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

License:Apache-2.0Stargazers:0Issues:0Issues:0

sparsify

ML model optimization product to accelerate inference.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

License:MITStargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License:MPL-2.0Stargazers:0Issues:0Issues:0

unredacter

Never ever ever use pixelation as a redaction technique

License:GPL-3.0Stargazers:0Issues:0Issues:0

unsloth

5X faster 60% less memory QLoRA finetuning

License:Apache-2.0Stargazers:0Issues:0Issues:0