BojanFaletic

followers

following

stars

searching for projects

Bojan's starred repositories

AgentLite

Language:Jupyter NotebookApache-2.036900

tpu-starter

Everything you want to know about Google Cloud TPU

Language:PythonCC-BY-4.047600

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.0744800

WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Language:Python912900

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonMIT585000

Sophia

Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.

Language:PythonApache-2.037400

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

StableLM

StableLM: Stability AI Language Models

Language:Jupyter NotebookApache-2.01585000

lollms-webui

Lord of Large Language Models Web User Interface

Language:VueApache-2.0415100

kinda-llama

An open-source replication and extension of the Meta AI's LLAMA dataset

Language:PythonApache-2.02400

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonApache-2.0909700

WebChatRWKVstic

ChatGPT-like Web UI for RWKVstic

Language:Python10000

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

NOASSERTION2607100

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Language:PythonApache-2.0372400

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language:PythonMIT292300

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonMIT1931100

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonApache-2.03688100

Enzyme

High-performance automatic differentiation of LLVM and MLIR.

Language:LLVMNOASSERTION121500

pytorch_forward_forward

Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation

Language:PythonMIT142900

PatchTST

An offical implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://arxiv.org/abs/2211.14730

Language:PythonApache-2.0144000

edm

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Language:PythonNOASSERTION123500

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookApache-2.0150100

irem_code_release

ICML 2022: Learning Iterative Reasoning through Energy Minimization

Language:PythonMIT4100

Mega-pytorch

Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena

Language:PythonMIT20300

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6558500

simplerecon

[ECCV 2022] SimpleRecon: 3D Reconstruction Without 3D Convolutions

Language:PythonNOASSERTION128400

sygil-webui

Stable Diffusion web UI

Language:PythonAGPL-3.0785400

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.02442700

Next-ViT

Language:PythonApache-2.054000

CodeRL

This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).

Language:PythonBSD-3-Clause48800