Bojan (BojanFaletic)

Company: searching for projects

Bojan's starred repositories

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 369 | Issues: 0

tpu-starter

Everything you want to know about Google Cloud TPU

Language: Python | License: CC-BY-4.0 | Stargazers: 476 | Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7448 | Issues: 0

WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Language: Python | Stargazers: 9129 | Issues: 0

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python | License: MIT | Stargazers: 5850 | Issues: 0

Sophia

Effortless plug-and-play optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 374 | Issues: 0

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

Stargazers: 1182 | Issues: 0

StableLM

StableLM: Stability AI Language Models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 15850 | Issues: 0

lollms-webui

Lord of Large Language Models Web User Interface

Language: Vue | License: Apache-2.0 | Stargazers: 4151 | Issues: 0

kinda-llama

An open-source replication and extension of Meta AI's LLaMA dataset

Language: Python | License: Apache-2.0 | Stargazers: 24 | Issues: 0

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | Stargazers: 9097 | Issues: 0

WebChatRWKVstic

ChatGPT-like Web UI for RWKVstic

Language: Python | Stargazers: 100 | Issues: 0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License: NOASSERTION | Stargazers: 26071 | Issues: 0

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Language: Python | License: Apache-2.0 | Stargazers: 3724 | Issues: 0

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language: Python | License: MIT | Stargazers: 2923 | Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19311 | Issues: 0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python | License: Apache-2.0 | Stargazers: 36881 | Issues: 0

Enzyme

High-performance automatic differentiation of LLVM and MLIR.

Language: LLVM | License: NOASSERTION | Stargazers: 1215 | Issues: 0

pytorch_forward_forward

Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation

Language: Python | License: MIT | Stargazers: 1429 | Issues: 0

PatchTST

An official implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://arxiv.org/abs/2211.14730

Language: Python | License: Apache-2.0 | Stargazers: 1440 | Issues: 0

edm

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Language: Python | License: NOASSERTION | Stargazers: 1235 | Issues: 0

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1501 | Issues: 0

irem_code_release

ICML 2022: Learning Iterative Reasoning through Energy Minimization

Language: Python | License: MIT | Stargazers: 41 | Issues: 0

Mega-pytorch

Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena

Language: Python | License: MIT | Stargazers: 203 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 65585 | Issues: 0

simplerecon

[ECCV 2022] SimpleRecon: 3D Reconstruction Without 3D Convolutions

Language: Python | License: NOASSERTION | Stargazers: 1284 | Issues: 0

sygil-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 7854 | Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 24427 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 540 | Issues: 0

CodeRL

Official code for the paper "CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning" (NeurIPS 2022).

Language: Python | License: BSD-3-Clause | Stargazers: 488 | Issues: 0