honglu2875

Honglu Fan's repositories

hironaka

A utility package for the Hironaka game of local resolution of singularities

Language: Python · MIT · 5 2 14

thing

Catch your tensors in one program and quietly send them to another live Python session.

Language: Python · Apache-2.0 · 4 10

mistral_jax

Mistral model in JAX

Language: Python · Apache-2.0 · 2 10

_diff_model

Scripts and workflows documenting diff model training

Language: Python · 1 10

fmlang_env

Toy gym env related to formal languages.

Language: Python · MIT · 1 10

hironaka-experiments

Documents the experiments of the hironaka project

Language: Python · MIT · 010

_ft_dockerfile

Language: Python · 010

Algorithm-Distillation-RLHF

Language: Python · MIT · 000

ArchitextRL

Language: Python · MIT · 000

aria

Language: Python · Apache-2.0 · 000

capabilities

Blazon Capabilities SDK

Language: Python · Apache-2.0 · 000

composer

Train neural networks up to 7x faster

Language: Python · Apache-2.0 · 000

deep_cfg

Language: Python · 010

examples

Fast and flexible reference benchmarks

Language: Python · Apache-2.0 · 000

go

Implements RL (MCTS) for Go.

Language: C++ · NOASSERTION · 020

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Language: Python · Apache-2.0 · 000

hironaka_v2

A clean redo using only JAX, rebuilt around a simpler design.

MIT · 010

honglu2875

Config files for my GitHub profile.

010

honkhonk

honk honk honk honk, honk honk!

MIT · 010

honx

honx honx!

Language: Python · 010

jag

Just Another deep learninG framework

Language: Python · MIT · 010

jaxformer

Minimal library to train LLMs on TPU in JAX with pjit().

Language: Python · BSD-3-Clause · 000

llama

Inference code for LLaMA models

Language: Python · GPL-3.0 · 000

llama.cpp

Port of Facebook's LLaMA model in C/C++

Language: C · MIT · 000

lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

Language: Python · MIT · 000

slightly-faster-gpt

A slightly faster GPT-J than the Hugging Face implementation

MIT · 010

test_github_action

Language: Python · 01 1

tweets

Janky Twitter replacement

MIT · 000

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language: Python · MIT · 000

yarn-patch

This repo exposes simple APIs to patch the YaRN technique into the rotary embeddings of a given Hugging Face model.

MIT · 010