Chris Ha (chris-ha458)

Company: Independent research with EleutherAI, DuckAI

Location: Seoul

Twitter: @meditech57

Chris Ha's repositories

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, MobileNet-V3/V2/V1, MNASNet, Single-Path NAS, FBNet, and more

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

txlsh

TLSH with PyO3, and soon xxHash

Language: Rust | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

wyhash-rs

wyhash: a fast, portable, non-cryptographic hashing algorithm and random number generator, in Rust

Language: Rust | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

BigLittleNet

Official repository for Big-Little Net

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HTML | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CMT_CNN-meet-Vision-Transformer

A PyTorch implementation of CMT, based on the paper "CMT: Convolutional Neural Networks Meet Vision Transformers".

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

data_tooling

Tools for managing datasets for governance and training.

Language: HTML | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

dps

Data processing system for polyglot

Stargazers: 0 | Issues: 0 | Issues: 0

fast-counter

Faster concurrent atomic number updates

Language: Rust | Stargazers: 0 | Issues: 0 | Issues: 0

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

open-lid-dataset

Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., upcoming)

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

oscar-tools

The original tooling for the OSCAR corpus, rewritten in Rust

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

oscar-website

The website of the OSCAR project

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

peS2o

Pretraining Efficiently on S2ORC!

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pii-transform

Perform transformations on PII instances detected in documents

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

rust-bloom-filter

A fast Bloom filter implementation in Rust

License: BSD-2-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

rust-github-demo

This is for demoing features of GitHub

License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Rust | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

tlsh

xxh-enhanced version of the Rust port of TLSH

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

ungoliant

🕷 The pipeline for the OSCAR corpus

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

VoCapXLM

Code for the EMNLP 2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"

Stargazers: 0 | Issues: 0 | Issues: 0

warc-specifications

Centralised repository for WARC usage specifications.

Stargazers: 0 | Issues: 0 | Issues: 0

xxhash-c-sys

Rust raw bindings to xxHash

License: BSL-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0