Beast code in Giters

AlexHT Hung's repositories

BLOOM-LORA

Because of LLaMA's license restrictions, we reimplement BLOOM-LoRA (the BLOOM license is much less restrictive; see https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json

Language: Jupyter Notebook | License: Apache-2.0

chineseocr_lite

Ultra-lightweight Chinese OCR that supports vertical text recognition and ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M

Language: C++ | License: GPL-2.0

client

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for Go, Java and Scala.

Language: C++ | License: BSD-3-Clause

common

Common source, scripts and utilities shared across all Triton repositories.

Language: C++ | License: BSD-3-Clause

data-preparation

Code used for sourcing and cleaning the BigScience ROOTS corpus

Language: Jupyter Notebook | License: Apache-2.0

data_tooling

Tools for managing datasets for governance and training.

License: Apache-2.0

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language: Python | License: Apache-2.0

DB

A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0

DifferentiableBinarization

DB (Real-time Scene Text Detection with Differentiable Binarization) implementation in Keras and Tensorflow

Language: Python

evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

License: Apache-2.0

flash-attention

Fast and memory-efficient exact attention

License: BSD-3-Clause

GMAN

GMAN: A Graph Multi-Attention Network for Traffic Prediction (GMAN, https://fanxlxmu.github.io/publication/aaai2020/) was accepted by AAAI-2020.

Language: Python | License: Apache-2.0

googlesearch

A Python library for scraping the Google search engine.

Language: Python | License: MIT

langchain

⚡ Building applications with LLMs through composability ⚡

Language: Python | License: MIT

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION

minio-cpp

MinIO C++ Client SDK for Amazon S3 Compatible Cloud Storage

License: Apache-2.0

olm-datasets

Pipeline for pulling and processing online language model pretraining data from the web

Language: Python | License: Apache-2.0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded and IoT devices)

Language: Python | License: Apache-2.0

alex-ht

BLOOM-LORA

chineseocr_lite

client

common

data-preparation

data_tooling

datasets

DB

DeepSpeed

DeepSpeedExamples

DifferentiableBinarization

evaluate

flash-attention

GMAN

googlesearch

k2chain

langchain

Megatron-DeepSpeed

minio-cpp

mistral-common

nemo_cp_debug

olm-datasets

PaddleOCR

sentence-transformers

sgpt

t-zero

TransformerEngine

vllm

yolov7