Xiang Jiang (xiangjjj)

Company: Amazon.com

Xiang Jiang's starred repositories

Language: Python | License: NOASSERTION | Stargazers: 2 | Issues: 0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4386 | Issues: 0

full

Evaluation of open-domain dialog using Follow-Ups Log-Likelihood (FULL) https://aclanthology.org/2022.coling-1.40/

Language: Python | License: MIT | Stargazers: 8 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8721 | Issues: 0

llm-foundry

LLM training code for Databricks foundation models

Language: Python | License: Apache-2.0 | Stargazers: 3840 | Issues: 0

DL-Hard

Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.

Stargazers: 35 | Issues: 0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 18392 | Issues: 0

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python | License: Apache-2.0 | Stargazers: 2127 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 362 | Issues: 0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language: Python | License: Apache-2.0 | Stargazers: 10800 | Issues: 0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python | License: Apache-2.0 | Stargazers: 36836 | Issues: 0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | License: MIT | Stargazers: 163735 | Issues: 0

DialoGPT

Large-scale pretraining for dialogue

Language: Python | License: MIT | Stargazers: 2335 | Issues: 0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | Stargazers: 1729 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 54192 | Issues: 0
License: Apache-2.0 | Stargazers: 96 | Issues: 0

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | Stargazers: 9072 | Issues: 0

deep-learning-containers

AWS Deep Learning Containers (DLCs) are a set of Docker images for training and serving models in TensorFlow, TensorFlow 2, PyTorch, and MXNet.

Language: Python | License: NOASSERTION | Stargazers: 961 | Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | Stargazers: 38322 | Issues: 0

cdx_toolkit

A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine

Language: Python | License: Apache-2.0 | Stargazers: 156 | Issues: 0

huggingface_hub

The official Python client for the Huggingface Hub.

Language: Python | License: Apache-2.0 | Stargazers: 1833 | Issues: 0

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT | Stargazers: 1500 | Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9338 | Issues: 0

ReFinED

ReFinED is an efficient and accurate entity linking (EL) system.

Language: Python | License: NOASSERTION | Stargazers: 175 | Issues: 0
Language: Python | License: MIT | Stargazers: 1430 | Issues: 0

LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

Language: Python | License: MIT | Stargazers: 459 | Issues: 0

cc-crawl-statistics

Statistics of Common Crawl monthly archives mined from URL index files

Language: Python | License: Apache-2.0 | Stargazers: 127 | Issues: 0

examples

📝 Examples of how to use Neptune for different use cases and with various MLOps tools

Language: Jupyter Notebook | License: MIT | Stargazers: 73 | Issues: 0

news-please

news-please - an integrated web crawler and information extractor for news that just works

Language: Python | License: Apache-2.0 | Stargazers: 1984 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 29805 | Issues: 0