Eric Lam's repositories

awesome-chatgpt-dataset

Unlock the Power of LLM: Explore These Datasets to Train Your Own ChatGPT!

Language:PythonLicense:GPL-3.0Stargazers:674Issues:12Issues:3

TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Language:PythonLicense:MITStargazers:534Issues:11Issues:23

TFkit

πŸ€–πŸ“‡ handling multiple nlp task in one pipeline

Language:PythonLicense:Apache-2.0Stargazers:56Issues:7Issues:9

SpeechMix

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together

aidev

Revolutionize your development workflow with AI-powered code assistance, automating mock tests, suggestions, and unit test generation in a single Python CLI tool.

Language:PythonStargazers:34Issues:3Issues:0

asrp

ASR text preprocessing utility

Language:PythonLicense:Apache-2.0Stargazers:20Issues:4Issues:2

nlp2go

πŸƒ hosting nlp models in one line

Language:CSSLicense:Apache-2.0Stargazers:20Issues:3Issues:0

nlp2

βš™οΈTool for NLP - handle file and text

Language:PythonLicense:GPL-3.0Stargazers:15Issues:4Issues:0

gpu-info-api

πŸ±β€πŸ’» GPU Info API is an API that provides detailed information about Nvidia, AMD, and Intel GPUs. The information is extracted from Wikipedia and stored in JSON format.

Language:PythonStargazers:8Issues:2Issues:0

t5lephone

phoneme byt5

Language:PythonStargazers:7Issues:3Issues:0

llm-estimator

Effortlessly predict training time, loss, and cost for LLM model training

Language:JavaScriptStargazers:6Issues:2Issues:0

DevLEGO

Create your development Env like LEGO blocks, run your projects on any device - be it a PC, Web, Phone or Tablet!

Language:ShellLicense:MITStargazers:4Issues:2Issues:0
Language:Jupyter NotebookStargazers:3Issues:3Issues:2

paperCrawler

A crawler for https://ndltd.ncl.edu.tw

Language:PythonStargazers:3Issues:1Issues:0

survey-builder

survey builder for human evaluation

Language:JavaScriptStargazers:3Issues:3Issues:0
Language:PythonStargazers:3Issues:3Issues:0

GSQA-GenerativeSpokenQuestionAnswering

Generative Spoken Question Answering

Language:PythonStargazers:2Issues:3Issues:0

t5-seq2seq-trainer

This is a simple example of using the T5 model for sequence-to-sequence tasks, leveraging Hugging Face's `Trainer` for efficient model training.

Language:PythonStargazers:2Issues:2Issues:0
Language:PythonStargazers:2Issues:3Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

stackexchange-dataset

Python tools for processing the stackexchange data dumps into a text dataset for Language Models

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

twcc-usage-slack-bot

TWCC GPU Usage Notification Slack Bot

Language:PythonStargazers:1Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

llm-training

to infinity and beyond

Stargazers:0Issues:2Issues:0

pytorch-lightning

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0