HOANG Bao Tin's starred repositories

llama.cpp

LLM inference in C/C++

jieba

结巴中文分词

Language:PythonLicense:MITStargazers:32370Issues:1287Issues:840

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:31583Issues:334Issues:280

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:17156Issues:182Issues:723

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:11584Issues:133Issues:186

PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

Language:PythonLicense:Apache-2.0Stargazers:11383Issues:101Issues:3375

tweepy

Twitter for Python!

Language:PythonLicense:MITStargazers:10239Issues:268Issues:1275

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustLicense:Apache-2.0Stargazers:8392Issues:117Issues:914

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:6734Issues:109Issues:135

litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language:PythonLicense:Apache-2.0Stargazers:6528Issues:69Issues:586

openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

Language:C++License:Apache-2.0Stargazers:5867Issues:184Issues:2400

Bard-API

The unofficial python package that returns response of Google Bard through cookie value.

Language:PythonLicense:MITStargazers:5390Issues:48Issues:204

ethereum-org-website

Ethereum.org is a primary online resource for the Ethereum community.

Language:MarkdownLicense:MITStargazers:4759Issues:205Issues:2934

silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4533Issues:85Issues:118

Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Language:PythonLicense:Apache-2.0Stargazers:3872Issues:55Issues:288

Wikipedia

A Pythonic wrapper for the Wikipedia API

Language:PythonLicense:MITStargazers:2808Issues:84Issues:232

OnnxStream

Lightweight inference library for ONNX files, written in C++. It can run SDXL on a RPI Zero 2 but also Mistral 7B on desktops and servers.

Language:C++License:NOASSERTIONStargazers:1737Issues:24Issues:55

konlpy

Python package for Korean natural language processing.

Language:PythonLicense:NOASSERTIONStargazers:1390Issues:65Issues:338

Wikipedia-API

Python wrapper for Wikipedia

Language:PythonLicense:MITStargazers:527Issues:8Issues:59

vnstock

A powerful Python library for getting rich data from the Vietnam Stock Market using just a few lines of code

Language:PythonLicense:MITStargazers:357Issues:32Issues:50

GoogleNews

Script for GoogleNews

Language:PythonLicense:MITStargazers:305Issues:7Issues:105

curvesim

Simulates Curve Finance pools

Language:HTMLLicense:MITStargazers:145Issues:5Issues:130

kengdic

Joe Speigle's Korean/English dictionary database

cihai

Python library for CJK (Chinese, Japanese, and Korean) language dictionary

Language:PythonLicense:MITStargazers:77Issues:4Issues:16

sc-data

Content for SuttaCentral, including texts both legacy and bilara, parallels, structure, and other metadata.

Stargazers:40Issues:0Issues:0

googletranslatepy

Google Translate Client with `deep-translator`

Language:PythonStargazers:20Issues:0Issues:0

awesome

Awesome stuff built on top of SuttaCentral’s awesomeness

License:CC0-1.0Stargazers:12Issues:6Issues:0
Language:PythonLicense:CC0-1.0Stargazers:11Issues:0Issues:0
Language:PythonLicense:MITStargazers:2Issues:0Issues:0