JasonGuo's starred repositories

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:80093Issues:1729Issues:42997

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33679Issues:340Issues:2634

jieba

结巴中文分词

Language:PythonLicense:MITStargazers:32775Issues:1283Issues:845

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:27455Issues:246Issues:6993

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:25616Issues:171Issues:4136

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:22838Issues:186Issues:180

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8689Issues:78Issues:975

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustLicense:Apache-2.0Stargazers:8679Issues:120Issues:947

pkuseg-python

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Language:PythonLicense:MITStargazers:6472Issues:208Issues:165

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:5732Issues:48Issues:967

nlp_paper_study

该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记

ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Language:PythonLicense:Apache-2.0Stargazers:3632Issues:32Issues:374

docarray

Represent, send, store and search multimodal data

Language:PythonLicense:Apache-2.0Stargazers:2840Issues:43Issues:637

NRLPapers

Must-read papers on network representation learning (NRL) / network embedding (NE)

Kyty

PS4 & PS5 emulator

Language:C++License:MITStargazers:2447Issues:133Issues:64

wechatDownload

微信公众号文章批量下载工具,支持图片、评论下载,支持保存html/md/pdf/docx文件

suyu

suyu is the continuation of the world's most popular, open-source, Nintendo Switch emulator, yuzu. It is written in C++ with portability in mind, and we're actively working on builds for Windows, Linux and Android.

Language:C++License:GPL-3.0Stargazers:2047Issues:49Issues:0

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language:PythonLicense:Apache-2.0Stargazers:1545Issues:18Issues:529

transformers-code

手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube

Language:Jupyter NotebookStargazers:1465Issues:14Issues:9

FastEdit

🩹Editing large language models within 10 seconds⚡

Language:PythonLicense:Apache-2.0Stargazers:1227Issues:14Issues:26

SDT

This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR23).

Language:PythonLicense:MITStargazers:922Issues:11Issues:77

GPTRouter

Smoothly Manage Multiple LLMs (OpenAI, Anthropic, Azure) and Image Models (Dall-E, SDXL), Speed Up Responses, and Ensure Non-Stop Reliability.

Language:TypeScriptLicense:MITStargazers:402Issues:10Issues:10

FakeNewsCorpus

A dataset of millions of news articles scraped from a curated list of data sources.

papers.cool

Cool Papers - Immersive Paper Discovery

LLMtuner

FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)

Language:PythonLicense:Apache-2.0Stargazers:207Issues:3Issues:4

handwriting-web

将文本转为模拟手写文字的网页版

Language:VueLicense:MITStargazers:159Issues:3Issues:19

Pytorch-Base-Trainer

Pytorch分布式训练框架

Language:PythonLicense:MITStargazers:61Issues:4Issues:4

EasyLLM

make LLM easier to use

Language:PythonStargazers:59Issues:2Issues:0

ST-w-Scorer-ABSA

Released code for「Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction」in ACL2024.

Language:PythonStargazers:10Issues:0Issues:0