gucasdongzi

gucasdongzi

Geek Repo

Github PK Tool:Github PK Tool

gucasdongzi's starred repositories

LLaMA-Pro

[ACL 2024] Progressive LLaMA with Block Expansion.

Language:PythonLicense:Apache-2.0Stargazers:422Issues:0Issues:0

blog

Public repo for HF blog posts

Language:Jupyter NotebookStargazers:2093Issues:0Issues:0

Llama-Chinese

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Language:PythonStargazers:12538Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38147Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:53830Issues:0Issues:0

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:17727Issues:0Issues:0

notero

A Zotero plugin for syncing items and notes into Notion

Language:TypeScriptLicense:MITStargazers:2082Issues:0Issues:0

MOSS

An open-source tool-augmented conversational language model from Fudan University

Language:PythonLicense:Apache-2.0Stargazers:11863Issues:0Issues:0

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

License:MITStargazers:3117Issues:0Issues:0

data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

Stargazers:999Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7651Issues:0Issues:0

searchGPT

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.

Language:PythonLicense:MITStargazers:587Issues:0Issues:0

awesome-chatgpt-prompts-zh

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

License:MITStargazers:51040Issues:0Issues:0

ChatGPT

Reverse engineered ChatGPT API

Language:PythonLicense:GPL-2.0Stargazers:27995Issues:0Issues:0

EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Language:PythonLicense:Apache-2.0Stargazers:1971Issues:0Issues:0

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonLicense:Apache-2.0Stargazers:2356Issues:0Issues:0

deep-learning-resources

Books, courses, videos and blogs, mostly about Deep Learning.

License:MITStargazers:30Issues:0Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

Language:PythonLicense:Apache-2.0Stargazers:39646Issues:0Issues:0

printed-hw-segmentation

Printed and handwritten text segmentation using fully convolutional networks and CRF post-processing

Language:PythonLicense:MITStargazers:35Issues:0Issues:0

PERT

PERT: A Progressively Region-based Network for Scene Text Removal (TIP2023)

Language:PythonStargazers:32Issues:0Issues:0
Language:PythonLicense:MITStargazers:126Issues:0Issues:0

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Language:PythonLicense:Apache-2.0Stargazers:1958Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:127343Issues:0Issues:0

NRLPapers

Must-read papers on network representation learning (NRL) / network embedding (NE)

Language:TeXStargazers:2521Issues:0Issues:0

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonLicense:MITStargazers:22397Issues:0Issues:0

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37274Issues:0Issues:0

finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Language:PythonLicense:MITStargazers:2109Issues:0Issues:0

xflow

Distributed LR、 FM model on Parameter Server. FTRL and SGD Optimization Algorithm.

Language:C++Stargazers:219Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:15448Issues:0Issues:0

Texygen

A text generation benchmarking platform

Language:PythonLicense:MITStargazers:860Issues:0Issues:0