jieliorz

jieliorz

Geek Repo

Github PK Tool:Github PK Tool

jieliorz's starred repositories

qwen.cpp

C++ implementation of Qwen-LM

Language:C++License:NOASSERTIONStargazers:514Issues:0Issues:0

Chinese-text-correction-papers

text correction papers

Stargazers:281Issues:0Issues:0

text_scalpel

Modify Chinese text, modified on LaserTagger Model. I name it "文本手术刀".目前,本项目实现了一个文本复述任务,用于NLP语料的数据增强。

Language:PythonLicense:Apache-2.0Stargazers:210Issues:0Issues:0

A-Guide-to-Retrieval-Augmented-LLM

an intro to retrieval augmented large language model

Stargazers:249Issues:0Issues:0

llama3-Chinese-chat

Llama3 中文仓库(聚合资料,各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

Language:PythonStargazers:3348Issues:0Issues:0

build_MiniLLM_from_scratch

从0到1构建一个MiniLLM

Language:PythonLicense:MITStargazers:252Issues:0Issues:0

tiny-llm-zh

从零实现一个小参数量中文大语言模型。

Language:PythonStargazers:99Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:328Issues:0Issues:0

Synonyms

:herb: 中文近义词:聊天机器人,智能问答工具包

Language:PythonLicense:NOASSERTIONStargazers:4999Issues:0Issues:0

full-stack-fastapi-template

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

Language:TypeScriptLicense:MITStargazers:24777Issues:0Issues:0

Awesome-pytorch-list

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

Stargazers:15186Issues:0Issues:0

basic_Machine_Learning

机器学习和深度学习入门教程

Language:Jupyter NotebookLicense:MITStargazers:33Issues:0Issues:0

roberta_zh

RoBERTa中文预训练模型: RoBERTa for Chinese

Language:PythonStargazers:2568Issues:0Issues:0

llm-action

本项目旨在分享大模型相关技术原理以及实战经验。

Language:HTMLLicense:Apache-2.0Stargazers:8025Issues:0Issues:0

SynoCN

中文近义词表 Chinese Synonyms

Stargazers:240Issues:0Issues:0

ChineseNLPCorpus

中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。

Language:PythonStargazers:4171Issues:0Issues:0

baby-llama2-chinese_cybertron

使用单个24G显卡,从0开始训练LLM

Language:PythonLicense:MITStargazers:39Issues:0Issues:0

ChineseNlpCorpus

搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。

Language:Jupyter NotebookStargazers:5720Issues:0Issues:0

Awesome-LLM4IE-Papers

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

Stargazers:585Issues:0Issues:0

KgCLUEbench

benchmark of KgCLUE, with different models and methods

Language:PythonStargazers:27Issues:0Issues:0

KgCLUE

KgCLUE: 大规模中文开源知识图谱问答

Language:PythonStargazers:414Issues:0Issues:0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonLicense:Apache-2.0Stargazers:2108Issues:0Issues:0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language:PythonStargazers:9651Issues:0Issues:0

Sememe-SC

Source code and data for ACL 2019 paper "Modeling Semantic Compositionality with Sememe Knowledge"

Language:PythonLicense:MITStargazers:35Issues:0Issues:0

MultiRD

Code and data of the AAAI-20 paper "Multi-channel Reverse Dictionary Model"

Language:PythonLicense:MITStargazers:107Issues:0Issues:0

WantWords

An open-source online reverse dictionary.

Language:JavaScriptStargazers:6977Issues:0Issues:0

HyponymyExtraction

HyponymyExtraction and Graph based on KB Schema, Baike-kb and online text extract, 基于知识概念体系,百科知识库,以及在线搜索结构化方式的词语上下位抽取与可视化展示

Language:PythonStargazers:162Issues:0Issues:0

CLUEDatasetSearch

搜索所有中文NLP数据集,附常用英文NLP数据集

Language:PythonStargazers:4025Issues:0Issues:0

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

License:MITStargazers:3244Issues:0Issues:0

Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Language:PythonLicense:Apache-2.0Stargazers:7005Issues:0Issues:0