hgwu (hgwu4869)

hgwu4869

Geek Repo

Location:Shenzhen

Github PK Tool:Github PK Tool

hgwu's starred repositories

QAnything

Question and Answer based on Anything.

Language:PythonLicense:AGPL-3.0Stargazers:11594Issues:0Issues:0

Question-Generation-Paper-List

A summary of must-read papers for Neural Question Generation (NQG)

Stargazers:583Issues:0Issues:0

Awesome-LLM-RAG-Application

the resources about the application based on LLM with RAG pattern

Stargazers:788Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:359Issues:0Issues:0

Automatic_ticket_purchase

大麦网抢票脚本

Language:PythonLicense:MITStargazers:4296Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:935Issues:0Issues:0

nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

License:MITStargazers:9437Issues:0Issues:0

LLMsNineStoryDemonTower

【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。

Stargazers:1719Issues:0Issues:0

spider

scripts and baselines for Spider: Yale complex and cross-domain semantic parsing and text-to-SQL challenge

Language:PythonLicense:Apache-2.0Stargazers:815Issues:0Issues:0

chase

Project page of Chase

Language:PythonLicense:MITStargazers:76Issues:0Issues:0

gpt-3

GPT-3: Language Models are Few-Shot Learners

Stargazers:15667Issues:0Issues:0

data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

Stargazers:1036Issues:0Issues:0
Language:PythonLicense:MITStargazers:280Issues:0Issues:0

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

License:MITStargazers:916Issues:0Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:15332Issues:0Issues:0

Chinese-MobileBERT

Chinese MobileBERT(中文MobileBERT模型)

Language:PythonLicense:Apache-2.0Stargazers:78Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:7073Issues:0Issues:0

DuReader

Baseline Systems of DuReader Dataset

Language:PythonStargazers:1133Issues:0Issues:0

MLQA

New dataset

Language:PythonLicense:NOASSERTIONStargazers:294Issues:0Issues:0

cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

Language:PythonLicense:CC-BY-SA-4.0Stargazers:414Issues:0Issues:0

PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

Language:PythonLicense:Apache-2.0Stargazers:12019Issues:0Issues:0

bert4keras

keras implement of transformers for humans

Language:PythonLicense:Apache-2.0Stargazers:5365Issues:0Issues:0

djl

An Engine-Agnostic Deep Learning Framework in Java

Language:JavaLicense:Apache-2.0Stargazers:4093Issues:0Issues:0

baichuan_sft_lora

baichuan LLM surpervised finetune by lora

Language:PythonStargazers:58Issues:0Issues:0

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:26587Issues:0Issues:0

bytepiece

更纯粹、更高压缩率的Tokenizer

Language:PythonLicense:Apache-2.0Stargazers:443Issues:0Issues:0

PersonRelationKnowledgeGraph

ChinesePersonRelationGraph, person relationship extraction based on nlp methods.中文人物关系知识图谱项目,内容包括中文人物关系图谱构建,基于知识库的数据回标,基于远程监督与bootstrapping方法的人物关系抽取,基于知识图谱的知识问答等应用。

Language:PythonStargazers:891Issues:0Issues:0

embedding_model_test

基于开源embedding模型的中文向量效果测试

Language:PythonStargazers:120Issues:0Issues:0

MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

License:Apache-2.0Stargazers:640Issues:0Issues:0

text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:4419Issues:0Issues:0