Lu junru (LuJunru)

LuJunru

Geek Repo

Location:Coventry, UK

Home Page:https://lujunru.github.io/

Github PK Tool:Github PK Tool

Lu junru's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40665Issues:394Issues:1296

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36938Issues:351Issues:1826

faiss

A library for efficient similarity search and clustering of dense vectors.

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:18374Issues:184Issues:731

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8148Issues:102Issues:107

AugLy

A data augmentations library for audio, image, text, and video.

Language:PythonLicense:NOASSERTIONStargazers:4959Issues:79Issues:74

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

QA-Survey-CN

北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA),基于表格的问答系统(TableQA)、基于视觉的问答系统(VisualQA)和机器阅读理解(MRC)等,每类任务分别对学术界和工业界进行了相关总结。

question_generation

Neural question generation using transformers

Language:Jupyter NotebookLicense:MITStargazers:1105Issues:23Issues:91

KILT

Library for Knowledge Intensive Language Tasks

Language:PythonLicense:MITStargazers:915Issues:25Issues:31

Question-Generation-Paper-List

A summary of must-read papers for Neural Question Generation (NQG)

Awesome-LLM-Eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

ANCE

A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks

Language:PythonLicense:MITStargazers:363Issues:11Issues:15

MRQA-Shared-Task-2019

Resources for the MRQA 2019 Shared Task

Language:PythonLicense:MITStargazers:292Issues:19Issues:31

gisting

Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467

Language:PythonLicense:Apache-2.0Stargazers:265Issues:6Issues:21

Datasets

Poetry-related datasets developed by THUAIPoet (Jiuge) group.

python-turtle-draw-svg

A python program for draw SVG file using turtle package.

Language:PythonLicense:GPL-3.0Stargazers:190Issues:7Issues:8

StylisticPoetry

Codes for Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement (EMNLP 2018)

Info-HCVAE

[ACL 2020] Generating Diverse and Consistent QA pairs from Contexts with Information-Maximizing Hierarchical Conditional VAEs

Language:PythonLicense:Apache-2.0Stargazers:172Issues:4Issues:17

silent_speech

Code for voicing silent speech from EMG. Official repository for the papers "Digital Voicing of Silent Speech" at EMNLP 2020 and "An Improved Model for Voicing Silent Speech" at ACL 2021. Also includes code for converting silent speech to text.

Language:PythonLicense:MITStargazers:115Issues:7Issues:5

RE2RNN

Source code for the EMNLP 2020 paper "Cold-Start and Interpretability: Turning Regular Expressions intoTrainable Recurrent Neural Networks"

Instructdial

Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning

Language:PythonLicense:Apache-2.0Stargazers:96Issues:4Issues:13

CorefBERT

Source code for EMNLP 2020 paper "Coreferential Reasoning Learning for Language Representation"

Language:PythonLicense:MITStargazers:67Issues:7Issues:13

topiocqa

Code and data for reproducing baselines for TopiOCQA, an open-domain conversational question-answering dataset

Language:PythonLicense:NOASSERTIONStargazers:47Issues:5Issues:6

WMPoetry

Source codes of Chinese Poetry Generation with a Working Memory Model (IJCAI 2018)

ESTER

public repo for ESTER dataset and modeling (EMNLP'21)

TextVAE-pytorch

Implementation of Variational Auto-Encoder for text generation in pytorch.

Language:PythonLicense:MITStargazers:11Issues:1Issues:1