Beast code in Giters

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python | License: Apache-2.0 | 12024 | 0 | 0

BERT-related-papers

BERT-related papers

2029 | 0 | 0

socialreaper

Social media scraping / data collection library for Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

Language: Python | License: MIT | 539 | 0 | 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: NOASSERTION | 22 | 0 | 0

COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

Language: Python | License: MIT | 120 | 0 | 0

metro_t0

Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)

Language: Python | 21 | 0 | 0

wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Language: Python | License: MIT | 628 | 0 | 0

fzf

🌸 A command-line fuzzy finder

Language: Go | License: MIT | 62444 | 0 | 0

cheat.sh

the only cheat sheet you need

Language: Python | License: MIT | 37881 | 0 | 0

BiBERT

This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation".

Language: Python | License: MIT | 31 | 0 | 0

a-pretrainers-guide

73 | 0 | 0

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | 18258 | 0 | 0

Indic-BERT-v1

Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT

Language: Python | License: MIT | 274 | 0 | 0