RuanVisser's starred repositories

tpunicorn

Babysit your preemptible TPUs

Language: Python | License: NOASSERTION | Stargazers: 84 | Issues: 0

faiss

A library for efficient similarity search and clustering of dense vectors.

Language: C++ | License: MIT | Stargazers: 29607 | Issues: 0

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language: Python | License: AGPL-3.0 | Stargazers: 9150 | Issues: 0

electra

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Language: Python | License: Apache-2.0 | Stargazers: 2315 | Issues: 0

cramming

Cramming the training of a (BERT-type) language model into limited compute.

Language: Python | License: MIT | Stargazers: 1263 | Issues: 0

jailbreak_llms

[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Language: Jupyter Notebook | License: MIT | Stargazers: 1656 | Issues: 0

peS2o

Pretraining Efficiently on S2ORC!

License: Apache-2.0 | Stargazers: 126 | Issues: 0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.

Language: Python | License: Apache-2.0 | Stargazers: 12024 | Issues: 0

BERT-related-papers

BERT-related papers

Stargazers: 2029 | Issues: 0

socialreaper

Social media scraping / data collection library for Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

Language: Python | License: MIT | Stargazers: 539 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: NOASSERTION | Stargazers: 22 | Issues: 0

COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

Language: Python | License: MIT | Stargazers: 120 | Issues: 0

metro_t0

Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)

Language: Python | Stargazers: 21 | Issues: 0

wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Language: Python | License: MIT | Stargazers: 628 | Issues: 0

fzf

🌸 A command-line fuzzy finder

Language: Go | License: MIT | Stargazers: 62444 | Issues: 0

cheat.sh

the only cheat sheet you need

Language: Python | License: MIT | Stargazers: 37881 | Issues: 0

BiBERT

This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation".

Language: Python | License: MIT | Stargazers: 31 | Issues: 0

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | Stargazers: 18258 | Issues: 0

Indic-BERT-v1

Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian English. For the latest Indic-BERT v2, see: https://github.com/AI4Bharat/IndicBERT

Language: Python | License: MIT | Stargazers: 274 | Issues: 0

ltg-bert

LTG-Bert

Language: Python | License: GPL-3.0 | Stargazers: 25 | Issues: 0

vega

A visualization grammar.

Language: JavaScript | License: BSD-3-Clause | Stargazers: 10990 | Issues: 0

JARVIS

JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language: Python | License: MIT | Stargazers: 23394 | Issues: 0

transformer-alignment

Code for the EMNLP 2020 paper "Accurate Word Alignment Induction from Neural Machine Translation"

Language: Python | License: MIT | Stargazers: 2 | Issues: 0

the-algorithm

Source code for Twitter's Recommendation Algorithm

Language: Scala | License: AGPL-3.0 | Stargazers: 61772 | Issues: 0

the-algorithm-ml

Source code for Twitter's Recommendation Algorithm

Language: Python | License: AGPL-3.0 | Stargazers: 9973 | Issues: 0

FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 12701 | Issues: 0

massive

Tools and Modeling Code for the MASSIVE dataset

Language: Python | License: NOASSERTION | Stargazers: 536 | Issues: 0

DeBERTa

The implementation of DeBERTa

Language: Python | License: MIT | Stargazers: 1927 | Issues: 0

KoBERT

Korean BERT pre-trained cased (KoBERT)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1258 | Issues: 0