Niklas's repositories
promptsource
Toolkit for creating, sharing and using natural language prompts.
matrixshapes
Language modelling task to infer the shapes of matrices; one of the most difficult tasks for models like GPT-3 and GPT-J
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
gritlm
Generative Representational Instruction Tuning
HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
licensed-pile
Repo to hold code and track issues for the collection of permissively licensed data
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
pinyin2hanzi
Convert pinyin to Chinese characters (hanzi) using a Hidden Markov Model and the Viterbi algorithm
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
udacity-dl
Udacity Deep-Learning Nanodegree 2020