language-modeling

There are 16 repositories under language-modeling topic.

quark0 / darts
Differentiable architecture search for convolutional and recurrent networks
deep-learning automl image-classification language-modeling pytorch convolutional-networks recurrent-networks neural-architecture-search
Language:Python 3927
RL4LMs
allenai / RL4LMs
A modular RL library to fine-tune language models to human preferences
language-modeling nlp reinforcement-learning dialogue-generation machine-translation natural-language-processing summarization table-to-text text-generation
Language:Python 2241
EgoAlpha / prompt-in-context-learning
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
chain-of-thought chatbot chatgpt chatgpt-api cot in-context-learning language-modeling language-understanding large-language-model llm pre-training prompt prompt-based-learning prompt-design prompt-engineering prompt-learning prompt-toolkit prompt-tuning
Language:Jupyter Notebook 1532
lingua-go
pemistahl / lingua-go
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
natural-language-processing language-detection language-recognition language-classification language-identification language-processing nlp nlp-machine-learning golang-library go language-modeling text-processing
Language:Go 1193
uber-research / PPLM
Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.
deep-learning language-modeling machine-learning natural-language-generation natural-language-processing nlp
Language:Python 1135
Separius / BERT-keras
Keras implementation of BERT with pre-trained weights
keras language-modeling nlp pretrained-models tensorflow theano transfer-learning transformer
Language:Python 813
meta-toolkit / meta
A Modern C++ Data Sciences Toolkit
c-plus-plus graph-algorithms inverted-index language-modeling nlp nlp-parsing pos-tag search-engine text-analysis text-analytics text-classification word-embeddings
Language:C++ 698
songlab-cal / tape
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.
benchmark dataset deep-learning language-modeling protein-sequences protein-structure pytorch semi-supervised-learning
Language:Python 673
DmitryRyumin / INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
interspeech speech-technology machine-translation speech-synthesis asr prosody self-supervised-learning speech-production transmission acoustic adaptation signal-processing speech-recognition audio-signals speech-analysis linguistic-analysis language-modeling lexical-analysis interspeech2023 interspeech2024
648
hirofumi0810 / neural_sp
End-to-end ASR/LM implementation with PyTorch
pytorch speech-recognition automatic-speech-recognition asr ctc attention-mechanism attention seq2seq sequence-to-sequence speech language-model transformer language-modeling rnn-transducer transformer-xl streaming
Language:Python 595
google-deepmind / long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
benchmark dataset evaluation factuality language language-modeling large-language-models metrics
Language:Python 565
jeffhj / LM-reasoning
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
artificial-intelligence awesome-list chain-of-thought chatgpt deductive-reasoning gpt-3 human-intelligence in-context-learning language-modeling language-models large-language-models llm natural-language-processing paper-list prompt-engineering prompt-learning reasoning
546
yxuansu / SimCTG
[NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation
contrastive-learning decode deeplearning language-modeling languagemodel nlp textgeneration
Language:Python 467
DmitryRyumin / ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
asr denoising domain-adaptation face-recognition icassp icassp2023 keyword-spotting language-modeling self-supervised-learning semantic-segmentation signal-processing signal-restoration speech-recognition vad generative-models image-generation music-generation spoken-language-understanding multimodal-learning icassp2024
Language:Python 418
majumderb / rezero
Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"
deep-neural-networks language-modeling pytroch resnet transformer
Language:Python 407
madaan / memprompt
A method to fix GPT-3 after deployment with user feedback, without re-training.
feedback few-shot-learning gpt-3 gpt-3-prompt language-generation language-modeling prompt-tuning user-feedback
Language:Python 328
shmsw25 / FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
factuality language language-modeling evaluation emnlp2023
Language:Python 304
PyxLSTM
muditbhargava66 / PyxLSTM
Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.
language-modeling lstm sequence-modeling xlstm
Language:Python 278
UIC-Liu-Lab / ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs)
catastrophic-forgetting continual-learning knowledge-transfer language-model domain-adaptive-pretraining transfer-learning transformer-architecture natural-language-processing language-modeling
Language:Python 262
L0SG / relational-rnn-pytorch
An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.
deep-learning deepmind language-model language-modeling pytorch recurrent-neural-networks self-attention transformer word-language-model
Language:Python 245
somosnlp / nlp-de-cero-a-cien
Curso práctico: NLP de cero a cien 🤗
attention-mechanism espanol language-modeling lstm natural-language-processing nlp rnn transfer-learning transformers word-embeddings
Language:Jupyter Notebook 185
rusiaaman / XLnet-gen
XLNet for generating language.
gpt2 language-generation language-model language-modeling text transformer-xl xlnet
Language:Python 165
tonybeltramelli / Deep-Lyrics
Lyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network
deep-learning recurrent-neural-networks tensorflow natural-language-processing language-modeling
Language:Python 146
Sunnydreamrain / IndRNN_pytorch
Independently Recurrent Neural Networks (IndRNN) implemented in pytorch.
indrnn rnn action skeleton language-modeling
Language:Python 133
suriyadeepan / rnn-from-scratch
Use tensorflow's tf.scan to build vanilla, GRU and LSTM RNNs
language-modeling recurrent-neural-networks rnn tensorflow
Language:Python 129
uzaymacar / comparatively-finetuning-bert
Comparatively fine-tuning pretrained BERT models on downstream, text classification tasks with different architectural configurations in PyTorch.
natural-language-understanding language-modeling fine-tuning imdb-dataset bert attention-visualization pytorch
Language:Python 121
songlab-cal / tape-neurips2019
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology. (DEPRECATED)
benchmark dataset deep-learning language-modeling protein-sequences protein-structure semi-supervised-learning
Language:Python 118
DRSY / EMO
[ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)
language-modeling
Language:Python 115
flatironinstitute / deepblast
Neural Networks for Protein Sequence Alignment
structural-alignments neural-networks sequence-alignment language-modeling protein-sequences protein protein-structure
Language:Python 114
geyingli / unif
基于 Tensorflow，仿 Scikit-Learn 设计的深度学习自然语言处理框架。支持 40 余种模型类，涵盖语言模型、文本分类、NER、MRC、知识蒸馏等各个领域
bert classification deep-learning distillation gpu-training language-modeling mrc nlp tensorflow transformer
Language:Python 114
referit3d / referit3d
Code accompanying our ECCV-2020 paper on 3D Neural Listeners.
deep-learning multimodal-deep-learning language-modeling rgbd computer-vision computer-graphics
Language:C++ 113
kmario23 / KenLM-training
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
automatic-speech-recognition deep-neural-networks deep-speech kenlm kenlm-toolkit language-model language-modeling natural-language-processing probabilistic-models python speech-recognition
112
jiali-ms / JLM
A fast LSTM Language Model for large vocabulary language like Japanese and Chinese
tensorflow language-modeling lstm decoder deep-neural-networks beam-search viterbi
Language:Python 109
lucidrains / gated-state-spaces-pytorch
Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch
artificial-intelligence deep-learning language-modeling state-spaces
Language:Python 97
p-lambda / incontext-learning
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"
dataset few-shot-learning gpt-3 in-context-learning language-modeling
Language:Python 97
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
language-modeling llama transformers
Language:Python 93