There are 2 repositories under penn-treebank topic.
Recurrent Highway Networks - Implementations for Tensorflow, Torch7, Theano and Brainstorm
Boilerplate code for quickly getting set up to run language modeling experiments
A Tensorflow 2, Keras implementation of POS tagging using Bidirectional LSTM-CRF on Penn Treebank corpus (WSJ)
Language modeling on the Penn Treebank (PTB) corpus using a trigram model with linear interpolation, a neural probabilistic language model, and a regularized LSTM.
An implementation of WaveNet using PyTorch & PyTorch Lightning
Build a recurrent neural network using TensorFlow and Keras.
Experiments of developing an IRTG which simultaneously encodes transformations between phrase structure trees, dependency graphs and semantic graphs.
Turkish tree translation of 9561 Penn-Treebank trees (Number of leaves <= 15) and syntactic and semantic annotations of them
Parser for treebanks based on Penn Treebank type of encoding that generates Probabilistic Context Free Grammars
nltk utility which more accurately lemmatizes text using pre-trained part-of-speech tagger.
We use Bi-LSTM to learn to tag the Parts of Speech in a sentence using NLTK Brown corpus Dataset.
LSTM word level language model implementation in tensorflow and pytorch
NLP: HMMs and Viterbi algorithm for POS tagging
Reproduction of CIFAR-10/CIFAR-100 and Penn Treebank experiments to test claims in "LookaheadOptimizer: k steps forward, 1 step back" https://arxiv.org/abs/1907.08610
Reproduction of CIFAR-10/CIFAR-100 and Penn Treebank experiments to test claims in "LookaheadOptimizer: k steps forward, 1 step back" https://arxiv.org/abs/1907.08610
A parser that maps HanLP dependencies to Stanford Typed dependencies for Chinese.
To incorporate context in the task of sentence relation prediction.
A Web Application which on input of sentence gives the info of POS Tagger of the different words
Mini project 2 for NLP class
This repository contains Natural Language Processing programs in the Python programming language.
A lightweight (small and dependency-free) Java 8 library for Penn-like tokenization.
This source code converts a given corpus in the PennTreebank format to the DCG format, being appropriate to run in Prolog.