suzgunmirac / namedtensor

Proof of concept for a dynamic named tensor for pytorch

Home Page: http://nlp.seas.harvard.edu/NamedTensor


NamedTensor

Introduction

A proposal for a named tensor for PyTorch, described in the blog post:

http://nlp.seas.harvard.edu/NamedTensor

Currently the library targets the PyTorch ecosystem.

Usage

from namedtensor import ntorch

Building tensors

All PyTorch builders take an extra keyword argument, names.

x = ntorch.randn(10, 10, 20, names=("batch", "h", "w"))
x = ntorch.ones(10, 10, 20, names=("batch", "h", "w"))
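
A named tensor's shape maps each name to its size. A minimal check (the OrderedDict-style repr shown here is an assumption, following the blog post's description):

x.shape
# OrderedDict([('batch', 10), ('h', 10), ('w', 20)])  -- assumed output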

Standard functions

All functions that preserve dimensionality work in the same way.

x = x.log()
x = x.float()
x = ntorch.exp(x)
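# names ride along unchanged: x still has dims ("batch", "h", "w")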

View and Transposition

View, transpose, and friends are deprecated in favor of named access and movement.

x = x.stack(("w", "h"), "stackdim")

# Roundtrip

x = x.split("stackdim", ("w", "h"), w=20)
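# stack merges "w" and "h" into one "stackdim" dimension;
# split undoes it, and w=20 supplies the size of the restored "w" dim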

Transposition (discouraged in the API):

x = x.transpose("batch", "w", "h")

# or, naming only some dims: earlier dims stay in place

x = x.transpose("w", "h")

Dim replacements:

Any function that takes a dim argument can now be called with the dimension's name instead.

x = x.narrow("w", 0, 10)
x = x.softmax("w")
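# narrow keeps the first 10 entries along "w"; softmax normalizes over "w"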

This is true of reduction functions as well, where the named dimension is eliminated.

x = x.mean("w")
x, argmax = x.max("w")
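# "w" is reduced away; argmax holds the indices of the maxima along "w"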

Tensor contractions

Matrix operations also take dimension arguments. Because names persist, contractions that would otherwise need einsum can be written directly.

x = ntorch.randn(10, 10, 20, names=("batch", "h", "w"))
y = ntorch.randn(10, 20, 30, names=("batch", "w", "c"))
x.dot("w", y)

This also makes indexing much easier to read.

x = ntorch.ones(10, 10, 20, names=("batch", "time", "vocab"))
y = ntorch.randn(20, 30, names=("vocab", "embsize"))
y.index_select("vocab", x)
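# an embedding-style lookup: presumably "vocab" in y is replaced by x's dims,
# yielding names ("batch", "time", "embsize")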

NN Modules

This part of the API is a work in progress, but many modules are implemented to work with named tensors. Each has an additional required method, spec, that specifies the named input and output dimensions of the module.

Examples

# Convolution: spec presumably names the input-channel, convolved, and output-channel dims
conv = ntorch.nn.Conv1d(5, 10, 2).spec("input", "time", "output")
n = ntorch.randn(20, 30, 5, names=("batch", "time", "input"))
out = conv(n)

# Dropout needs no spec, since it preserves every dimension
drop = ntorch.nn.Dropout()
n = ntorch.randn(4, 20, names=("batch", "target"))
out = drop(n)

# Loss: spec names the dimension holding the class scores
loss = ntorch.nn.NLLLoss().spec("target")
predict = ntorch.randn(20, 4, names=("target", "batch"))
target = ntorch.tensor([2, 2, 3, 4], ["batch"])
out = loss(predict, target)

Other Goodies

  • Named Distributions library

Documentation

http://nlp.seas.harvard.edu/namedtensor/

Contributors

(NamedTensor is being collectively developed by Harvard CS 287)

  • Yuntian Deng
  • Francisco Rivera
  • Jiafeng Chen
  • Celine Liang
  • Miro Furtado
  • Roshan Padaki
  • Mirac Suzgun
  • Belén Saldías
  • Jason Ren

License: MIT License


Languages

Jupyter Notebook 83.8%, Python 15.9%, Smarty 0.3%, Shell 0.0%