ET-Chan / th4j

TH4J- A wrapper of torch TH library for Java (JVM langauges).

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TH4J

A wrapper of torch TH library for Java (and other JVM langauges), implemented in Scala.

Prerequisites

  1. Torch 7 (libTH.so in LD_LIBRARY_PATH)
  2. JVM 8
  3. cutorch (optional, libTHC.so in LD_LIBRARY_PATH)
  4. sbt 0.13.8 (for building)
  5. jnlua (optional, for Lua interpolation)

Building

  1. git clone https://github.com/ET-Chan/th4j.git
  2. cd th4j && sbt compile
  3. cd lua && luarocks install th4j-0.1-1.rockspec (only for lua interpolation)

Setup

Bintray

Add following lines into your build.sbt

resolvers += Resolver.bintrayRepo("et-chan", "maven")
libraryDependencies +=
  ("th4j-core" %% "th4j-core" % "0.1")

Compile

Execute sbt publishLocal to publish the compiled library into your local ivy repos, then add

libraryDependencies += ("th4j-core" %% "th4j-core" % "0.1")

into you build.sbt.

#Usage The wrapper aims to mimic the APIs in torch, in an object oriented and type-safe way, as much as possible. The main components in th4j are exactly the ones in torch: Storage and Tensor, while storage is the internal holder for tensor data. Similarly, there are six types of tensors(storages) available for use: Double, Float, Int, Long, Short, Char. (Cuda & OpenCL are yet to be implemented.)

##Construction For the purpose of illustration, we will only focus on playing with DoubleTensor in scala. Java users should be able to use the libraries with similar syntax, except the querying part.

First of all, import the libraries:

import th4j.Tensor._

To construct a new tensor:

val t = new DoubleTensor()

, a new tensor with specified sizes:

val t = new DoubleTensor(sz1, [sz2, [sz3, [sz4]])

, or a new tensor viewing the same storage of another tensor

val t1 = new DoubleTensor(3, 4)
val t2 = new DoubleTensor(t1)

##Tensor Querying Due to the syntax differences between Lua and Scala, it is unlikely to implement an identical querying way of tensor as in torch. Instead, another operator !: is introduced to support range querying. The following table shows the comparison of them, supposing t is an instance of DoubleTensor.

Torch th4j
Extracting sub tensors 1 t[i] t(i - 1)
Extracting sub tensors 2 t[i][j] t(i - 1)(j - 1)
Extracting sub tensors 3 t[{i, j}] t( i-1 !: j-1)
Range query t[{{i, j}, {k, f}, {g, h}] t((i-1, j)!:(k-1, f)!:(g-1, h))
Range query with negative index t[{{i, -1}, {k, -2}}] t((i-1, -1)!:(k-1, -2))
Range query with singleton dimension t[{{i, i}, {k, f}}] t(i!:(k-1, f))

You may notice that there is an annoying ``-1" for almost every coordinate. In fact, th4j uses the convention 0-based indexing scheme. Another point worth noticing is that the ranges in th4j are left inclusive and right exclusive, as shown above. Note that since the querying relies heavily on Scala Implicits, java users has to use a list of tuple to do range query.

##Tensor Manipulation In torch, one can flexibly change the view, or shape of a tensor. th4j supports most of them, while mathematics manipulations of the tensors are still being implemented.

For example, one may want to change the view of a tensor and expand it. In torch, it is

t = torch.Tensor(3, 4):view(3, 1, 4):expand(3, 10, 4)

To achieve the same in th4j, one can

val t = new DoubleTensor(3, 4).view(3, 1, 4).expand(3, 10, 4)

Experienced torch users may find the API very familiar. However, there are some exceptions. Say we want to create a random tensor. In torch, you will

t = torch.rand(3, 4)

In th4j, you have to

val t = new DoubleTensor(3, 4).rand()

Similar differences occur in eye, zero etc. Refer to the Tensor.scala to know more.

##Lua Interpolation th4j provides an easy way to construct an adaptor to call Lua function. The binding supports following calling arguments and return types. This functionality needs jnlua.

  1. Int, Double, Float, Char, Byte, Short, String, Long
  2. DoubleTensor, FloatTensor, CharTensor, ShortTensor, LongTensor, IntTensor, ByteTensor
  3. DoubleStorage, FloatStorage, CharStorage, ShortStorage, LongStorage, IntStorage, ByteStorage
  4. List of the above (W.I.P)

Say we have the following Lua function, stored in test.lua, that we want to call in java.

function test(tensor1, tensor2, str)
    print(str)
    return tensor1:size(), tensor2:type()
end

Then, one can invoke following functions to bind the lua function easily

val L = new LuaState()
L.openLibs()
L.load("""require 'th4j' """, "=wrapper")
L.call(0,0)
L.load(Files.newInputStream(Paths.get("test.lua"))
  , "=test")
L.call(0, 0)
val javaFunc = LuaFunction.create[(Tensor, Tensor, String)=>(LongStorage, String)]("test", L)
val tensor1 = new DoubleTensor()
val tensor2 = new IntTensor()
val dummyStr = "Hello from Java"
javaFunc()(tensor1, tensor2, dummyStr)

This API supports tensor binding, and do reference counting automatically under the hood. It supports multiple Lua return values by Scala.Tuple.

Warning

The package is in its early development, and the API may change in future release.

Third party libraries

  1. Torch 7 (All rights reserved to Idiap Research Institute and Deepmind Technologies)
  2. jnlua (MIT license, by Andre Naef).

License

MIT License

About

TH4J- A wrapper of torch TH library for Java (JVM langauges).

License:MIT License


Languages

Language:Scala 96.7%Language:Lua 3.0%Language:CMake 0.3%