rockyzhengwu / difacto

Distributed Factorization Machines

Fast and memory efficient library for factorization machines (FM).

  • Supports both ℓ1 regularized logistic regression and factorization machines.
  • Runs on a local machine and on distributed clusters.
  • Scales to datasets with billions of examples and features.

Quick Start

The following commands clone and build difacto, download a sample dataset, and train an FM with 2-dimensional embeddings (V_dim=2) on it.

git clone --recursive https://github.com/dmlc/difacto
cd difacto; git submodule update --init; make -j8
./tools/download.sh gisette
build/difacto data_in=data/gisette_scale val_data=data/gisette_scale.t lr=.02 V_dim=2 V_lr=.001
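For readers unfamiliar with what the embedding dimension controls, the sketch below computes the standard second-order FM score for one sparse example. It is an illustration only, not difacto's implementation: `Feature`, `FMModel`, `fm_score`, and `fm_probability` are hypothetical names, and the logistic link in `fm_probability` is an assumption that matches the usual binary classification setup. Each feature has a linear weight and an embedding vector whose length plays the role of V_dim.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// One nonzero entry of a sparse example: feature index and value.
struct Feature {
  std::size_t index;
  double value;
};

// Hypothetical model container (not difacto's actual data structure).
struct FMModel {
  double bias;                         // w_0
  std::vector<double> w;               // linear weights, one per feature
  std::vector<std::vector<double>> V;  // embeddings; V[i] has V_dim entries
};

// Raw FM score: w_0 + sum_i w_i x_i + sum_{i<j} <V_i, V_j> x_i x_j.
// The pairwise term is evaluated in O(nnz * V_dim) using the identity
// 0.5 * sum_k [ (sum_i V_ik x_i)^2 - sum_i (V_ik x_i)^2 ].
double fm_score(const FMModel& m, const std::vector<Feature>& x) {
  double score = m.bias;
  for (const Feature& f : x) {
    score += m.w[f.index] * f.value;
  }
  const std::size_t dim = m.V.empty() ? 0 : m.V[0].size();
  for (std::size_t k = 0; k < dim; ++k) {
    double sum = 0.0;
    double sum_sq = 0.0;
    for (const Feature& f : x) {
      const double t = m.V[f.index][k] * f.value;
      sum += t;
      sum_sq += t * t;
    }
    score += 0.5 * (sum * sum - sum_sq);
  }
  return score;
}

// Assumed logistic link: maps the raw score to a probability in (0, 1).
double fm_probability(const FMModel& m, const std::vector<Feature>& x) {
  return 1.0 / (1.0 + std::exp(-fm_score(m, x)));
}
```

With V_dim=2, every feature carries two embedding values in addition to its linear weight, so the model can capture pairwise feature interactions at a small memory cost. Setting the embedding dimension to zero in this sketch reduces it to plain logistic regression.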

History

Originated from wormhole/learn/difacto.

(NOTE: this project is still under development.)

References

Mu Li, Ziqi Liu, Alex Smola, and Yu-Xiang Wang. DiFacto: Distributed Factorization Machines. In WSDM, 2016.
