grouplens/samantha

A generic recommender and predictor server for both offline machine learning and recommendation modeling and fast online production serving.
MIT licence, oriented to production use (online field experiments in research and typical industrial use)

Full-fledged, self-contained server that can be used in production right away with one configuration file, including the following components
Data management, including offline and online, in (indexing) and out (post-processing), through configurable backends of most relational databases (e.g. MySQL, PostresSQL, SQLServer etc.), ElasticSearch or Redis.
Model management, including online updating, building, loading, dumping and serving.
Data processing pipeline based on a data expanding and feature extraction framework
State-of-the-art models: collaborative filtering, matrix factorization, knn, trees, boosting and bandits/reinforcement learning
Experimental framework for randomized A/B and bucket testing
Feedback (for online learning/optimization) and evaluation (for experimenting) loops among application front-end, application back-end server and Samantha
Abstracted model parameter server (through extensible variable and index spaces)
Generic oracle-based optimization framework/solver with classic solvers
Flexible model dependency, e.g. model ensemble, stacking, boosting
Schedulers for regular model rebuilding or backup
Integration with other state-of-the-art systems including XGBoost and TensorFlow.
Control and customize all these components through one centralized configuration file

Individuals or organizations who want to deploy a data-driven predictive system with minimum effort. They might need it to support answering relevant research questions involving an intelligent predictive part in their system or just to have an initial try to see the effects of such a predictive component.
Individuals or organizations who are working on comparing and developing new machine learning or recommendation models or algorithms, especially those who care about deploying their models/algorithms into production and evaluate them in front of end users

Qian Zhao. 2018. User-Centric Design and Evaluation of Online Interactive Recommender Systems. Ph.D. Thesis. University of Minnesota.

Samantha is a project developed by Qian Zhao, Ph.D. at GroupLens Research lab (graduated on May 2018) and originated from his research projects there. Samantha might be integrated with Lenskit in future.

Qian Zhao, Yue Shi, Liangjie Hong. GB-CENT: Gradient Boosted Categorical Embedding and Numerical Trees. In Proceedings of the 26th International World Wide Web conference (WWW 2017), ACM, 2017. (see branch qian/gbcent, docs/README.md for details)
Qian Zhao, Jilin Chen, Minmin Chen, Sagar Jain, Alex Beutel, Francois Belletti, Ed Chi. 2018. Categorical-Attributes-Based Item Classification for Recommender Systems. In Proceedings of The 12th ACM Conference on Recommender Systems (RecSys’18). ACM, New York, NY, USA. (see branch qian/hsm for details)
Qian Zhao, Martijn Willemsen, Gediminas Adomavicius, F. Maxwell Harper, Joseph A. Konsta. 2019. From Preference Into Decision Making: Modeling User Interactions in Recommender Systems. In Proceedings of The 13th ACM Conference on Recommender Systems (RecSys’19). ACM, New York, NY, USA. (see branch qian/interaction for details)
Qian Zhao, F. Maxwell Harper, Gediminas Adomavicius, Joseph Konstan. Explicit or Implicit Feedback? Engagement or Satisfaction? A Field Experiment on Machine-Learning-Based Recommender Systems. In Proceedings of the 33rd ACM/SIGAPP Symposium On Applied Computing, Track of Recommender Systems: Theory, User Interactions and Applications (SAC 2018), ACM, 2018. (see Reinforce-State, Bandit-*, MF-*)

About

Samantha - A generic recommender and predictor server

MIT License

Language:Java 93.2%Language:Python 6.2%Language:Scala 0.3%Language:HTML 0.2%Language:JavaScript 0.2%Language:CSS 0.0%