ml-resources
- Agile in DS - https://eugeneyan.com/writing/data-science-and-agile-what-works-and-what-doesnt/
- Data Storytelling - https://shopifyengineering.myshopify.com/blogs/engineering/data-storytelling-shopify
- Data scientist storytelling technical presentation - https://www.susanshu.com/data-scientist-storytelling-technical-presentation
- ML interview book - https://github.com/chiphuyen/ml-interviews-book
- Problem Framing - https://developers.google.com/machine-learning/problem-framing
- Rules of ML - https://developers.google.com/machine-learning/guides/rules-of-ml
- ML Education at Uber: Frameworks Inspired by Engineering Principles - https://www.uber.com/en-PL/blog/ml-education-at-uber/
- 5 Essential Management Strategies For A Data Science Project - https://medium.com/analytics-vidhya/5-essential-management-strategies-for-a-data-science-project-d38e9c850aeb
- Machine Learning that Matters 2012
- Understanding UMAP
Python
- https://towardsdatascience.com/best-practices-for-setting-up-a-python-environment-d4af439846a
- https://towardsdatascience.com/data-scientists-guide-to-efficient-coding-in-python-670c78a7bf79
- https://www.mihaileric.com/posts/setting-up-a-machine-learning-project/
- https://towardsdatascience.com/on-writing-clean-jupyter-notebooks-abdf6c708c75
- https://venthur.de/2021-06-26-python-packaging.html
- https://medium.com/@jessicachenfan/taming-your-python-dictionaries-with-dataclasses-marshmallow-and-desert-388dbffedaec
ML design patterns
- Design patterns (a software point of view) - https://eugeneyan.com/writing/design-patterns/
- ML design patterns - https://github.com/msaroufim/ml-design-patterns
- ML design patterns (exercises from the book) - https://github.com/GoogleCloudPlatform/ml-design-patterns
- Rules of Machine Learning (Google) - https://developers.google.com/machine-learning/guides/rules-of-ml
MLOps
Open source resources to learn about MLOps:
- Made with ML by Goku Mohandas: https://madewithml.com/
- Machine Learning Systems Design by Chip Huyen: https://stanford-cs329s.github.io/ | book
- Part of NYU course by Jacopo Tagliabue: https://github.com/jacopotagliabue/FREE_7773
- The Machine Learning Engineering book by Andriy Burkov http://www.mlebook.com/
- Full Stack Deep Learning course https://fullstackdeeplearning.com/
- Metaflow tutorials
- Building a Machine Learning Platform [Definitive Guide]
Blogs
- Reproducible deep learning course - https://www.sscardapane.it/teaching/reproducibledl/
- https://services.google.com/fh/files/misc/practitioners_guide_to_mlops_whitepaper.pdf
- https://cloud.google.com/architecture/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning
- https://medium.com/eliiza-ai/getting-started-with-mlops-d10301cef521
    - Improvement 1: Reproducibility
    - Improvement 2: Modularity
    - Improvement 3: Centralised Caching
    - Improvement 4: Scalability
- https://medium.com/geekculture/enhancing-kubeflow-with-mlflow-8983373d0cac
- mlflow + kubeflow
- https://winder.ai/how-to-build-a-robust-ml-workflow-with-pachyderm-and-seldon/
- https://gradientflow.com/machine-learning-model-monitoring/
- Model Monitoring Enables Robust Machine Learning Applications
- https://www.ambiata.com/blog/2020-12-07-mlops-tools/
- https://databaseline.tech/ml-cards/
- https://towardsdatascience.com/mlflow-part-2-deploying-a-tracking-server-to-minikube-a2d6671e6455
- https://medium.com/ibm-data-ai/automate-your-machine-learning-workflow-tasks-using-elyra-and-apache-airflow-adf297adc455
- https://medium.com/everything-full-stack/machine-learning-model-serving-overview-c01a6aa3e823
- https://towardsdatascience.com/how-to-measure-data-quality-815076010b37
- https://huyenchip.com/machine-learning-systems-design/toc.html
- Machine Learning Production Pipeline - https://docs.google.com/presentation/d/1mvmJ1PnCe7lWGmSoL80CjLe7N2QpEwkU8x7l62BawME/edit#slide=id.g7eb0adee5f_0_854
- MLOps without much ops | Coveo series
- Breaking up with Flask & FastAPI: Why ML model serving requires a specialized framework
- Stack for Machine Learning
- The Rapid Evolution of the Canonical Stack for Machine Learning - https://opendatascience.com/the-rapid-evolution-of-the-canonical-stack-for-machine-learning/
- Navigating the MLOps tooling landscape (Part 1: The Lifecycle) - https://ljvmiranda921.github.io/notebook/2021/05/10/navigating-the-mlops-landscape/
- Introducing TWIML’s New ML and AI Solutions Guide - https://twimlai.com/solutions/introducing-twiml-ml-ai-solutions-guide/
- Papers:
- Machine Learning Operations (MLOps): Overview, Definition, and Architecture
- Frameworks
- ML platform in the industry
- An overview of gradient descent optimization algorithms
- A Review of Location Encoding for GeoAI: Methods and Applications
- A step-by-step guide to using MLFlow Recipes to refactor messy notebooks
- Streamlining Machine Learning Operations (MLOps) with Kubernetes and Terraform
Feature store
Serving
Drift
Monitoring and Alerting
GCP
Vertex
- Model Serving at Scale with Vertex AI: custom container deployment with pre and post processing - https://medium.com/@piyushpandey282/model-serving-at-scale-with-vertex-ai-custom-container-deployment-with-pre-and-post-processing-12ac62f4ce76
- ML Checklist — Best Practices for a Successful Model Deployment - https://medium.com/analytics-vidhya/ml-checklist-best-practices-for-a-successful-model-deployment-2cff5495efed
- Google MLOps template - https://github.com/GoogleCloudPlatform/mlops-with-vertex-ai
Algorithms / Techniques
NLP
- Complete collection of NLP Resources - https://github.com/ivan-bilan/The-NLP-Pandect
- A Complete Guide to Natural Language Processing
- Bulk labeling - https://github.com/RasaHQ/rasalit
Embeddings
- What Are Word Embeddings for Text? - https://machinelearningmastery.com/what-are-word-embeddings/
- An implementation guide to Word2Vec using NumPy and Google Sheets - https://towardsdatascience.com/an-implementation-guide-to-word2vec-using-numpy-and-google-sheets-13445eebd281
- Word2vec from Scratch - https://jaketae.github.io/study/word2vec/
- Word2Vec Tutorial - The Skip-Gram Model (2016) - http://mccormickml.com/2016/04/19/word2vec-tutorial-the-skip-gram-model/
- The Illustrated Word2vec - https://jalammar.github.io/illustrated-word2vec/
- Embeddings with Word2Vec in non-NLP Contexts (Details with papers) - https://towardsdatascience.com/embeddings-with-word2vec-in-non-nlp-contexts-details-e879b093d34d
- InferSent
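The word2vec tutorials linked above all build on the same skip-gram idea: learn a vector per word by predicting its context words. A minimal, illustrative NumPy sketch (toy corpus, dimensions, and learning rate are made up; real implementations use negative sampling instead of a full softmax):

```python
import numpy as np

rng = np.random.default_rng(0)

corpus = "the cat sat on the mat the dog sat on the rug".split()
vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}
V, D, window, lr = len(vocab), 8, 1, 0.05

W_in = rng.normal(scale=0.1, size=(V, D))   # center-word embeddings
W_out = rng.normal(scale=0.1, size=(V, D))  # context-word embeddings

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

for _ in range(100):
    for pos, word in enumerate(corpus):
        c = idx[word]
        for off in range(-window, window + 1):
            if off == 0 or not 0 <= pos + off < len(corpus):
                continue
            o = idx[corpus[pos + off]]       # observed context word
            h = W_in[c]
            p = softmax(W_out @ h)           # P(context | center) over vocab
            grad = p.copy()
            grad[o] -= 1.0                   # cross-entropy gradient w.r.t. logits
            grad_h = W_out.T @ grad
            W_out -= lr * np.outer(grad, h)
            W_in[c] -= lr * grad_h

embedding = W_in                             # rows are the learned word vectors
```

After training, rows of `embedding` can be compared with cosine similarity, exactly as the "Word2vec from Scratch" post does at larger scale.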
Word embeddings
- Papers:
- A Neural Probabilistic Language Model (2003) - https://proceedings.neurips.cc/paper/2000/file/728f206c2a01bf572b5940d7d9a8fa4c-Paper.pdf
- Efficient Estimation of Word Representations in Vector Space (2013 word2vec) - https://arxiv.org/abs/1301.3781
- Swivel: Improving Embeddings by Noticing What's Missing (2016 Google) - https://arxiv.org/pdf/1602.02215.pdf
Sentence Embedding
- Universal Sentence Encoder for English (Google 2018)
- Supervised Learning of Universal Sentence Representations from Natural Language Inference Data - InferSent (Facebook 2018)
- SentEval: An Evaluation Toolkit for Universal Sentence Representations (2018 Facebook)
- Multilingual Universal Sentence Encoder for Semantic Retrieval (Google 2019)
- Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks (2019)
- Learning Thematic Similarity Metric Using Triplet Networks / wikipedia sentences similarity
Tokenizer
- SentencePiece Tokenizer Demystified (2021) - https://towardsdatascience.com/sentencepiece-tokenizer-demystified-d0a3aac19b15
Attention
- A Guide to the Encoder-Decoder Model and the Attention Mechanism - https://betterprogramming.pub/a-guide-on-the-encoder-decoder-model-and-the-attention-mechanism-401c836e2cdb
- Visualizing A Neural Machine Translation Model (Mechanics of Seq2seq Models With Attention) - https://jalammar.github.io/visualizing-neural-machine-translation-mechanics-of-seq2seq-models-with-attention/
- Attn: Illustrated Attention - https://towardsdatascience.com/attn-illustrated-attention-5ec4ad276ee3
- Attention? Attention!
- Papers:
    - Neural machine translation by jointly learning to align and translate - https://arxiv.org/pdf/1409.0473.pdf
    - Effective Approaches to Attention-based Neural Machine Translation - https://arxiv.org/pdf/1508.04025.pdf
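The mechanism these guides and papers illustrate reduces to a softmax-weighted sum of values; a minimal scaled dot-product attention sketch in NumPy (shapes and values are arbitrary, for demonstration only):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # (n_q, n_k) similarity scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over the keys
    return weights @ V, weights                      # weighted sum of values

rng = np.random.default_rng(0)
Q = rng.normal(size=(2, 4))  # 2 queries, dim 4
K = rng.normal(size=(3, 4))  # 3 keys
V = rng.normal(size=(3, 4))  # 3 values
out, w = scaled_dot_product_attention(Q, K, V)       # out: (2, 4), w rows sum to 1
```

Each output row is a convex combination of the value rows, with weights given by query-key similarity — the "alignment" the Bahdanau and Luong papers formalize.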
Transformer
BERT uses a self-supervised loss called next sentence prediction (NSP). ALBERT replaces it with sentence order prediction (SOP), which its authors claim forces the model to learn more fine-grained details. Related variants: ELECTRA (GAN-style pretraining), DistilBERT (2019), TinyBERT (2020), MobileBERT, Longformer (hybrid local and global attention).
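The NSP/SOP distinction is just in how pretraining pairs are built; a hypothetical sketch (the document lists and sentences are made up) showing that NSP negatives come from another document while SOP negatives are the same sentences swapped:

```python
import random

def nsp_pair(doc, other_doc, rng=random):
    """BERT-style Next Sentence Prediction pair: negative = sentence from another doc."""
    i = rng.randrange(len(doc) - 1)
    if rng.random() < 0.5:
        return doc[i], doc[i + 1], 1            # true next sentence -> label 1
    return doc[i], rng.choice(other_doc), 0     # random other-doc sentence -> label 0

def sop_pair(doc, rng=random):
    """ALBERT-style Sentence Order Prediction pair: negative = same pair, swapped."""
    i = rng.randrange(len(doc) - 1)
    if rng.random() < 0.5:
        return doc[i], doc[i + 1], 1            # correct order -> label 1
    return doc[i + 1], doc[i], 0                # swapped order -> harder negative

doc = ["Sentence A.", "Sentence B.", "Sentence C.", "Sentence D."]
other_doc = ["Unrelated X.", "Unrelated Y."]
random.seed(0)
a, b, label = sop_pair(doc)
```

SOP negatives keep the topical content identical, so the model cannot solve the task by topic matching alone — the "fine-grained" claim above.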
- The Transformer Family
- Curated list of transformer (Dair)- https://github.com/dair-ai/Transformers-Recipe
- Illustrated transformer- https://jalammar.github.io/illustrated-transformer/
- Transformers Explained Visually
- (Part 1): Overview of Functionality - https://towardsdatascience.com/transformers-explained-visually-part-1-overview-of-functionality-95a6dd460452
- (Part 3): Multi-head Attention, deep dive - https://towardsdatascience.com/transformers-explained-visually-part-3-multi-head-attention-deep-dive-1c1ff1024853
- (Part 2): How it works, step-by-step - https://towardsdatascience.com/transformers-explained-visually-part-2-how-it-works-step-by-step-b49fa4a64f34
- Illustrated: Self-Attention - https://towardsdatascience.com/illustrated-self-attention-2d627e33b20a
- https://towardsdatascience.com/galerkin-transformer-a-one-shot-experiment-at-neurips-2021-96efcbaefd3e
- Dive into Deep Learning: Coding Session#5 Attention Mechanism II - https://www.youtube.com/watch?v=rRQcS1O58xk
- The Illustrated Retrieval Transformer - https://jalammar.github.io/illustrated-retrieval-transformer/
- Transformers from Scratch (Brandon Rohrer 2021) - https://e2eml.school/transformers
- Code to train a language model (Hugging Face) - https://github.com/huggingface/transformers/tree/master/examples/pytorch/language-modeling
- BERT-ology at 100 kmph - https://thenlp.space/blog/bert-ology-at-100-kmph
- Customize transformer models to your domain - https://thenlp.space/blog/customize-transformer-models-to-your-domain
- Papers:
- Attention Is All You Need- https://arxiv.org/pdf/1706.03762.pdf
- Improving Language Models by Retrieving from Trillions of Tokens (DeepMind’s RETRO (Retrieval-Enhanced TRansfOrmer) Dec 2021) - https://deepmind.com/research/publications/2021/improving-language-models-by-retrieving-from-trillions-of-tokens
- Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks (2020 ALLEN) - https://arxiv.org/pdf/2004.10964.pdf
- Natural Language Processing (NLP) for Semantic Search Online Book (pinecone.io) - https://www.pinecone.io/learn/dense-vector-embeddings-nlp/
BERT
- Explaining BERT Simply Using Sketches - https://mlwhiz.medium.com/explaining-bert-simply-using-sketches-ba30f6f0c8cb
- How to Train a BERT Model From Scratch - https://towardsdatascience.com/how-to-train-a-bert-model-from-scratch-72cfce554fc6
- LawBERT: Towards a Legal Domain-Specific BERT? - https://towardsdatascience.com/lawbert-towards-a-legal-domain-specific-bert-716886522b49
- Distillation of BERT-Like Models: The Theory - https://towardsdatascience.com/distillation-of-bert-like-models-the-theory-32e19a02641f
Distillation
BigBird
- BigBird Research Ep. 1 - Sparse Attention Basics - https://www.youtube.com/watch?v=YvA9nqPmGWg
Courses
- http://web.stanford.edu/class/cs224n/
- https://www.coursera.org/specializations/natural-language-processing
- https://github.com/dair-ai/ML-YouTube-Courses/blob/main/README.md
RecSys
- See more here
Reinforcement learning
Next best action
- NBA - https://blog.griddynamics.com/building-a-next-best-action-model-using-reinforcement-learning/
- Next-Best-Action Recommendation https://ambiata.com/blog/2020-09-21-next-best-action-concepts/
- Bandits - https://eugeneyan.com/writing/bandits/
- Contextual bandits for ads recommendations - https://bytes.swiggy.com/contextual-bandits-for-ads-recommendations-ec210775fcf
- HuggingFace Deep Reinforcement Learning course - https://github.com/huggingface/deep-rl-class
Frameworks
- ReAgent (Facebook) - https://github.com/facebookresearch/ReAgent
- Open Bandit Pipeline - https://github.com/st-tech/zr-obp
Graph
- Knowledge Graphs in Natural Language Processing @ ACL 2021 - https://towardsdatascience.com/knowledge-graphs-in-natural-language-processing-acl-2021-6cac04f39761
- Graph ML in 2022: Where Are We Now? - https://towardsdatascience.com/graph-ml-in-2022-where-are-we-now-f7f8242599e0
Time Series
- https://towardsdatascience.com/temporal-convolutional-networks-the-next-revolution-for-time-series-8990af826567
- https://towardsdatascience.com/introducing-pytorch-forecasting-64de99b9ef46
- IJCAI 2021 Tutorial: Modern Aspects of Big Time Series Forecasting
- M4 Forecasting Competition: Introducing a New Hybrid ES-RNN Model (Uber) - https://eng.uber.com/m4-forecasting-competition/
- Interpretable Deep Learning for Time Series Forecasting (Google) - https://ai.googleblog.com/2021/12/interpretable-deep-learning-for-time.html
- Anomaly detection on time series
Papers:
- N-BEATS: Neural basis expansion analysis for interpretable time series forecasting - https://openreview.net/pdf?id=r1ecqn4YwB
- Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting - https://arxiv.org/pdf/1907.00235.pdf
- Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting - https://arxiv.org/pdf/1912.09363.pdf
Learn to rank
- https://medium.com/swlh/ranknet-factorised-ranknet-lambdarank-explained-implementation-via-tensorflow-2-0-part-i-1e71d8923132
- https://bytes.swiggy.com/learning-to-rank-restaurants-c6a69ba4b330?gi=b000dfdf0130
- https://bendersky.github.io/res/TF-Ranking-ICTIR-2019.pdf
Search
ONESHOT
- https://medium.com/@crimy/one-shot-learning-siamese-networks-and-triplet-loss-with-keras-2885ed022352
- https://medium.datadriveninvestor.com/nlp-in-healthcare-entity-linking-48845a762ed7
- https://bytes.swiggy.com/find-my-food-semantic-embeddings-for-food-search-using-siamese-networks-abb55be0b639 (Michel)
- https://towardsdatascience.com/interpreting-semantic-text-similarity-from-transformer-models-ba1b08e6566c
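The siamese-network posts above all train with a triplet loss; a minimal NumPy sketch (the margin value is illustrative): pull the anchor toward a positive example and push it from a negative by at least `margin`.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.2):
    """max(d(a, p) - d(a, n) + margin, 0), averaged over the batch."""
    d_pos = np.linalg.norm(anchor - positive, axis=-1)  # anchor-positive distance
    d_neg = np.linalg.norm(anchor - negative, axis=-1)  # anchor-negative distance
    return np.maximum(d_pos - d_neg + margin, 0.0).mean()
```

Once the negative is farther than the positive by more than the margin, the triplet contributes zero loss, so training focuses on the hard triplets.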
Contrastive learning (supervised / self-supervised)
Contrastive learning is a self-supervised, task-independent deep learning technique that allows a model to learn about data, even without labels.
- Understanding Contrastive Learning - https://towardsdatascience.com/understanding-contrastive-learning-d5b19fd96607
- Contrastive Representation Learning - https://lilianweng.github.io/lil-log/2021/05/31/contrastive-representation-learning.html
- Introduction to Dense Text Representations - https://www.youtube.com/watch?v=t4Gf4LruVZ4&list=PL7kaex1gKh6BDLHEwEeO45wZRDm5QlRil
- Global and local structure of the vector space
- Losses: Multiple Negatives Ranking Loss (training with in-batch negatives; InfoNCE or NT-Xent loss) / Batch Hard Triplet Loss / Triplet Loss / Contrastive Loss / CosineSimilarity Loss
- The InfoNCE loss in self-supervised learning (deeplearning) - https://crossminds.ai/video/the-infonce-loss-in-self-supervised-learning-606fef0bf43a7f2f827c1583/
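The in-batch-negatives idea behind InfoNCE / NT-Xent / Multiple Negatives Ranking Loss can be sketched in a few lines of NumPy (temperature and dimensions are illustrative): each anchor's positive is the same-index row, and every other row in the batch serves as a negative.

```python
import numpy as np

def info_nce(anchors, positives, temperature=0.05):
    """Cross-entropy over cosine similarities with the diagonal as targets."""
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = a @ p.T / temperature                    # (batch, batch) similarities
    logits -= logits.max(axis=1, keepdims=True)       # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))               # target class = diagonal

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 16))
loss_aligned = info_nce(x, x)                         # positives match anchors
loss_random = info_nce(x, rng.normal(size=(4, 16)))   # unrelated "positives"
```

When anchors and positives agree, the diagonal dominates the softmax and the loss approaches zero; with random pairs it stays high — which is exactly what pushes paired representations together during training.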
- Papers:
- 2019
- 2020
- 2022
- https://github.com/voidism/DiffCSE
Others
- Rethinking Pre-training and Self-training
- Confident Learning: Estimating Uncertainty in Dataset Labels
Applied ML in industry (papers)
Product categorization
- Deep Learning: Product Categorization and Shelving - https://medium.com/walmartglobaltech/deep-learning-product-categorization-and-shelving-630571e81e96
- Semantic Vector Search: Tales from the Trenches - https://medium.com/grensesnittet/semantic-vector-search-tales-from-the-trenches-fa8b61ea3680
Attribute extraction in e-commerce
Product matching
Papers:
- Product Matching in eCommerce using deep learning (medium)
- Neural Network based Extreme Classification and Similarity Models for Product Matching (Ebay)
- BERT-based similarity learning for product matching
- Deep Entity Matching with Pre-Trained Language Models (Megagon Labs)
- Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?
Entity matching
- DeepMatcher https://github.com/anhaidgroup/deepmatcher
- DeepER http://www.vldb.org/pvldb/vol11/p1454-ebraheem.pdf
- EMTA https://github.com/brunnurs/entity-matching-transformer
- Auto-EM https://www.microsoft.com/en-us/research/uploads/prod/2019/04/Auto-EM.pdf
- Ditto https://arxiv.org/pdf/2004.00584.pdf
- GROOV (facebook) - https://arxiv.org/pdf/2209.06148.pdf
Foodbert
- http://pic2recipe.csail.mit.edu/
- https://github.com/ChantalMP/Exploiting-Food-Embeddings-for-Ingredient-Substitution
- https://github.com/chambliss/foodbert
- https://deepnote.notion.site/NLP-in-Notebooks-Competition-6616e415f0a44e5c95982e7bc1cb89dd
- Paper:
- Exploiting Food Embeddings for Ingredient Substitution - https://www.scitepress.org/Papers/2021/102020/102020.pdf
item2vec
- Moving Beyond Meta for Better Product Embeddings (MET) - https://medium.com/1mgofficial/moving-beyond-meta-better-product-embeddings-for-better-recommendations-fa6dd1578777
- Item2Vec: Neural Item Embeddings to enhance recommendations - https://tech.olx.com/item2vec-neural-item-embeddings-to-enhance-recommendations-1fd948a6f293
- Papers:
- Product recommendation at scale (prod2vec yahoo) - https://dl.acm.org/doi/pdf/10.1145/2783258.2788627
- item2vec (2016) - https://arxiv.org/pdf/1603.04259.pdf
- Meta-Prod2Vec - Product Embeddings Using Side-Information for Recommendation (2016)- https://arxiv.org/pdf/1607.07326.pdf
- Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba (2018): https://arxiv.org/pdf/1803.02349.pdf
- Deep neural network marketplace recommenders in online experiments by Avito - https://arxiv.org/pdf/1809.02130.pdf
- BERTSCORE: EVALUATING TEXT GENERATION WITH BERT (2019) - https://arxiv.org/pdf/1904.09675.pdf
XAI
- Explainability and Auditability in ML: Definitions, Techniques, and Tools - https://neptune.ai/blog/explainability-auditability-ml-definitions-techniques-tools
- The right way to compute your Shapley Values - https://towardsdatascience.com/the-right-way-to-compute-your-shapley-values-cfea30509254
- A Brief Overview of Methods to Explain AI (XAI) - https://towardsdatascience.com/a-brief-overview-of-methods-to-explain-ai-xai-fe0d2a7b05d6
A/B testing
- Bayesian A/B Testing for Business Decisions
- Statistical Challenges in Online Controlled Experiments: A Review of A/B Testing Methodology
Frameworks
Tensorflow
- https://www.tensorflow.org/guide/keras/preprocessing_layers
- https://www.tensorflow.org/api_docs/python/tf/keras/layers/StringLookup#adapt
- https://towardsdatascience.com/tensorflow-template-for-deep-learning-beginners-3b976d0ee084
Pytorch
- Declarative Deep Learning - https://medium.com/pytorch/ludwig-on-pytorch-1241776417fc
Education
MLE certification
- https://sathishvj.medium.com/notes-from-my-google-cloud-professional-machine-learning-engineer-certification-exam-2110998db0f5
- https://towardsdatascience.com/how-i-passed-the-gcp-professional-ml-engineer-certification-47104f40bec5
- https://cloud.google.com/training/machinelearning-ai?skip_cache=true
- https://www.coursera.org/specializations/machine-learning-engineering-for-production-mlops
- https://www.tensorflow.org/certificate
Course
- University of Amsterdam Master
- Dive into Deep Learning
- Full stack deep learning
- Machine Learning University
Amazon
github - Aman Chadha resources
- Stanford Transformers
- DEEP LEARNING - SPRING 2020 - NYU CENTER FOR DATA SCIENCE
- Stanford CS330: Deep Multi-Task & Meta Learning | Autumn 2021
- Deep Learning Fundamentals with PyTorch Lightning
- Applied Deep Learning Course
- Stat Rethinking