Custom Self-Attention and Late Interaction for Basket Model: Capturing Complementarity and Purchase Intent in Retail

Vincent Auriau^{1, 2}, Michaël Teboul¹, Martin Možina³ and Emmanuel Malherbe¹

¹ _{Artefact Research Center,} ² _{MICS - CentraleSupélec,} ³ _{Fortenova Group}

In large-scale retail environments involving thousands of products, understanding how products are purchased, in particular if and how they interact together, is of great importance. Such insights, like products complementarity and substitution, are crucial for assortment optimization, promotion planning, and store layout design. While methods modeling products with embeddings have demonstrated strong performance in learning meaningful representations, modeling efficiently a basket of products as a whole remains a challenge. In this work, we leverage self-attention, a core operation of the Transformer architecture, well known for its ability in language modeling to enrich the representation of tokens given their context. We propose an architecture, training procedure and scoring function adapted to the structure of basket of items, that remain fairly simple and interpretable. It achieves state-of-the-art performance on the basket completion task for several datasets, including a large-scale one from a private retail actor. We provide a detailed analysis of the different architecture components and show how they can be interpreted to better understand products: in terms of popularity, clusters and interactions between them. These insights can be efficiently leveraged in an industrial context, such as within a dashboard we developed for category managers.

Reproducing Experiments

Install

git clone --recurse-submodules   git@github.com:artefactory/saber.git

Open-Source Datasets

The different datasets can be downloaded using the following links and placed in the folder "/datasets".

Python Requirements

Python >= 3.10
NumPy
TensorFlow
pyreadr
choice-learn

pip install requirements.txt

Compared Models

Run Experiments

python experiments/training.py
python experiments/evaluate.py

Online Appendix

Ablation Study

Configuration	MRR $\uparrow$	HR@50 $\uparrow$	NDCG $\uparrow$
Full Architecture	0.0632	27.9	0.190
Components Ablation
w/o Res-FFN	0.0628	27.8	0.190
w/o self-Attention	0.0608	27.0	0.188
w/o Popularity bias	0.0510	22.8	0.174
w/ Value Matrix	$0.0602$	26.8	0.186
Model Capacity
4 Heads - 1 Layer	0.0617	27.4	0.188
1 Head - 2 Layers	0.0628	27.6	0.188
1 Head - 4 Layers	0.0429	19.2	0.163
Mapping Strategy
w/o Weight Tying	0.0585	26.1	0.181

Compared models summary

	Prod2Vec	AleaCarta	AttRec	BERT4Rec	SABER
Embedding distance	cosine	cosine	$L_2$	cosine	cosine
Uses self-attention	No	No	Yes	Yes	Yes
w/ value matrix	-	-	No	Yes	No
multi head/layers	No	No	No	Yes	No
w/ price effect	No	Yes	No	No	Yes
Ties input & output embeddings	No	Yes	Yes	Yes	Yes
Designed for variable sized sets	Yes	Yes	No	No	Yes
Order invariant	Yes	Yes	No	No	Yes
Nb tokens hidden	1	1	1	N	1

Application Screenshots

Landing page: Products pages: Basket builder page:

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
BERT4Rec/BERT4Rec		BERT4Rec/BERT4Rec
datasets		datasets
experiments		experiments
notebooks		notebooks
python		python
resources		resources
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Custom Self-Attention and Late Interaction for Basket Model: Capturing Complementarity and Purchase Intent in Retail

Reproducing Experiments

Install

Open-Source Datasets

Python Requirements

Compared Models

Run Experiments

Online Appendix

Ablation Study

Compared models summary

Application Screenshots

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Custom Self-Attention and Late Interaction for Basket Model: Capturing Complementarity and Purchase Intent in Retail

Reproducing Experiments

Install

Open-Source Datasets

Python Requirements

Compared Models

Run Experiments

Online Appendix

Ablation Study

Compared models summary

Application Screenshots

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages