Hi,
Thanks for the impressive paper and high quality open-resourced codebase.
I ran an experiment on zappos with this codebase (w/o any editing) and found that there is a big gap between the testset HM number(47.11) of CGE to the reported one(60.5) in the paper. The following is the eval log on test set

Furthermore, from this log, although the test auc number is close to the reported one(33.5), but the best unseen(66.05) also has a big gap to the reported one(71.5).
I am a little bit confused about these number gaps and could you please be kind to give some explainations. Thanks a lot.