ワークショップ (国際) Leveraging Context-dependent Click Model for Off-Policy Evaluation of Ranking Policies

Haruka Kiyohara (Tokyo Institute of Technology), Nobuyuki Shimizu, Yasuo Yamamoto

The 16th ACM International Conference on Web Search and Data Mining (WSDM) WORKSHOP ON INTERACTIVE RECOMMENDER SYSTEMS (WSDM IRS Workshop)


We leverage context-dependent click models to achieve a better bias-variance tradeoff compared to the existing estimators. Specifically, we assume that the click model variable, which is sampled conditional on the context, determines the relevant positions in the ranking. This formulation enables a more generalized click modeling compared to the existing work. Then, we propose the Generalized IPS (GIPS) estimator, which adaptively reduces the combinatorial action space depending on the click model. The proposed estimator is unbiased under any given click models and achieves the minimum variance among IPS-based unbiased estimators. The empirical results demonstrate that GIPS achieves a favorable bias-variance tradeoff compared to the existing estimators on various user behaviors.