A Reinforcement Learning Framework for Explainable Recommendation.pdf
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions.pdf
DRN - A Deep Reinforcement Learning Framework for News Recommendation.pdf
Deep Reinforcement Learning for List-wise Recommendations.pdf
Deep Reinforcement Learning for Page-wise Recommendations.pdf
Deep Reinforcement Learning for Search, Recommendation, and Online Advertising - A Survey.pdf
Exploration and Regularization of the Latent Action Space in Recommendation.pdf
InTune - Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models.pdf
Jointly Learning to Recommend and Advertise.pdf
Large-scale Interactive Recommendation with Tree-structured Policy Gradient.pdf
Model-free Reinforcement Learning with Stochastic Reward Stabilization for Recommender Systems.pdf
Off-policy evaluation for slate recommendation.pdf
Online Matching - A Real-time Bandit System for Large-scale Recommendations.pdf
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning.pdf
Reinforcement Learning for Slate-based Recommender Systems - A Tractable Decomposition and Practical Methodology.pdf
Reinforcing User Retention in a Billion Scale Short Video Recommender System.pdf
Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation.pdf
Supervised Learning-enhanced Multi-Group Actor Critic for Live-stream Recommendation.pdf
Top-K Off-Policy Correction for a REINFORCE Recommender System.pdf
Towards Capacity-Aware Broker Matching - From Recommendation to Assignment.pdf
Two-Stage Constrained Actor-Critic for Short Video Recommendation.pdf
Virtual-Taobao - Virtualizing Real-world Online Retail Environment for Reinforcement Learning.pdf
When People Change their Mind - Off-Policy Evaluation in Non-stationary Recommendation Environments.pdf