Swaminathan & Joachims, 2015
Jump to navigation
Jump to search
See: Swaminathan & Joachims, 2015a, Swaminathan & Joachims, 2015b.
References
2015
- (Swaminathan & Joachims, 2015a) ⇒ Adith Swaminathan, and Thorsten Joachims. (2015). “Counterfactual Risk Minimization: Learning from Logged Bandit Feedback.” In: Proceedings of the 32nd International Conference on International Conference on Machine Learning - Volume 37.
- (Swaminathan & Joachims, 2015b) ⇒ Adith Swaminathan, and Thorsten Joachims. (2015). “The Self-normalized Estimator for Counterfactual Learning.” In: Proceedings of the 28th International Conference on Neural Information Processing Systems.