Ryan Lowe
References
- Professional Homepage: https://www.cs.mcgill.ca/~rlowe1/
- Google Scholar Author Page: https://scholar.google.com/citations?user=iRgYMuEAAAAJ
2022
- (Ouyang et al., 2022) ⇒ Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, and Ryan Lowe. (2022). “Training Language Models to Follow Instructions with Human Feedback.” In: arXiv preprint arXiv:2203.02155.
2017
- (Lowe et al., 2017) ⇒ Ryan Lowe, Yi I. Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, and Igor Mordatch. (2017). “Multi-agent Actor-Critic for Mixed Cooperative-Competitive Environments.” In: Advances in Neural Information Processing Systems 30.
- (Bahdanau et al., 2017a) ⇒ Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. (2017). “An Actor-Critic Algorithm for Sequence Prediction.” In: Proceedings of the International Conference on Learning Representations.