2023 ReinforcedSelfTrainingReSTforLa
Jump to navigation
Jump to search
- (Gulcehre et al., 2023) ⇒ Caglar Gulcehre, Tom Le Paine, Srivatsan Srinivasan, Ksenia Konyushkova, Lotte Weerts, Abhishek Sharma, Aditya Siddhant, Alex Ahern, Miaosen Wang, Chenjie Gu, Wolfgang Macherey, Arnaud Doucet, Orhan Firat, and Nando de Freitas. (2023). “Reinforced Self-Training (ReST) for Language Modeling.” In: arXiv preprint arXiv:2308.08998. doi:10.48550/arXiv.2308.08998
Subject Headings:
Notes
Cited By
Quotes
Abstract
No_abstract
References
;
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
2023 ReinforcedSelfTrainingReSTforLa | Arnaud Doucet Nando de Freitas Caglar Gulcehre Wolfgang Macherey Alex Ahern Tom Le Paine Srivatsan Srinivasan Ksenia Konyushkova Lotte Weerts Abhishek Sharma Aditya Siddhant Miaosen Wang Chenjie Gu Orhan Firat | Reinforced Self-Training (ReST) for Language Modeling | 10.48550/arXiv.2308.08998 | 2023 |