Alec Radford

From GM-RKB

Alec Radford is a person.



References

2024

[1] https://www.artificial-intelligence.blog/people-in-ai/alec-radford
[2] https://schneppat.com/alec-radford.html
[3] https://www.wired.com/story/what-openai-really-wants/
[4] https://www.theatlantic.com/magazine/archive/2023/09/sam-altman-openai-chatgpt-gpt-4/674764/
[5] https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf

2021

2020

2019

2018

2017

  • (Schulman et al., 2017) ⇒ John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. (2017). "Proximal Policy Optimization Algorithms." In: arXiv preprint arXiv:1707.06347. [1].
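    • NOTE: It introduces Proximal Policy Optimization (PPO), a family of policy gradient methods that alternate between sampling data through interaction with the environment and optimizing a clipped surrogate objective over several epochs of minibatch updates, offering a simpler and more general alternative to Trust Region Policy Optimization (TRPO).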
    • NOTE: It presents a novel optimization method for reinforcement learning that has since become one of the most widely used techniques in the field due to its ease of implementation and efficiency.
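
The clipped surrogate objective at the core of PPO can be illustrated with a minimal NumPy sketch. The function name `ppo_clip_objective` and its argument names are hypothetical and chosen only for this illustration; the default `eps=0.2` is the clipping value the paper reports using in its experiments. This shows only the objective for a batch of samples, not a full training loop, value-function loss, or entropy bonus.

```python
import numpy as np

def ppo_clip_objective(new_logp, old_logp, advantages, eps=0.2):
    """Clipped surrogate objective, averaged over a batch (to be maximized).

    new_logp, old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy (hypothetical argument names).
    advantages: advantage estimates for the same samples.
    """
    new_logp = np.asarray(new_logp, dtype=float)
    old_logp = np.asarray(old_logp, dtype=float)
    advantages = np.asarray(advantages, dtype=float)

    # Probability ratio r_t = pi_new(a_t | s_t) / pi_old(a_t | s_t).
    ratio = np.exp(new_logp - old_logp)

    # Unclipped and clipped surrogate terms.
    unclipped = ratio * advantages
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * advantages

    # Taking the elementwise minimum gives a pessimistic bound that removes
    # the incentive to move the ratio far outside [1 - eps, 1 + eps].
    return float(np.mean(np.minimum(unclipped, clipped)))
```

When the new and old policies assign the same probabilities, the ratio is 1 and the objective reduces to the mean advantage. For a positive advantage, the gain stops growing once the ratio exceeds `1 + eps`; for a negative advantage, a ratio above `1 + eps` is still penalized in full.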

2016

2015

