Page history

Page

InstructGPT LLM Model

28 January 2024

Gmelli
Text replacement - ". ↵" to ". "
m
01:47
−2

26 January 2024

Gmelli
Text replacement - " GPT-3, " to " GPT-3, "
m
06:26
+4

19 September 2023

Gmelli
Created page with "An InstructGPT LLM Model is a finetuned LLM model that is trained to follow instructions and complete requests thoughtfully. * <B>Context:</B> ** It is trained using reinforcement learning from human feedback. ** … * <B>Counter-Example(s):</B> ** ChatGPT Model. * <B>See:</B> DallE 2, Reinforcement Learning from Human Feedback (RLHF). ---- ---- == References == === 2022 === * https://openai.com/blog/instruction-following/ ** QUOTE: We’ve tr..."
08:46
+4,885

Retrieved from "http://www.gabormelli.com/RKB/Special:History/InstructGPT_LLM_Model"