GPT-J Model

Revision as of 19:40, 20 December 2023 by Gmelli (talk | contribs) (Text replacement - "__NOTOC__↵Category:Concept↵__NOTOC__" to "__NOTOC__ Category:Concept")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

A GPT-J Model is a LNLM.



References

2023

  • (Wikipedia, 2023) ⇒ https://en.wikipedia.org/wiki/GPT-J Retrieved:2023-3-13.
    • GPT-J is an open source artificial intelligence language model developed by EleutherAI.[1] GPT-J performs very similarly to OpenAI's GPT-3 on various zero-shot down-streaming tasks and can even outperform it on code generation tasks.[2] The newest version, GPT-J-6B is a language model based on a data set called The Pile.[3] The Pile is an open-source 825 gigibyte language modelling data set that is split into 22 smaller datasets.[4] GPT-J is similar to ChatGPT in ability, although it does not function as a chat bot, only as a text predictor.[5]