Stanford Question Answering (SQuAD) Benchmark Task

From GM-RKB
Jump to navigation Jump to search

A Stanford Question Answering (SQuAD) Benchmark Task is a LLM inference evaluation task that is a question answering benchmark task that can be used to assess a language model’s ability to answer factual questions based on context passages using extractive or generative outputs.



References

2023

2018a

2018b

2017

2016