2023 LargeLanguageModelIsNotaGoodFew
- (Ma, Cao et al., 2023) ⇒ Yubo Ma, Yixin Cao, YongChing Hong, and Aixin Sun. (2023). “Large Language Model Is Not a Good Few-shot Information Extractor, But a Good Reranker for Hard Samples!.” doi:10.48550/arXiv.2303.08559
Subject Headings: Few-Shot Information Extraction.
Notes
Cited By
Quotes
Abstract
Large Language Models (LLMs) have made remarkable strides in various tasks. However, whether they are competitive few-shot solvers for information extraction (IE) tasks and surpass fine-tuned small Pre-trained Language Models (SLMs) remains an open problem. This paper aims to provide a thorough answer to this problem, and moreover, to explore an approach towards effective and economical IE systems that combine the strengths of LLMs and SLMs. Through extensive experiments on eight datasets across three IE tasks, we show that LLMs are not effective few-shot information extractors in general, given their unsatisfactory performance in most settings and the high latency and budget requirements. However, we demonstrate that LLMs can well complement SLMs and effectively solve hard samples that SLMs struggle with. Building on these findings, we propose an adaptive filter-then-rerank paradigm, in which SLMs act as filters and LLMs act as rerankers. By utilizing LLMs to rerank a small portion of difficult samples identified by SLMs, our preliminary system consistently achieves promising improvements (2.1% F1-gain on average) on various IE tasks, at an acceptable cost in time and money.
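The filter-then-rerank paradigm described in the abstract can be sketched as a simple routing pipeline: the cheap SLM answers every sample, and only low-confidence ("hard") samples are forwarded to the LLM, which picks among the SLM's top candidates. The sketch below is a minimal illustration under assumed interfaces, not the authors' implementation; the names `slm_predict`, `llm_rerank`, `THRESHOLD`, and `TOP_K` are hypothetical stand-ins.

```python
# Minimal sketch of an adaptive filter-then-rerank pipeline.
# slm_predict, llm_rerank, THRESHOLD, and TOP_K are hypothetical
# stand-ins, not taken from the paper's code.

from typing import Callable, List, Tuple

THRESHOLD = 0.9  # assumed confidence cutoff separating easy from hard samples
TOP_K = 5        # assumed number of SLM candidates handed to the LLM

def filter_then_rerank(
    samples: List[str],
    slm_predict: Callable[[str], List[Tuple[str, float]]],  # (label, prob) pairs
    llm_rerank: Callable[[str, List[str]], str],            # picks one candidate
) -> List[str]:
    """Keep the SLM's answer when it is confident; otherwise ask the
    LLM to rerank the SLM's top-k candidate labels."""
    predictions = []
    for sample in samples:
        candidates = sorted(slm_predict(sample), key=lambda c: c[1], reverse=True)
        best_label, best_prob = candidates[0]
        if best_prob >= THRESHOLD:
            # Easy sample: the cheap SLM handles it alone.
            predictions.append(best_label)
        else:
            # Hard sample: the LLM reranks a small candidate set, so LLM
            # calls (and their latency/cost) stay a small fraction of traffic.
            top_labels = [label for label, _ in candidates[:TOP_K]]
            predictions.append(llm_rerank(sample, top_labels))
    return predictions
```

Because only the samples the SLM is unsure about reach the LLM, this design trades a small number of expensive LLM calls for gains on exactly the cases where the SLM is weakest, which is the cost/quality balance the abstract emphasizes.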
References
Author | Date | Title | DOI | Year
---|---|---|---|---
Yubo Ma, Yixin Cao, YongChing Hong, and Aixin Sun | 2023 | Large Language Model Is Not a Good Few-shot Information Extractor, But a Good Reranker for Hard Samples! | 10.48550/arXiv.2303.08559 | 2023