LSPGS Wikipedia Long Sentences Summarization System
An LSPGS Wikipedia Long Sentences Summarization System is a Multi-Document Text Summarization System that can solve an LSPGS Wikipedia Long Sentences Summarization Task by implementing LSPGS Wikipedia Long Sentences Summarization Algorithms.
- AKA: Liu-Saleh-Pot-Goodrich-Sepassi Wikipedia Long Sentences Summarization System, Liu's Wikipedia Long Sentences Summarization System.
- Context:
- It was developed by Liu et al. (2018).
- GitHub Repository: https://github.com/tensorflow/tensor2tensor/tree/master/tensor2tensor/data_generators/wikisum
- System's Architecture:
- It is a two-stage text summarization system: an Extractive Summarization stage that selects salient source passages, followed by an Abstractive Summarization stage that generates the output text (see the pipeline sketch after this list).
- Data representation is based on a sub-word tokenization scheme (Wu et al., 2016); a toy segmentation example also appears after this list.
- Training and other ML Tools:
- seq2seq-attn models are trained to optimize a maximum likelihood objective.
- It uses the open-source tensor2tensor library for the training of abstractive models.
- It uses a beam search of size 4 with length-normalization penalty alpha=0.6 during decoding (Wu et al., 2016); the scoring sketch below illustrates this.
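The two-stage design can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `abstractive_model` is a hypothetical stand-in for the trained abstractive model, and the extractive stage is approximated here by TF-IDF ranking of source paragraphs against the article title (cf. Ramos, 2003) using scikit-learn.

```python
# Minimal two-stage sketch: extractive selection, then abstractive generation.
# `abstractive_model` is hypothetical; it is not a tensor2tensor API.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def extract_top_paragraphs(title, paragraphs, budget=10):
    """Extractive stage: rank source paragraphs by TF-IDF cosine
    similarity to the article title and keep the top `budget`."""
    tfidf = TfidfVectorizer().fit_transform([title] + paragraphs)
    scores = cosine_similarity(tfidf[0], tfidf[1:]).ravel()
    order = sorted(range(len(paragraphs)), key=lambda i: -scores[i])
    return [paragraphs[i] for i in order[:budget]]

def summarize(title, paragraphs, abstractive_model):
    """Abstractive stage: generate from the concatenated extracts."""
    selected = extract_top_paragraphs(title, paragraphs)
    return abstractive_model.generate(" ".join([title] + selected))
```

The `budget` parameter is an illustrative stand-in for the abstractive model's input-length limit, which in practice caps how much extracted text the second stage can condition on.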
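Sub-word tokenization can likewise be illustrated with a toy greedy longest-match segmenter, in the spirit of the wordpiece scheme of Wu et al. (2016). The vocabulary below is made up for the example; this is not the system's actual tokenizer.

```python
def subword_tokenize(word, vocab):
    """Greedy longest-match segmentation into sub-word units.
    Illustrative simplification of wordpiece-style tokenization."""
    pieces, start = [], 0
    while start < len(word):
        end = len(word)
        # Shrink the window until the longest in-vocabulary piece is found.
        while end > start and word[start:end] not in vocab:
            end -= 1
        if end == start:          # no piece matches: emit an unknown marker
            return ["<unk>"]
        pieces.append(word[start:end])
        start = end
    return pieces

# Hypothetical vocabulary; real systems learn ~32k sub-word units from data.
vocab = {"summar", "ization", "sum", "mar", "iz", "ation"}
print(subword_tokenize("summarization", vocab))  # ['summar', 'ization']
```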
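For decoding, the stated hyperparameters correspond to the length-normalized beam scoring of Wu et al. (2016), in which a hypothesis's log-probability is divided by the length penalty lp(Y) = (5 + |Y|)^alpha / 6^alpha. A minimal sketch, assuming finished beam hypotheses arrive as (log-probability, token list) pairs:

```python
def length_penalty(length, alpha=0.6):
    """GNMT-style length penalty: lp(Y) = (5 + |Y|)**alpha / 6**alpha."""
    return ((5.0 + length) ** alpha) / (6.0 ** alpha)

def best_hypothesis(beam, alpha=0.6):
    """Pick the highest length-normalized hypothesis from a finished beam
    (e.g. of size 4): score = log_prob / lp(length)."""
    return max(beam, key=lambda h: h[0] / length_penalty(len(h[1]), alpha))

# A longer hypothesis can win despite a lower raw log-probability:
beam = [(-4.0, ["a", "b", "c"]),
        (-4.5, ["a", "b", "c", "d", "e"])]
print(best_hypothesis(beam))  # (-4.5, ['a', 'b', 'c', 'd', 'e'])
```

Without this normalization, raw log-probability scoring systematically favors shorter outputs; dividing by lp(Y) with alpha=0.6 offsets that bias during beam search.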
- Example(s):
- …
- Counter-Example(s):
- See: Abstractive Text Summarization System, Neural Abstractive Summarization System, Self-Attention Mechanism, Sequence-to-Sequence (seq2seq) Neural Network, Natural Language Generation System, Natural Language Understanding System, Natural Language Translation System, Natural Language Processing System.
References
2018
- (Liu et al., 2018) ⇒ Peter J. Liu, Mohammad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, and Noam Shazeer. (2018). “Generating Wikipedia by Summarizing Long Sequences.” In: Proceedings of the Sixth International Conference on Learning Representations (ICLR-2018).
2017
- (Vaswani et al., 2017) ⇒ Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. (2017). “Attention Is All You Need.” In: Advances in Neural Information Processing Systems.
2016a
- (Nallapati et al., 2016) ⇒ Ramesh Nallapati, Bowen Zhou, Cicero Nogueira dos Santos, Caglar Gulcehre, and Bing Xiang. (2016). “Abstractive Text Summarization Using Sequence-to-sequence RNNs and Beyond.” In: Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning (CoNLL 2016). DOI:10.18653/v1/K16-1028.
2016b
- (Wu et al., 2016) ⇒ Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, and 27 co-authors. (2016). “Google's Neural Machine Translation System: Bridging the Gap Between Human and Machine Translation.” In: arXiv preprint arXiv:1609.08144.
2015
- (Bahdanau et al., 2015) ⇒ Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. (2015). “Neural Machine Translation by Jointly Learning to Align and Translate.” In: Proceedings of the Third International Conference on Learning Representations (ICLR-2015).
2005
- (Nenkova & Vanderwende, 2005) ⇒ Ani Nenkova, and Lucy Vanderwende (2005). “The Impact of Frequency on Summarization". Microsoft Research, Redmond, Washington, Tech. Rep. MSR-TR-2005, 101.
2004
- (Mihalcea & Tarau, 2004) ⇒ Rada Mihalcea, and Paul Tarau. (2004). “TextRank: Bringing Order into Texts.” In: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004)
2003a
- (Graff & Cieri, 2003) ⇒ David Graff, and Christopher Cieri (2003). "English Gigaword". Linguistic Data Consortium, Philadelphia, 2003.
- QUOTE: English Gigaword was produced by Linguistic Data Consortium (LDC) catalog number LDC2003T05 and ISBN 1-58563-260-0, and is distributed on DVD. This is a comprehensive archive of newswire text data in English that has been acquired over several years by the LDC.
Four distinct international sources of English newswire are represented here:
2003b
- (Ramos, 2003) ⇒ Juan Ramos. (2003). “Using TF-IDF to Determine Word Relevance in Document Queries.” In: Proceedings of the First Instructional Conference on Machine Learning, volume 242, pp. 133-142.