Paraphrase Detection Task

Context:
- It can be solved by a Paraphrase Detection System (that implements a paraphrase detection algorithm).
Example(s):
- IsParaphrase(“Amrozi accused his brother, whom he called "the witness", of deliberately distorting his evidence.”;
  “Referring to him as only "the witness", Amrozi accused his brother of deliberately distorting his evidence.”
  ) ⇒ true
- based on Microsoft Research Paraphrase Corpus.
- based on DIRT Paraphrase Collection.
- based on Sekine's Paraphrase Database.
- …
Counter-Example(s):
- Paraphrase Generation.
- Text Summarization.
See: Semantic Identity, Paraphrase.

References

https://aclweb.org/aclwiki/Paraphrase_Identification_(State_of_the_art)
source: Microsoft Research Paraphrase Corpus (MSRP)
task: given a pair of sentences, classify them as paraphrases or not paraphrases
see: Dolan et al. (2004).
train: 4,076 sentence pairs (2,753 positive: 67.5%)
test: 1,725 sentence pairs (1,147 positive: 66.5%)
see also: Similarity (State of the art)

Sentence 1: Amrozi accused his brother, whom he called "the witness", of deliberately distorting his evidence.
Sentence 2: Referring to him as only "the witness", Amrozi accused his brother of deliberately distorting his evidence.
Class: 1 (true paraphrase)

(Yin & Schütze, 2015) ⇒ Wenpeng Yin, and Hinrich Schütze. (2015). “Convolutional Neural Network for Paraphrase Identification.” In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 901-911.

(Socher et al., 2011) ⇒ Richard Socher, Eric H. Huang, Jeffrey Pennin, Christopher D. Manning, and Andrew Y. Ng. (2011). “Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection.” In: Advances in Neural Information Processing Systems, pp. 801-809.

(Das & Smith, 2009) ⇒ Dipanjan Das, and Noah A. Smith. (2009). “Paraphrase Identification As Probabilistic Quasi-synchronous Recognition.” In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (ACL/IJCNLP-2009).