WS-Sim Dataset

A WS-Sim Dataset is a Semantic Word Similarity Dataset that is a portion WordSim-353 Dataset introduced by Agirre et al. (2009).

Example(s):
Counter-Example(s):
See: Training Dataset, Semantic Word Similarity Measure, Semantic Word Similarity System, SemEval-2017 Task 2, Reading Comprehension Dataset, Question-Answer Dataset.

References

2021

(Gabrilovich, 2021) ⇒ http://gabrilovich.com/resources/data/wordsim353/ Released: 2002-02-10, Retrieved:2021-07-18.
- QUOTE: The WordSimilarity-353 Test Collection contains two sets of English word pairs along with human-assigned similarity judgements. The collection can be used to train and/or test computer algorithms implementing semantic similarity measures (i.e., algorithms that numerically estimate similarity of natural language words).

2020

(ACL, 2020) ⇒ https://aclweb.org/aclwiki/WordSimilarity-353_Test_Collection_(State_of_the_art) Last edited on 16 June 2020.
- QUOTE: contains two sets of English word pairs along with human-assigned similarity judgements;
  - first set (set1) contains 153 word pairs along with their similarity scores assigned by 13 subjects;
  - second set (set2) contains 200 word pairs with similarity assessed by 16 subjects;
  - ...
  - performance is measured by Spearman's rank correlation coefficient.
  - introduced by Finkelstein et al. (2002);
  - subsequently used by many other researchers;

2017

(Camacho-Collados et al., 2017) ⇒ Jose Camacho-Collados, aMohammad Taher Pilehvar, Nigel Collier, and Roberto Navigli. (2017). “SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity.” In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval@ACL 2017).

2009

(Agirre et al., 2009) ⇒ Eneko Agirre, Enrique Alfonseca, Keith B. Hall, Jana Kravalova, Marius Pasca, Aitor Soroa (2009). "A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches". In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2009).

2002

(Finkelstein et al., 2002) ⇒ Lev Finkelstein, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, and Eytan Ruppin (2002). "Placing Search in Context: The Concept Revisited". In: ACM Transactions on Information Systems (TOIS), Volume 20.

Retrieved from "http://www.gabormelli.com/RKB/index.php?title=WS-Sim_Dataset&oldid=815600"