WS-Sim Dataset
Jump to navigation
Jump to search
A WS-Sim Dataset is a Semantic Word Similarity Dataset that is a portion WordSim-353 Dataset introduced by Agirre et al. (2009).
- Example(s):
- Counter-Example(s):
- See: Training Dataset, Semantic Word Similarity Measure, Semantic Word Similarity System, SemEval-2017 Task 2, Reading Comprehension Dataset, Question-Answer Dataset.
References
2021
- (Gabrilovich, 2021) ⇒ http://gabrilovich.com/resources/data/wordsim353/ Released: 2002-02-10, Retrieved:2021-07-18.
- QUOTE: The WordSimilarity-353 Test Collection contains two sets of English word pairs along with human-assigned similarity judgements. The collection can be used to train and/or test computer algorithms implementing semantic similarity measures (i.e., algorithms that numerically estimate similarity of natural language words).
2020
- (ACL, 2020) ⇒ https://aclweb.org/aclwiki/WordSimilarity-353_Test_Collection_(State_of_the_art) Last edited on 16 June 2020.
- QUOTE: contains two sets of English word pairs along with human-assigned similarity judgements;
- first set (set1) contains 153 word pairs along with their similarity scores assigned by 13 subjects;
- second set (set2) contains 200 word pairs with similarity assessed by 16 subjects;
- ...
- performance is measured by Spearman's rank correlation coefficient.
- introduced by Finkelstein et al. (2002);
- subsequently used by many other researchers;
- QUOTE: contains two sets of English word pairs along with human-assigned similarity judgements;
2017
- (Camacho-Collados et al., 2017) ⇒ Jose Camacho-Collados, aMohammad Taher Pilehvar, Nigel Collier, and Roberto Navigli. (2017). “SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity.” In: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval@ACL 2017).
2009
- (Agirre et al., 2009) ⇒ Eneko Agirre, Enrique Alfonseca, Keith B. Hall, Jana Kravalova, Marius Pasca, Aitor Soroa (2009). "A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches". In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2009).
2002
- (Finkelstein et al., 2002) ⇒ Lev Finkelstein, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, and Eytan Ruppin (2002). "Placing Search in Context: The Concept Revisited". In: ACM Transactions on Information Systems (TOIS), Volume 20.