1997 SyntacticClusteringoftheWeb
- (Broder et al., 1997) ⇒ Andrei Z. Broder, Steven C. Glassman, Mark S. Manasse, and Geoffrey Zweig. (1997). “Syntactic Clustering of the Web.” In: Selected papers from the sixth International Conference on World Wide Web. doi:10.1016/S0169-7552(97)00031-7
Subject Headings: Syntactic Similarity.
Notes
Cited By
- http://scholar.google.com/scholar?q=%221997%22+Syntactic+Clustering+of+the+Web
- http://dl.acm.org/citation.cfm?id=283202.283370&preflayout=flat#citedby
Quotes
Author Keywords
Similarity; Duplication; Resemblance; Web Search; Fingerprints; Signatures
Abstract
We have developed an efficient way to determine the syntactic similarity of files and have applied it to every document on the World Wide Web. Using this mechanism, we built a clustering of all the documents that are syntactically similar. Possible applications include a "Lost and Found" service, filtering the results of Web searches, updating widely distributed web-pages, and identifying violations of intellectual property rights.
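The abstract does not spell out the similarity mechanism. The paper is generally associated with w-shingling, where a document is treated as the set of its contiguous w-word sequences, and with two set-based measures, resemblance and containment. The sketch below illustrates those two measures under stated assumptions: the function names (`shingles`, `resemblance`, `containment`), the tokenizer, and the default `w=4` are illustrative choices, not taken from this page. It computes the exact set ratios and does not attempt the fingerprint-based sampling that the author keywords hint at.

```python
import re


def shingles(text, w=4):
    """Return the set of contiguous w-word shingles in text (hypothetical helper)."""
    # Assumption: lowercase word tokens; the real canonicalization may differ.
    tokens = re.findall(r"\w+", text.lower())
    if not tokens:
        return set()
    if len(tokens) < w:
        # A document shorter than w words yields one short shingle.
        return {tuple(tokens)}
    return {tuple(tokens[i:i + w]) for i in range(len(tokens) - w + 1)}


def resemblance(a, b, w=4):
    """Jaccard ratio of the two shingle sets: |A & B| / |A | B|."""
    sa, sb = shingles(a, w), shingles(b, w)
    union = sa | sb
    if not union:
        return 1.0  # assumption: two empty documents count as identical
    return len(sa & sb) / len(union)


def containment(a, b, w=4):
    """Fraction of a's shingles that also occur in b: |A & B| / |A|."""
    sa, sb = shingles(a, w), shingles(b, w)
    if not sa:
        return 0.0
    return len(sa & sb) / len(sa)


if __name__ == "__main__":
    doc_a = "the quick brown fox jumps"
    doc_b = "the quick brown fox sleeps"
    print(resemblance(doc_a, doc_b, w=2))
    print(containment(doc_a, doc_b, w=2))
```

Resemblance is symmetric and suits near-duplicate detection, while containment is asymmetric and suits checking whether one document is largely included in another.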
References
Page | Author | title | doi | year |
---|---|---|---|---|
1997 SyntacticClusteringoftheWeb | Andrei Z. Broder, Steven C. Glassman, Mark S. Manasse, Geoffrey Zweig | Syntactic Clustering of the Web | 10.1016/S0169-7552(97)00031-7 | 1997 |