1998 ExtractingRelationsFromWWW
- (Brin, 1998) ⇒ Sergey Brin. (1998). “Extracting Patterns and Relations from the World Wide Web.” In: Proceedings of the EDBT 1998 Workshop on the Web and Databases (WebDB 1998).
Subject Headings: DIPRE Algorithm, Bootstrapping, Relation Recognition Task, Relation Recognition Algorithm, Lexico-Syntactic Pattern Matching Algorithm.
Notes
Cited By
2000
- (Agichtein & Gravano, 2000) ⇒ Eugene Agichtein, and Luis Gravano. (2000). “Snowball: Extracting Relations from Large Plain-Text Collections.” In: Proceedings of the 5th ACM International Conference on Digital Libraries (DL 2000). doi:10.1145/336597.336644
Quotes
Abstract
The World Wide Web is a vast resource for information. At the same time it is extremely distributed. A particular type of data such as restaurant lists may be scattered across thousands of independent information sources in many different formats. In this paper, we consider the problem of extracting a relation for such a data type from all of these sources automatically. We present a technique which exploits the duality between sets of patterns and relations to grow the target relation starting from a small sample To test our technique we use it to extract a relation of (author, title) pairs from the World Wide Web.
References
- Amazon home page http://www.amazon.com
- Sergey Brin and Larry Page. Google search engine. http://google.stanford.edu.
- Sergey Brin. List of books. http://www-db.stanford.edu/~sergey/booklist.html
- Scott Deerwester, Susan Dumais, Goerge Furnas, Thomas K. Landauer, and Richard Harshman. (1990). Indexing by latent semantic analysis Journal of the American Society for Information Science
- Workshop on management of semistructured data. (1997). http://www.research.att.com/~
suciu/workshop-papers.html
- Visa shopping guide for books http://shopguide.yahoo.com/shopguide/books.html,
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
1998 ExtractingRelationsFromWWW | Sergey Brin | Extracting Patterns and Relations from the World Wide Web | http://dbpubs.stanford.edu:8090/pub/1999-65 |