1995 TrasformationBasedErrorDrivenPOSTagging

(Brill, 1995) ⇒ Eric D. Brill. (1995). “Transformation-based Error-Driven Learning and Natural Language Processing: A Case Study in Part of Speech Tagging.” In: Computational Linguistics, 21(4).

Subject Headings: Part-of-Speech Tagging Algorithm, Transformation-based Learning Algorithm.

Notes

ACM Digital Library page http://portal.acm.org/citation.cfm?id=218367

Cited By

~1288 papers http://scholar.google.com/scholar?cites=6669039697562179777

2003

(Sha & Pereira, 2003a) ⇒ Fei Sha, and Fernando Pereira. (2003). “Shallow Parsing with Conditional Random Fields.” In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (HLT-NAACL 2003). doi:10.3115/1073445.1073473
(Collins, 2003) ⇒ Michael Collins. (2003). “Head-Driven Statistical Models for Natural Language Parsing.” In: Computational Linguistics, 29(4). doi:10.1162/089120103322753356.

2001

(Lafferty et al., 2001) ⇒ John D. Lafferty, Andrew McCallum, and Fernando Pereira. (2001). “Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data.” In: Proceedings of ICML 2001.

2000

Andrew McCallum, Dayne Freitag, Fernando Pereira. (2000). “Maximum entropy Markov models for information extraction and segmentation.” In: Proceedings17th International Conference on Machine Learning (ICML 2000).

Quotes

Abstract

Recently, there has been a rebirth of empiricism in the field of natural language processing. Manual encoding of linguistic information is being challenged by automated corpus-based learning as a method of providing a natural language processing system with linguistic knowledge. Although corpus-based approaches have been successful in many different areas of natural language processing, it is often the case that these methods capture the linguistic information they are modelling indirectly in large opaque tables of statistics. This can make it difficult to analyze, understand and improve the ability of these approaches to model underlying linguistic behavior. In this paper, we will describe a simple rule-based approach to automated learning of linguistic knowledge. This approach has been shown for a number of tasks to capture information in a clearer and more direct fashion without a compromise in performance. We present a detailed case study of this learning method applied to part-of-speech tagging.

References

Ezra Black, Fred Jelinek, John D. Lafferty, David M. Magerman, Robert Mercer, Salim Roukos, Towards history-based grammars: using richer models for probabilistic parsing, Proceedings of the 31st annual meeting on Association for Computational Linguistics, p.31-37, June 22-26, 1993, Columbus, Ohio doi:10.3115/981574.981579
Ezra Black, Fred Jelinek, John D. Lafferty, Robert Mercer, Salim Roukos, Decision tree models applied to the labeling of text with parts-of-speech, Proceedings of the workshop on Speech and Natural Language, February 23-26, 1992, Harriman, New York doi:10.3115/1075527.1075554
Leo Breiman; Jerome H. Friedman; Olshen, Richard; and Stone, Charles (1984). Classification and regression trees. Wadsworth and Brooks.
Eric D. Brill, A simple rule-based part of speech tagger, Proceedings of the third Conference on Applied Natural Language Processing, March 31-April 03, 1992, Trento, Italy doi:10.3115/974499.974526
Eric D. Brill, Automatic grammar induction and parsing free text: a transformation-based approach, Proceedings of the 31st annual meeting on Association for Computational Linguistics, p.259-265, June 22-26, 1993, Columbus, Ohio doi:10.3115/981574.981609
Eric D. Brill, A corpus-based approach to language learning, University of Pennsylvania, Philadelphia, PA, 1993
Brill, Eric (1993c). “Transformation-based error-driven parsing.” In: Proceedings, Third International Workshop on Parsing Technologies, Tilburg, The Netherlands.
Eric D. Brill, Some advances in transformation-based part of speech tagging, Proceedings of the twelfth national conference on Artificial intelligence (vol. 1), p.722-727, October 1994, Seattle, Washington, United States
Eric D. Brill, Philip Resnik, A rule-based approach to prepositional phrase attachment disambiguation, Proceedings of the 15th conference on Computational linguistics, August 05-09, 1994, Kyoto, Japan doi:10.3115/991250.991346
Peter F. Brown, John Cocke, Stephen A. Della Pietra, Vincent J. Della Pietra, Fredrick Jelinek, John D. Lafferty, Robert L. Mercer, Paul S. Roossin, A statistical approach to machine translation, Computational Linguistics, v.16 n.2, p.79-85, June 1990
Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, Robert L. Mercer, Word-sense disambiguation using statistical methods, Proceedings of the 29th annual meeting on Association for Computational Linguistics, p.264-270, June 18-21, 1991, Berkeley, California doi:10.3115/981344.981378
Rebecca Bruce, Janyce M. Wiebe, Word-sense disambiguation using decomposable models, Proceedings of the 32nd annual meeting on Association for Computational Linguistics, p.139-146, June 27-30, 1994, Las Cruces, New Mexico doi:10.3115/981732.981752
Eugene Charniak; Hendrickson, Curtis; Jacobson, Neil; and Perkowitz, Michael (1993). “Equations for part of speech tagging.” In: Proceedings, Conference of the American Association for Artificial Intelligence (AAAI-93), Washington, DC.
Kenneth W. Church, A stochastic parts program and noun phrase parser for unrestricted text, Proceedings of the second Conference on Applied Natural Language Processing, February 09-12, 1988, Austin, Texas doi:10.3115/974235.974260
Doug Cutting, Julian Kupiec, Jan Pedersen, Penelope Sibun, A practical part-of-speech tagger, Proceedings of the third Conference on Applied Natural Language Processing, March 31-April 03, 1992, Trento, Italy doi:10.3115/974499.974523
Carl G. de Marcken, Parsing the LOB corpus, Proceedings of the 28th annual meeting on Association for Computational Linguistics, p.243-251, June 06-09, 1990, Pittsburgh, Pennsylvania doi:10.3115/981823.981854
Steven J. DeRose, Grammatical category disambiguation by statistical optimization, Computational Linguistics, v.14 n.1, p.31-39, Winter 1988
Francis, Winthrop Nelson and Kucera, Henry (1982). Frequency analysis of English usage: Lexicon and grammar. Houghton Mifflin, Boston.
Fujisaki, Tetsu; Jelinek, Fred; Cocke, John; and Black, Ezra (1989). “Probabilistic parsing method for sentence disambiguation.” In: Proceedings, International Workshop on Parsing Technologies, Carnegie Mellon University, Pittsburgh, PA.
William A. Gale, Kenneth W. Church, A program for aligning sentences in bilingual corpora, Proceedings of the 29th annual meeting on Association for Computational Linguistics, p.177-184, June 18-21, 1991, Berkeley, California doi:10.3115/981344.981367
Gale, William; Kenneth W. Church, and David Yarowsky (1992). “A method for disambiguating word senses in a large corpus.” Computers and the Humanities.
Geoffrey Leech, Roger Garside, Michael Bryant, CLAWS4: the tagging of the British National Corpus, Proceedings of the 15th conference on Computational linguistics, August 05-09, 1994, Kyoto, Japan doi:10.3115/991886.991996
Harris, Zellig (1962). String Analysis of Language Structure. Mouton and Co., The Hague.
Donald Hindle, Acquiring disambiguation rules from text, Proceedings of the 27th annual meeting on Association for Computational Linguistics, p.118-125, June 26-29, 1989, Vancouver, British Columbia, Canada doi:10.3115/981623.981638
Donald Hindle, Mats Rooth, Structural ambiguity and lexical relations, Computational Linguistics, v.19 n.1, March 1993
Huang, Caroline; Son-Bell, Mark; and Baggett, David (1994). “Generation of pronunciations from orthographies using transformation-based error-driven learning.” In: Proceedings of the International Conference on Speech and Language Processing (ICSLP), Yokohama, Japan.
F. Jelinek, Self-organized language modeling for speech recognition, Readings in speech recognition, Morgan Kaufmann Publishers Inc., San Francisco, CA, 1990
Aravind K. Joshi, B. Srinivas, Disambiguation of super parts of speech (or supertags): almost parsing, Proceedings of the 15th conference on Computational linguistics, August 05-09, 1994, Kyoto, Japan doi:10.3115/991886.991912. 29. Sheldon Klein, Robert F. Simmons, A Computational Approach to Grammatical Coding of English Words, Journal of the ACM (JACM), v.10 n.3, p.334-347, July 1963 doi:10.1145/321172.321180
Kupiec, Julian (1992). “Robust part-of-speech tagging using a hidden Markov model.” Computer Speech and Language, 6.
Mitchell P. Marcus, Mary Ann Marcinkiewicz, Beatrice Santorini, Building a large annotated corpus of English: the penn treebank, Computational Linguistics, v.19 n.2, June 1993
Bernard Merialdo, Tagging English text with a probabilistic model, Computational Linguistics, v.20 n.2, p.155-171, June 1994
Miller, George (1990). “Wordnet: an on-line lexical database.” International Journal of Lexicography. 3(4).
J. R. Quinlan, Induction of Decision Trees, Machine Learning, v.1 n.1, p.81-106 doi:10.1023/A:1022643204877
J. R. Quinlan, R. L. Rivest, Inferring decision trees using the minimum description length principle, Information and Computation, v.80 n.3, p.227-248, Mar. 1989 doi:10.1016/0890-5401(89)90010-2
Ramshaw, Lance and Marcus, Mitchell (1994). “Exploring the statistical derivation of transformational rule sequences for part-of-speech tagging.” In: The Balancing Act: Proceedings of the ACL Workshop on Combining Symbolic and Statistical Approaches to Language, New Mexico State University, July.
Emmanuel Roche, Yves Schabes, Deterministic part-of-speech tagging with finite-state transducers, Computational Linguistics, v.21 n.2, p.227-253, June 1995
Hinrich Schütze, Yoram Singer, Part-of-speech tagging using a Variable Memory Markov model, Proceedings of the 32nd annual meeting on Association for Computational Linguistics, p.181-187, June 27-30, 1994, Las Cruces, New Mexico doi:10.3115/981732.981757
R. A. Sharman, F. Jelinek, R. Mercer, Generating a grammar for statistical training, Proceedings of the workshop on Speech and Natural Language, p.267-274, June 24-27, 1990, Hidden Valley, Pennsylvania doi:10.3115/116580.116667
Ralph Weischedel, Richard Schwartz, Jeff Palmucci, Marie Meteer, Lance Ramshaw, Coping with ambiguity and unknown words through probabilistic models, Computational Linguistics, v.19 n.2, June 1993
David Yarowsky, Word-sense disambiguation using statistical models of Roget's categories trained on large corpora, Proceedings of the 14th conference on Computational linguistics, August 23-28, 1992, Nantes, France doi:10.3115/992133.992140,

	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
1995 TrasformationBasedErrorDrivenPOSTagging	Eric D. Brill			Transformation-based Error-Driven Learning and Natural Language Processing: A Case Study in Part of Speech Tagging		Computational Linguistics (CL) Research Area	http://www.cs.mu.oz.au/acl/J/J95/J95-4004.pdf			1995

1995 TrasformationBasedErrorDrivenPOSTagging

Notes

Cited By

2003

2001

2000

Quotes

Abstract

References

Navigation menu

Search