Publications of Mark Johnson

David McClosky, Eugene Charniak, and Mark Johnson. Automatic domain adaptation for parsing. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, HLT '10, pages 28-36, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics. [ bib | .pdf ]


Micha Elsner, Eugene Charniak, and Mark Johnson. Structured Generative Models for Unsupervised Named-Entity Clustering. In Proceedings of NAACL-09: HLT, Boulder, Colorado, June 2009. Association for Computational Linguistics. [ bib | .pdf ]

William P. Headden III, Mark Johnson, and David McClosky. Improving Unsupervised Dependency Parsing with Richer Contexts and Smoothing. In Proceedings of the Human Language Technology Conference of the NAACL, Main Conference (to appear), Boulder, Colorado, May 2009. [ bib ]


David McClosky, Eugene Charniak, and Mark Johnson. When is Self-training Effective for Parsing? In Proceedings of the 22nd International Conference on Computational Linguistics (COLING'08), Manchester, UK, August 2008. [ bib | .pdf | .ps ]


Jianfeng Gao, Galen Andrew, Mark Johnson, and Kristina Toutanova. A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing. In Proceedings of the Association for Computational Linguistics (ACL'07), 2007. [ bib ]

Sharon Goldwater, Thomas L. Griffiths, and Mark Johnson. Distributional Cues to Word Segmentation: Context is Important. In Proceedings of the 31st Boston University Conference on Language Development, 2007. [ bib | .pdf ]

Mark Johnson. Why Doesn't EM Find Good HMM POS-Taggers? In Proceedings of Empirical Methods in Natural Language Processing (EMNLP'07), 2007. [ bib ]

Mark Johnson. Transforming Projective Bilexical Dependency Grammars into Efficiently-Parsable CFGs with Unfold-Fold. In Proceedings of the Association for Computational Linguistics (ACL'07), 2007. [ bib ]

Mark Johnson, Thomas L. Griffiths, and Sharon Goldwater. Bayesian inference for PCFGs via Markov chain Monte Carlo. In Proceedings of the North American Conference on Computational Linguistics (NAACL'07), 2007. [ bib | .pdf ]

Mark Johnson, Thomas L. Griffiths, and Sharon Goldwater. Adaptor Grammars: a Framework for Specifying Compositional Nonparametric Bayesian Models. In Advances in Neural Information Processing Systems 19, 2007. [ bib | .pdf ]


Eugene Charniak, Mark Johnson, Micha Elsner, Joseph Austerweil, David Ellis, Isaac Haxton, Catherine Hill, R. Shrivaths, Jeremy Moore, Michael Pozar, and Theresa Vu. Multilevel Coarse-to-Fine PCFG Parsing. In Proceedings of the Human Language Technology Conference of the NAACL (HLT-NAACL'06), pages 168-175, New York City, USA, June 2006. Association for Computational Linguistics. [ bib | .pdf | slides ]

Sharon Goldwater, Tom Griffiths, and Mark Johnson. Interpolating between types and tokens by estimating power-law generators. In Y. Weiss, B. Schölkopf, and J. Platt, editors, Advances in Neural Information Processing Systems 18, pages 459-466, Cambridge, MA, 2006. MIT Press. [ bib | .pdf ]

Sharon Goldwater, Thomas L. Griffiths, and Mark Johnson. Contextual Dependencies in Unsupervised Word Segmentation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (COLING-ACL'06), pages 673-680, Sydney, Australia, July 2006. Association for Computational Linguistics. [ bib | .pdf ]

William P. Headden III, Eugene Charniak, and Mark Johnson. Learning Phrasal Categories. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pages 301-307, Sydney, Australia, July 2006. Association for Computational Linguistics. [ bib | .pdf ]

Matthew Lease, Mark Johnson, and Eugene Charniak. Recognizing disfluencies in conversational speech. IEEE Transactions on Audio, Speech and Language Processing, 14(5):1566-1573, September 2006. [ bib | .pdf | Abstract ]

Matthew Lease, Eugene Charniak, Mark Johnson, and David McClosky. A Look At Parsing and Its Applications. In Proceedings of the Twenty-First National Conference on Artificial Intelligence (AAAI-06), 16-20 July 2006. [ bib | .pdf ]

Matthew Lease and Mark Johnson. Early Deletion of Fillers In Processing Conversational Speech. In Proceedings of the Human Language Technology Conference of the NAACL (HLT-NAACL'06), Companion Volume: Short Papers, pages 73-76, New York City, USA, June 2006. Association for Computational Linguistics. Version here corrects Table 2 in published version. [ bib | .pdf ]

David McClosky, Eugene Charniak, and Mark Johnson. Reranking and Self-Training for Parser Adaptation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (ACL'06), pages 337-344, Sydney, Australia, July 2006. Association for Computational Linguistics. [ bib | .pdf | .ps ]

David McClosky, Eugene Charniak, and Mark Johnson. Effective Self-Training for Parsing. In Proceedings of the Human Language Technology Conference of the NAACL, Main Conference, pages 152-159, New York City, USA, June 2006. Association for Computational Linguistics. [ bib | .pdf | slides | .ps ]

Brian Roark, Mary Harper, Eugene Charniak, Bonnie Dorr, Mark Johnson, Jeremy G. Kahn, Yang Liu, Mari Ostendorf, John Hale, Anna Krasnyanskaya, Matthew Lease, Izhak Shafran, Matthew Snover, Robin Stewart, and Lisa Yung. SParseval: Evaluation Metrics for Parsing Speech. In Fifth International Conference on Language Resources and Evaluation (LREC'06), Genoa, Italy, 2006. [ bib | .pdf ]


Eugene Charniak and Mark Johnson. Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05), pages 173-180, Ann Arbor, Michigan, June 2005. Association for Computational Linguistics. [ bib | .pdf ]

Sharon Goldwater and Mark Johnson. Representational Bias in Unsupervised Learning of Syllable Structure. In Proceedings of the Ninth Conference on Computational Natural Language Learning (CoNLL-2005), pages 112-119, Ann Arbor, Michigan, June 2005. Association for Computational Linguistics. [ bib | .pdf ]

Jeremy G. Kahn, Matthew Lease, Eugene Charniak, Mark Johnson, and Mari Ostendorf. Effective Use of Prosody in Parsing Conversational Speech. In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (EMNLP'05), pages 233-240, Vancouver, British Columbia, Canada, October 2005. Association for Computational Linguistics. [ bib | .pdf ]

Matthew Lease, Eugene Charniak, and Mark Johnson. Parsing and its applications for conversational speech. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'05), volume 5, pages 961-964, March 18 - March 23 2005. [ bib | .pdf | Abstract ]


Massimiliano Ciaramita and Mark Johnson. Multi-component Word Sense Disambiguation. In Rada Mihalcea and Phil Edmonds, editors, Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pages 97-100, Barcelona, Spain, July 2004. Association for Computational Linguistics. [ bib | .pdf ]

Sharon Goldwater and Mark Johnson. Priors in Bayesian Learning of Phonological Rules. In Proceedings of the Seventh Meeting of the ACL Special Interest Group in Computational Phonology, pages 35-42, Barcelona, Spain, July 2004. Association for Computational Linguistics. [ bib | .pdf ]

Michelle Gregory, Mark Johnson, and Eugene Charniak. Sentence-Internal Prosody Does not Help Parsing the Way Punctuation Does. In Daniel Marcu, Susan Dumais, and Salim Roukos, editors, HLT-NAACL 2004: Main Proceedings, pages 81-88, Boston, Massachusetts, USA, May 2 - May 7 2004. Association for Computational Linguistics. [ bib | .pdf ]

Keith B. Hall and Mark Johnson. Attention Shifting for Parsing Speech. In Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume, pages 40-46, Barcelona, Spain, July 2004. [ bib | .pdf ]

Mark Johnson and Eugene Charniak. A TAG-based noisy-channel model of speech repairs. In Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), pages 33-39, Barcelona, Spain, July 2004. [ bib | .pdf ]

Mark Johnson, Eugene Charniak, and Matthew Lease. An Improved Model For Recognizing Disfluencies in Conversational Speech. In Rich Transcription 2004 Fall Workshop (RT-04F), 2004. [ bib | .pdf ]

Brian Roark, Murat Saraclar, Michael Collins, and Mark Johnson. Discriminative Language Modeling with Conditional Random Fields and the Perceptron Algorithm. In ACL, pages 47-54, 2004. [ bib ]


Yasemin Altun, Mark Johnson, and Thomas Hofmann. Investigating Loss Functions and Optimization Methods for Discriminative Learning of Label Sequences. In Michael Collins and Mark Steedman, editors, Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, pages 145-152, 2003. [ bib | .pdf ]

Massimiliano Ciaramita, Thomas Hofmann, and Mark Johnson. Hierarchical Semantic Classification: Word Sense Disambiguation with World Knowledge. In Georg Gottlob and Toby Walsh, editors, IJCAI-03, Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, Acapulco, Mexico, August 9-15, 2003, pages 817-822. Morgan Kaufmann, 2003. [ bib | .pdf | .ps ]

Massimiliano Ciaramita and Mark Johnson. Supersense Tagging of Unknown Nouns in WordNet. In Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing (EMNLP-03), pages 168-175, 2003. [ bib | .pdf ]

Stuart Geman and Mark Johnson. Probability and statistics in computational linguistics, a brief review. Mathematical foundations of speech and language processing, 138:1-26, 2003. [ bib | .pdf ]

Sharon Goldwater and Mark Johnson. Learning OT Constraint Rankings Using a Maximum Entropy Model. In Proceedings of the Workshop on Variation within Optimality Theory, Stockholm University, 2003. [ bib | .pdf | .ps ]

Keith Hall and Mark Johnson. Language modelling using efficient best-first bottom-up parsing. In Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE ASRU 2003, 2003. [ bib | .pdf ]

Mark Johnson. Learning and Parsing Stochastic Unification-Based Grammars. In Bernhard Schölkopf and Manfred K. Warmuth, editors, Computational Learning Theory and Kernel Machines, 16th Annual Conference on Computational Learning Theory and 7th Kernel Workshop, COLT/Kernel 2003, Washington, DC, USA, August 24-27, 2003, Proceedings, volume 2777 of Lecture Notes in Computer Science, pages 671-683. Springer, 2003. [ bib | .pdf ]


Yasemin Altun, Thomas Hofmann, and Mark Johnson. Discriminative Learning for Label Sequences via Boosting. In Proceedings of Neural Information Processing Systems (NIPS02), 2002. [ bib | .pdf ]

Donald Engel, Eugene Charniak, and Mark Johnson. Parsing and Disfluency Placement. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP 2002), pages 49-54, 2002. [ bib | .pdf ]

Stuart Geman and Mark Johnson. Dynamic programming for parsing and estimation of stochastic unification-based grammars. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL'02), pages 279-286, Morristown, NJ, USA, 2002. Association for Computational Linguistics. [ bib | .pdf ]

Stuart Geman and Mark Johnson. Probabilistic Grammars and their Applications. In N.J. Smelser and P.B. Baltes, editors, International Encyclopedia of the Social & Behavioral Sciences, pages 12075-12082, Pergamon, Oxford, 2002. [ bib | .pdf ]

Mark Johnson. The DOP Estimation Method is Biased and Inconsistent. Computational Linguistics, 28(1):71-76, 2002. [ bib | .pdf ]

Mark Johnson. A Simple Pattern-matching Algorithm for Recovering Empty Nodes and their Antecedents. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), pages 136-143, 2002. [ bib | .pdf | .ps ]

Stefan Riezler, Tracy H. King, Ronald M. Kaplan, Richard Crouch, John T. Maxwell III, and Mark Johnson. Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL-02), pages 271-278, 2002. [ bib | .pdf ]


Yasemin Altun and Mark Johnson. Inducing SFA with Epsilon-Translations Using Minimum Description Length. In Finite State Methods in Natural Language Processing Workshop, ESSLLI 2001, 2001. [ bib | .pdf ]

Don Blaheta and Mark Johnson. Unsupervised learning of multi-word verbs. In Proceedings of the 2001 ACL Workshop on Collocation, 2001. [ bib | .pdf ]

Eugene Charniak and Mark Johnson. Edit Detection and Parsing for Transcribed Speech. In Proceedings of the Second Conference of the North American chapter of the Association for Computational Linguistics (NAACL '01), 2001. [ bib | .pdf ]

Mark Johnson. Joint and Conditional Estimation of Tagging and Parsing Models. In Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics (ACL-01), 2001. [ bib | .pdf ]


Massimiliano Ciaramita and Mark Johnson. Explaining away ambiguity: Learning verb selectional preference with Bayesian networks. In Proceedings of the 18th International Conference on Computational Linguistics, 2000. [ bib | .pdf ]

Mark Johnson and Brian Roark. Compact non-left-recursive grammars using the selective left-corner transform and factoring. In Proceedings of the 18th conference on Computational linguistics (COLING '00), pages 355-361, 2000. [ bib | .pdf ]

Mark Johnson and Stefan Riezler. Exploiting auxiliary distributions in stochastic unification-based grammars. In 1st Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL-00), pages 154-161, 2000. [ bib | .pdf ]

Stefan Riezler, Detlef Prescher, Jonas Kuhn, and Mark Johnson. Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Measures and EM Training. In Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL-00), 2000. [ bib | .pdf ]


Mark Johnson. Type-driven semantic interpretation and Feature dependencies in R-LFG. Semantics and Syntax in Lexical Functional Grammar, pages 359-388, 1999. [ bib | .pdf ]

Mark Johnson. A Resource Sensitive Interpretation of Lexical Functional Grammar. Journal of Logic, Language and Information, 8(1):45-81, 1999. [ bib | .pdf | .ps ]

Mark Johnson, Stuart Geman, Stephen Canon, Zhiyi Chi, and Stefan Riezler. Estimators for Stochastic Unification-Based Grammars. In 37th Annual Meeting of the Association for Computational Linguistics (ACL-99), pages 535-541, 1999. [ bib | .pdf ]

Brian Roark and Mark Johnson. Efficient probabilistic top-down and left-corner parsing. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (ACL '99), pages 421-428, 1999. [ bib | .pdf ]


Eugene Charniak, Sharon Goldwater, and Mark Johnson. Edge-Based Best-First Chart Parsing. In Sixth Workshop on Very Large Corpora, pages 127-133, 1998. [ bib | .pdf ]

Mark Johnson. Proof Nets and the Complexity of Processing Center Embedded Constructions. Journal of Logic, Language and Information, 7(4):433-447, 1998. [ bib | .pdf ]

Mark Johnson. The Effect of Alternative Tree Representations on Tree Bank Grammars. In David M. W. Powers, editor, Proceedings of the Joint Conference on New Methods in Language Processing and Computational Natural Language Learning: (NeMLaP3/CoNLL98), pages 39-48, Somerset, New Jersey, 1998. Association for Computational Linguistics. [ bib | .pdf ]

Mark Johnson. PCFG Models of Linguistic Tree Representations. Computational Linguistics, 24(4):613-632, 1998. [ bib | .pdf | .ps.gz ]

Mark Johnson. Finite-state Approximation of Constraint-based Grammars using Left-corner Grammar Transforms. In COLING-ACL, pages 619-623, 1998. [ bib | .pdf | .ps ]


Mark Johnson. Features as resources in R-LFG. In Proceedings of the 1997 LFG Conference, 1997. [ bib | .ps ]


Mark Johnson. Resource-sensitivity in Lexical-Functional Grammar. Proceedings of the 1996 Roma Workshop, 1996. [ bib ]


Sam Bayer and Mark Johnson. Features and Agreement. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics (ACL-95), pages 70-76, 1995. [ bib | .pdf | Abstract ]

Mark Johnson. Memoization in Top-Down Parsing. Computational Linguistics, 21(3):405-415, 1995. [ bib | .pdf ]

Mark Johnson and Sam Bayer. Features and Agreement in Lambek Categorial Grammar. In Proceedings of the 1995 ESSLLI Formal Grammar Workshop, pages 123-137, 1995. [ bib | .ps.Z ]

Mark Johnson and Jochen Dörre. Memoization of coroutined constraints. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pages 100-107, Morristown, NJ, USA, 1995. Association for Computational Linguistics. [ bib | .pdf ]


Mark Johnson. Computing with Features as Formulae. Computational Linguistics, 20(1):1-25, 1994. [ bib | .pdf ]

