Lexically Aware Semi-Supervised Learning for OCR Post-Correction Shruti Rijhwani1, Daisy Rosenblum2, Antonios Anastasopoulos3, Graham Neubig1 1Language Technologies Institute, Carnegie Mellon University, USA 2University of British Columbia, Canada 3Department of Computer Science, George Mason University, USA…
Suchkategorieangehen
Structured Self-Supervised Pretraining for Commonsense Knowledge
Structured Self-Supervised Pretraining for Commonsense Knowledge Graph Completion Jiayuan Huang•∗, Yangkai Du•∗, Shuting Tao•∗, Kun Xu(cid:3), Pengtao Xie(cid:2)† •Zhejiang University, China, (cid:3)Tencent AI Lab, USA, (cid:2)UC San Diego, USA p1xie@eng.ucsd.edu Abstract To develop commonsense-grounded NLP…
Quantifying Social Biases in NLP:
Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics Paula Czarnowska♠ University of Cambridge, UK pjc211@cam.ac.uk Yogarshi Vyas Amazon AI, USA yogarshi@amazon.com Kashif Shah Amazon AI, USA shahkas@amazon.com Abstract Measuring…
On the Difficulty of Translating Free-Order Case-Marking Languages
On the Difficulty of Translating Free-Order Case-Marking Languages Arianna Bisazza Ahmet ¨Ust ¨un Center for Language and Cognition University of Groningen, Die Niederlande {a.bisazza, a.ustun}@rug.nl, research@spor.tel Stephan Sportel Abstract Identifying factors that make certain languages…
Controllable Summarization with Constrained Markov Decision Process
Controllable Summarization with Constrained Markov Decision Process Hou Pong Chan1, Lu Wang2, and Irwin King3 1University of Macau, Macau SAR, China 2University of Michigan, Ann Arbor, MI, USA 3The Chinese University of Hong Kong, Hong…
Memory-Based Semantic Parsing
Memory-Based Semantic Parsing Parag Jain and Mirella Lapata Institute for Language, Cognition and Computation School of Informatics, University of Edinburgh 10 Crichton Street, Edinburgh EH8 9AB, UK parag.jain@ed.ac.uk mlap@inf.ed.ac.uk Abstract We present a memory-based model…
Identity-Based Patterns in Deep Convolutional Networks: Generative
Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication Gaˇsper Beguˇs University of California, Berkeley, USA begus@berkeley.edu Abstract This paper models unsupervised learning of an identity-based pattern (or copying) in speech called reduplication…
What Helps Transformers Recognize Conversational Structure?
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition Piotr ˙Zelasko†‡, Raghavendra Pappagari†‡, Najim Dehak†‡ †Center of Language and Speech Processing, ‡Human Language Technology Center of Excellence, Johns…
PARSINLU: A Suite of Language Understanding Challenges for Persian
PARSINLU: A Suite of Language Understanding Challenges for Persian Daniel Khashabi1 Arman Cohan1 Siamak Shakeri2 Pedram Hosseini3 Pouya Pezeshkpour4 Malihe Alikhani5 Moin Aminnaseri6 Marzieh Bitaab7 Faeze Brahman8 Sarik Ghazarian9 Mozhdeh Gheini9 Arman Kabiri10 Rabeeh Karimi…
A Statistical Analysis of Summarization Evaluation Metrics Using
A Statistical Analysis of Summarization Evaluation Metrics Using Resampling Methods Daniel Deutsch, Rotem Dror, and Dan Roth Department of Computer and Information Science University of Pennsylvania, USA {ddeutsch,rtmdrr,danroth}@seas.upenn.edu Abstract The quality of a summarization evaluation…