MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages Xinyu Zhang1∗, Nandan Thakur1∗, Odunayo Ogundepo1, Ehsan Kamalloo1†, David Alfonso-Hermelo2, Xiaoguang Li3, Qun Liu3, Mehdi Rezagholizadeh2, Jimmy Lin1 1David R. Cheriton School of Computer Science, University…
Browsing Categorytackle
Exploring Contrast Consistency of Open-Domain Question Answering
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions Zhihan Zhang, Wenhao Yu, Zheng Ning, Mingxuan Ju, Meng Jiang University of Notre Dame, Notre Dame, IN, USA {zzhang23, wyu1, zning, mju2, mjiang2}@nd.edu…
Cross-functional Analysis of Generalization in Behavioral Learning
Cross-functional Analysis of Generalization in Behavioral Learning Pedro Henrique Luz de Araujo1,2 and Benjamin Roth1,3 1Faculty of Computer Science, University of Vienna, Vienna, Austria 2UniVie Doctoral School Computer Science, Vienna, Austria 3Faculty of Philological and…
A Cross-Linguistic Pressure for
A Cross-Linguistic Pressure for Uniform Information Density in Word Order Thomas Hikaru Clark1 Clara Meister2 Tiago Pimentel3 Michael Hahn4 Ryan Cotterell2 Richard Futrell5 Roger Levy1 1MIT, USA 2ETH Z¨urich, Switzerland 3University of Cambridge, UK 4Saarland…
Time-and-Space-Efficient Weighted Deduction
Time-and-Space-Efficient Weighted Deduction Jason Eisner Department of Computer Science Johns Hopkins University jason@cs.jhu.edu Abstract Many NLP algorithms have been described in terms of deduction systems. Unweighted deduction allows a generic forward-chaining execution strategy. For weighted…
Communication Drives the Emergence of Language Universals in
Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off Yuchen Lian(cid:2) † Arianna Bisazza‡∗ (cid:2)Faculty of Electronic and Information Engineering, Xi’an Jiaotong University, China †Leiden Institute of Advanced Computer…
Design Choices for Crowdsourcing Implicit Discourse Relations:
Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design Valentina Pyatkin1 Frances Yung2 Merel C. J. Scholman2,3 Reut Tsarfaty1 Ido Dagan1 Vera Demberg2 1Bar Ilan University, Ramat Gan, Israel 2Saarland…
Collective Human Opinions in Semantic Textual Similarity
Collective Human Opinions in Semantic Textual Similarity Yuxia Wang♠ Shimin Tao♣ Timothy Baldwin♠♥ Ning Xie♣ Karin Verspoor♠♦ Hao Yang♣ ♠ The University of Melbourne, Melbourne, Victoria, Australia ♣ Huawei TSC, Beijing, China ♥ MBZUAI, Abu…
Conditional Generation with a Question-Answering Blueprint
Conditional Generation with a Question-Answering Blueprint Shashi Narayan1, Joshua Maynez1, Reinald Kim Amplayo1, Kuzman Ganchev1, Annie Louis2, Fantine Huot1, Anders Sandholm2, Dipanjan Das1, Mirella Lapata1 1Google DeepMind, UK 2Google Research shashinarayan@google.com, joshuahm@google.com, reinald@google.com, kuzman@google.com, annielouis@google.com,…
Directed Acyclic Transformer Pre-training for High-quality
Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text Generation Fei Huang Pei Ke Minlie Huang∗ The CoAI group, Tsinghua University, Beijing, China Institute for Artificial Intelligence, State Key Lab of Intelligent Technology and Systems, Beijing…