¿Sobre qué tema necesitas documentación??
Métodos eficientes para el procesamiento del lenguaje natural: Una encuesta
Métodos eficientes para el procesamiento del lenguaje natural: Una encuesta Marcos Treviso1∗, Ji-Ung Lee2∗, Tianchu Ji3*, Betty van Aken4, Qingqing Cao5, Manuel R. Ciosici6, Michael Hassid7, Kenneth Heafield8, Sara Hooker9, Colin Raffel10, Pedro H. Martín1,11, andref. t. Martín1,11, Jessica Zosa Forde12, Peter Milder3, Edwin Simpson13, Noam Slonim14, Jesse Dodge15, Emma Strubell15,16, Niranjan Balasubramanian3, León Derczynski5,17, Iryna Gurévych2, Roy Schwartz7 1IST/U. de Lisboa e Instituto de Telecomunicaciones ̧
SOY YO: Métricas de evaluación sólidas a partir de la inferencia del lenguaje natural
SOY YO: Métricas de evaluación sólidas a partir de la inferencia del lenguaje natural Yanran Chen1,2 y Steffen Eger2 1Technische Universit¨at Darmstadt, Alemania 2Grupo de aprendizaje de idiomas naturales (NLLG), https://nl2g.github.io/ Facultad de Tecnología, Universidad de Bielefeld, Alemania yanran.chen@stud.tu-darmstadt.de, steffen.eger@uni-bielefeld.de Resumen Las métricas de evaluación basadas en BERT propuestas recientemente para la generación de texto funcionan bien en los puntos de referencia estándar, pero son vulnerables a la publicidad.- ataques versátiles, p.ej., relativo a la exactitud de la información. Sostenemos que esto se debe (en parte) de
Parafraseo de modismos chinos
Modismo chino parafraseando a Jipeng Qiang1∗ Yang Li1 Chaowei Zhang1 Yun Li1 Yi Zhu1 Yunhao Yuan1 Xindong Wu2,3 1 Universidad de Yangzhou, Porcelana, 2 Universidad Tecnológica de Hefei, Porcelana, 3 Laboratorio de Zhejiang, Porcelana {jpqiang,cwzhang,El león,zhuyi,yhyuan}@yzu.edu.cn, xwu@hfut.edu.cn Resumen Los modismos son un tipo de expresión idiomática en chino., la mayoría de los cuales constan de cuatro caracteres chinos. Debido a las propiedades de la no composicionalidad y la media metafórica.- En g, Los modismos chinos son difíciles
Aprendizaje automático gradual supervisado para términos de aspecto
Aprendizaje automático gradual supervisado para el análisis de sentimiento a término Yanyan Wang†‡ Qun Chen∗†‡ Murtadha H.M. Ahmed‡ Zhaoqiang Chen‡ Jing Su‡ Wei Pan‡ Zhanhuai Li‡ Escuela de Ciencias de la Computación, Universidad Politécnica del Noroeste, Xi'an, China ‡Laboratorio clave de almacenamiento y gestión de Big Data, Universidad Politécnica del Noroeste, Ministerio de Industria y Tecnología de la Información, Xi'an, Porcelana {wangyanyan@mail., chenbenben@, murtadha@mail., chenzhaoqiang@mail., sujing@correo., panwei1002@, lizhh@}nwpu.edu.cn Resumen Un trabajo reciente ha demostrado que Aspect-Term
On Graph-based Reentrancy-free Semantic Parsing
On Graph-based Reentrancy-free Semantic Parsing Alban Petit and Caio Corro Universite Paris-Saclay, CNRS, LISN, 91400, Orsay, Francia {alban.petit,caio.corro}@lisn.upsaclay.fr Abstract We propose a novel graph-based approach for semantic parsing that resolves two prob- lems observed in the literature: (1) seq2seq models fail on compositional generalization tasks; (2) previous work using phrase structure parsers cannot cover all the semantic parses observed in treebanks. We prove that both
OpenFact: Factuality Enhanced Open Knowledge Extraction
OpenFact: Factuality Enhanced Open Knowledge Extraction Linfeng Song∗, Ante Wang∗,†, Xiaoman Pan, Hongming Zhang, Dian Yu, Lifeng Jin, Haitao Mi, Jinsong Su†, Yue Zhang‡ and Dong Yu Tencent AI Lab, Bellevue, Washington, USA †School of Informatics, Xiamen University, China ‡School of Engineering, Westlake University, China lfsong@global.tencent.com Abstract We focus on the factuality property during the extraction of an OpenIE corpus named OpenFact, which contains more
FRMT: Un punto de referencia para la traducción automática con reconocimiento de regiones en pocas oportunidades
FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation Parker Riley∗, Timothy Dozat∗, Jan A. Botha∗, Xavier Garcia∗, Dan Garrett, Jason Riesa, Orhan Firat, Noah Constant Google Research, EE.UU {prkriley,tdozat,jabot,xgarcia,dhgarrette, riesa,orhanf,nconstant}@google.com Abstract We present FRMT, a new dataset and evalu- ation benchmark for Few-shot Region-aware Machine Translation, a type of style-targeted translation. The dataset consists of professional translations from English into two regional variants each of
Razonamiento visual espacial
Visual Spatial Reasoning Fangyu Liu Guy Emerson Nigel Collier University of Cambridge, Reino Unido {fl399, gete2, nhc30}@cam.ac.uk Abstract Spatial relations are a basic part of human cog- nition. Sin embargo, they are expressed in natural language in a variety of ways, and previous work has suggested that current vision-and- language models (VLMs) struggle to capture relational information. en este documento, we pres- ent Visual Spatial Reasoning (VSR),
Transparency Helps Reveal When Language Models Learn Meaning
Transparency Helps Reveal When Language Models Learn Meaning Zhaofeng Wu ∗ William Merrill Hao Peng Iz Beltagy Noah A. Smith MIT New York University Allen Institute for Artificial Intelligence Paul G. Allen School of Computer Science & Ingeniería, University of Washington zfw@csail.mit.edu willm@nyu.edu {haop,beltagy,noah}@allenai.org Abstract Many current NLP systems are built from language models trained to optimize unsu- pervised objectives on large amounts of raw
Comprender y detectar alucinaciones en
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection Weijia Xu Microsoft Research, Redmond, USA weijiaxu@microsoft.com Sweta Agrawal University of Maryland, USA sweagraw@cs.umd.edu Eleftheria Briakou University of Maryland, USA ebriakou@cs.umd.edu Marianna J. Martindale University of Maryland, USA mmartind@umd.edu Marine Carpuat University of Maryland, USA marine@cs.umd.edu Abstract Neural sequence generation models are known to ‘‘hallucinate’’, by producing outputs that are unrelated to the source
La compensación del paralelismo: Limitaciones de los transformadores de precisión logarítmica
La compensación del paralelismo: Limitations of Log-Precision Transformers William Merrill Center for Data Science New York University, Nueva York, Nueva York, USA willm@nyu.edu Ashish Sabharwal Allen Institute for AI Seattle, Washington, USA ashishs@allenai.org Abstract Despite their omnipresence in modern NLP, characterizing the computational power of transformer neural nets remains an interest- pregunta abierta. We prove that transformers whose arithmetic precision is logarithmic in the number of
Borrado de atributos no alineados de representaciones neuronales
Erasure of Unaligned Attributes from Neural Representations Shun Shao∗ Yftah Ziser∗ Shay B. Cohen Institute for Language, Cognition and Computation School of Informatics, University of Edinburgh 10 Crichton Street, Edimburgo, EH8 9AB, UK s.shao-11@inf.ed.ac.uk yftah.ziser@inf.ed.ac.uk scohen@inf.ed.ac.uk Abstract the Assignment-Maximization We present Spectral Attribute removaL (AMSAL) algo- ritmo, which erases information from neural representations when the information to be erased is implicit rather than directly being
Unleashing the True Potential of Sequence-to-Sequence Models
Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing Han He Department of Computer Science Emory University Atlanta, Georgia 30322 USA han.he@emory.edu Jinho D. Choi Department of Computer Science Emory University Atlanta, Georgia 30322 USA jinho.choi@emory.edu Abstract Sequence-to-Sequence (S2S) models have achieved remarkable success on various text generation tasks. Sin embargo, learning complex structures with S2S models remains challeng- ing as external
Tracking Brand-Associated Polarity-Bearing Topics in User Reviews
Tracking Brand-Associated Polarity-Bearing Topics in User Reviews Runcong Zhao1,2, Lin Gui1, Hanqi Yan2, Yulan He1,2,3 1King’s College London, Reino Unido, 2University of Warwick, Reino Unido, 3The Alan Turing Institute, United Kingdom runcong.zhao@warwick.ac.uk, yulan.he@kcl.ac.uk Abstract Monitoring online customer reviews is im- portant for business organizations to measure customer satisfaction and better manage their reputations. en este documento, we propose a novel dynamic Brand-Topic Model (dBTM) cual
Naturalistic Causal Probing for Morpho-Syntax
Naturalistic Causal Probing for Morpho-Syntax Afra Amini1,2 Tiago Pimentel3 Clara Meister1 Ryan Cotterell1,2 1ETH Z¨urich, Switzerland 2ETH AI Center, Suiza 3Universidad de Cambridge, UK afra.amini@inf.ethz.ch tp472@cam.ac.uk ryan.cotterell@inf.ethz.ch clara.meister@inf.ethz.ch Abstract Probing has become a go-to methodology for interpreting and analyzing deep neural mod- els in natural language processing. Sin embargo, there is still a lack of understanding of the limitations and weaknesses of various types of probes.
Visual Writing Prompts:
Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences Xudong Hong1,2,4, Asad Sayeed3, Khushboo Mehra2,4, Vera Demberg2,4 and Bernt Schiele1,4 1Dept. of Computer Vision and Machine Learning, MPI Informatics, Germany 2Dept. of Language Science and Technology and Dept. of Computer Science, Saarland University, Germany 3Dept. of Philosophy, Lingüística, and Theory of Science, University of Gothenburg, Sweden 4Saarland Informatics Campus, Sarrebruck, Alemania {xhong,kmehra,vera}@lst.uni-saarland.de schiele@mpi-inf.mpg.de, asad.sayeed@gu.se
Hate Speech Classifiers Learn Normative Social Stereotypes
Hate Speech Classifiers Learn Normative Social Stereotypes Aida Mostafazadeh Davani, Mohammad Atari, Brendan Kennedy, Morteza Dehghani University of Southern California, EE.UU {mostafaz,atari,btkenned,mdehghan}@usc.edu Abstract Social stereotypes negatively impact individ- uals’ judgments about different groups and may have a critical role in understanding lan- guage directed toward marginalized groups. Aquí, we assess the role of social stereotypes in the automated detection of hate speech in the English
On the Robustness of Dialogue History Representation in
On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method Zorik GekhmanT∗ Nadav OvedT ∗ Orgad KellerG Idan SzpektorG Roi ReichartT T Technion – Israel Institute of Technology, Israel GGoogle Research, Israel {zorik@campus.|nadavo@campus.|roiri@}technion.ac.il {orgad|szpektor}@google.com Abstract Most work on modeling the conversation his- tory in Conversational Question Answering (CQA) reports a single main result on a com-