Documentazione - Ricerca sull'intelligenza artificiale specializzata al MIT

What topic do you need documentation on?

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 197–210, 2018. Redattore di azioni: Hinrich Sch¨utze.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 197–210, 2018. Redattore di azioni: Hinrich Sch¨utze. Lotto di invio: 6/2017; Lotto di revisione: 9/2017; Pubblicato 4/2018. 2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. C (cid:13) KnowledgeCompletionforGenericsusingGuidedTensorFactorizationHanieSedghi∗GoogleBrainMountainView,CA,U.S.A.hsedghi@google.comAshishSabharwalAllenInstituteforArtiﬁcialIntelligence(AI2)Seattle,WA,U.S.A.AshishS@allenai.orgAbstractGivenaknowledgebaseorKBcontaining(noisy)factsaboutcommonnounsorgener-ics,suchas“alltreesproduceoxygen”or“someanimalsliveinforests”,weconsidertheproblemofinferringadditionalsuchfactsataprecisionsimilartothatofthestartingKB.SuchKBscapturegeneralknowledgeabouttheworld,andarecrucialforvariousappli-cationssuchasquestionanswering.Differ-entfromcommonlystudiednamedentityKBssuchasFreebase,genericsKBsinvolvequan-tiﬁcation,havemorecomplexunderlyingreg-ularities,tendtobemoreincomplete,andvio-latethecommonlyusedlocallyclosedworldassumption(LCWA).WeshowthatexistingKBcompletionmethodsstrugglewiththisnewtask,andpresenttheﬁrstapproachthatissuccessful.Ourresultsdemonstratethatex-ternalinformation,suchasrelationschemasandentitytaxonomies,ifusedappropriately,canbeasurprisinglypowerfultoolinthisset-ting.First,oursimpleyeteffectiveknowledgeguidedtensorfactorizationapproachachievesstate-of-the-artresultsontwogenericsKBs(80%precise)forscience,doublingtheirsizeat74%-86%precision.Second,ournoveltax-onomyguided,submodular,activelearningmethodforcollectingannotationsaboutrareentities(e.g.,oriole,abird)is6xmoreeffec-tiveatinferringfurthernewfactsaboutthemthanmultipleactivelearningbaselines.1IntroductionWeconsidertheproblemofcompletingapartialknowledgebase(KB)containingfactsaboutgener-∗ThisworkwasdonewhiletheauthorwasafﬁliatedwiththeAllenInstituteforArtiﬁcialIntelligence.icsorcommonnouns,representedasathird-ordertensorof(source,relation,target)triples,suchas(butterﬂy,pollinate,ﬂower)E(thermometer,mea-sure,temperature).Suchfactscapturecommonknowledgethathumanshaveabouttheworld.Theyarearguablyessentialforintelligentagentswithhuman-likeconversationalabilitiesaswellasforspeciﬁcapplicationssuchasquestionanswering.Wedemonstratethatstate-of-the-artKBcompletionmethodsperformpoorlywhenfacedwithgener-ics,whileourstrategiesforincorporatingexternalknowledgeaswellasobtainingadditionalannota-tionsforrareentitiesprovidetheﬁrstsuccessfulso-lutiontothischallengingnewtask.Sincegenericsrepresentclassesofsimilarindi-viduals,thetruthvalueyiofagenericstriplexi=(S,R,T)dependsonthequantiﬁcationsemanticsoneassociateswithsandt.Indeed,thesemanticsofgenericsstatementscanbeambiguous,evenself-contradictory,duetoculturalnorms.AsLeslie(2008)pointsout,‘duckslayeggs’isgenerallycon-sideredtruewhile‘ducksarefemale’,whichistrueforabroadersetofducksthantheformerstatement,isgenerallyconsideredfalse.Toavoiddeepphilosophicalissues,weﬁxapar-ticularmathematicalsemanticsthatisespeciallyrel-evantfornoisyfactsderivedautomaticallyfromtext:associateswithacategoricalquantiﬁcationfrom{Tutto,some,none}andassociatet(implicitly)withsome.Forinstance,“allbutterﬂiespollinate(some)ﬂower”and“someanimalslivein(some)forest”.Whenpresentingsuchtriplestohumans,theyarephrasedas:isittruethatallbutterﬂiespollinatesomeﬂower?Asanotationalshortcut,wetreatthequantiﬁcationofsasthecategoricallabelyiforthetriplexi.Forexample,(butterﬂy,pollinate,ﬂower) l D o w n o a d e d f r o m h t t p : / / d i r e c t

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 159–172, 2018. Redattore di azioni: Luke Zettlemoyer.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 159–172, 2018. Redattore di azioni: Luke Zettlemoyer. Lotto di invio: 10/2017; Lotto di revisione: 12/2017; Pubblicato 3/2018. 2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. C (cid:13) MappingtoDeclarativeKnowledgeforWordProblemSolvingSubhroRoy∗MassachusettsInstituteofTechnologysubhro@csail.mit.eduDanRoth∗UniversityofPennsylvaniadanroth@seas.upenn.eduAbstractMathwordproblemsformanaturalabstrac-tiontoarangeofquantitativereasoningprob-lems,suchasunderstandingﬁnancialnews,sportsresults,andcasualtiesofwar.Solvingsuchproblemsrequirestheunderstandingofseveralmathematicalconceptssuchasdimen-sionalanalysis,subsetrelationships,etc.Inthispaper,wedevelopdeclarativeruleswhichgovernthetranslationofnaturallanguagede-scriptionoftheseconceptstomathexpres-sions.Wethenpresentaframeworkforin-corporatingsuchdeclarativeknowledgeintowordproblemsolving.Ourmethodlearnstomaparithmeticwordproblemtexttomathex-pressions,bylearningtoselecttherelevantdeclarativeknowledgeforeachoperationofthesolutionexpression.Thisprovidesawaytohandlemultipleconceptsinthesameprob-lemwhile,atthesametime,supportingin-terpretabilityoftheanswerexpression.Ourmethodmodelsthemappingtodeclarativeknowledgeasalatentvariable,thusremov-ingtheneedforexpensiveannotations.Exper-imentalevaluationsuggeststhatourdomainknowledgebasedsolveroutperformsallothersystems,andthatitgeneralizesbetterintherealisticcasewherethetrainingdataitisex-posedtoisbiasedinadifferentwaythanthetestdata.1IntroductionManynaturallanguageunderstandingsituationsre-quirereasoningwithrespecttonumbersorquanti-∗MostoftheworkwasdonewhentheauthorswereattheUniversityofIllinois,UrbanaChampaign.ties–understandingﬁnancialnews,sportsresults,orthenumberofcasualtiesinabombing.Mathwordproblemsformanaturalabstractiontoalotofthesequantitativereasoningproblems.Conse-quently,therehasbeenagrowinginterestindevel-opingautomatedmethodstosolvemathwordprob-lems(Kushmanetal.,2014;Hosseinietal.,2014;RoyandRoth,2015).ArithmeticWordProblemMrs.Hiltbakedpieslastweekendforaholidaydin-ner.Shebaked16pecanpiesand14applepies.Ifshewantstoarrangeallofthepiesinrowsof5pieseach,howmanyrowswillshehave?Solution(16+14)/5=6MathConceptneededforEachOperationFigure1:Anexamplearithmeticwordproblemanditssolution,alongwiththeconceptsrequiredtogenerateeachoperationofthesolutionUnderstandingandsolvingmathwordproblemsinvolvesinterpretingthenaturallanguagedescrip-tionofmathematicalconcepts,aswellasunder-standingtheirinteractionwiththephysicalworld.ConsidertheelementaryschoollevelarithmeticwordproblemshowninFig1.Tosolvetheprob-lem,oneneedstounderstandthat“applepies”and“pecanpies”arekindsof“pies”,andhence,the l D o w n o a d e d f r o m h t t p : / / d i r e c t

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 133–144, 2018. Redattore di azioni: Stefan Riezler.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 133–144, 2018. Redattore di azioni: Stefan Riezler. Lotto di invio: 6/2017; Lotto di revisione: 9/2017; Pubblicato 2/2018. 2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. C (cid:13) LearningRepresentationsSpecializedinSpatialKnowledge:LeveragingLanguageandVisionGuillemCollellDepartmentofComputerScienceKULeuven3001Heverlee,Belgiumgcollell@kuleuven.beMarie-FrancineMoensDepartmentofComputerScienceKULeuven3001Heverlee,Belgiumsien.moens@cs.kuleuven.beAbstractSpatialunderstandingiscrucialinmanyreal-worldproblems,yetlittleprogresshasbeenmadetowardsbuildingrepresentationsthatcapturespatialknowledge.Here,wemoveonestepforwardinthisdirectionandlearnsuchrepresentationsbyleveragingataskconsistinginpredictingcontinuous2Dspa-tialarrangementsofobjectsgivenobject-relationship-objectinstances(e.g.,“catunderchair”)andasimpleneuralnetworkmodelthatlearnsthetaskfromannotatedimages.Weshowthatthemodelsucceedsinthistaskand,furthermore,thatitiscapableofpredictingcorrectspatialarrangementsforunseenob-jectsifeitherCNNfeaturesorwordembed-dingsoftheobjectsareprovided.Thediffer-encesbetweenvisualandlinguisticfeaturesarediscussed.Next,toevaluatethespatialrepresentationslearnedintheprevioustask,weintroduceataskandadatasetconsistinginasetofcrowdsourcedhumanratingsofspatialsimilarityforobjectpairs.WeﬁndthatbothCNN(convolutionalneuralnetwork)featuresandwordembeddingspredicthumanjudgmentsofsimilaritywellandthatthesevectorscanbefurtherspecializedinspatialknowledgeifweupdatethemwhentrainingthemodelthatpredictsspatialarrangementsofobjects.Overall,thispaperpavesthewaytowardsbuildingdistributedspatialrepresen-tations,contributingtotheunderstandingofspatialexpressionsinlanguage.1IntroductionRepresentingspatialknowledgeisinstrumentalinanytaskinvolvingtext-to-sceneconversionsuchasrobotunderstandingofnaturallanguagecommands(Guadarramaetal.,2013;MoratzandTenbrink,2006)oranumberofrobotnavigationtasks.Despiterecentadvancesinbuildingspecializedrepresenta-tionsindomainssuchassentimentanalysis(Tangetal.,2014),semanticsimilarity/relatedness(Kielaetal.,2015)ordependencyparsing(Bansaletal.,2014),littleprogresshasbeenmadetowardsbuild-ingdistributedrepresentations(a.k.a.embeddings)specializedinspatialknowledge.Intuitively,onemayreasonablyexpectthatthemoreattributestwoobjectsshare(e.g.,size,func-tionality,eccetera.),themorelikelytheyaretoexhibitsimilarspatialarrangementswithrespecttootherobjects.Leveragingthisintuition,weforeseethatvisualandlinguisticrepresentationscanbespatiallyinformativeaboutunseenobjectsastheyencodefeatures/attributesofobjects(CollellandMoens,2016).Forinstance,withouthavingeverseenan“elephant”before,butonlya“horse”,onewouldprobablydevisethe“elephant”carryingthe“hu-man”thanotherwise,justbyconsideringtheirsizeattribute.Similarly,onecaninferthata“tablet”anda“book”willshowsimilarspatialpatterns(usuallyonatable,insomeone’shands,eccetera.)althoughtheybarelyshowanyvisualresemblance—yettheyaresimilarinsizeandfunctionality.Inthispaperwesystematicallystudyhowinformativevisualandlin-guisticfeatures—intheformofconvolutionalneuralnetwork(CNN)featuresandwordembeddings—areaboutthespatialbehaviorofobjects.Animportantgoalofthisworkistolearndis-tributedrepresentationsspecializedinspatialknowl-edge.Asavehicletolearnspatialrepresentations, l D o w n o a d e d f r o m h t t p : / / d i r e c t

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 121–132, 2018. Redattore di azioni: Ani Nenkova.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 121–132, 2018. Redattore di azioni: Ani Nenkova. Lotto di invio: 11/2016; Lotto di revisione: 3/2017; Pubblicato 2/2018. 2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. C (cid:13) ConversationModelingonRedditUsingaGraph-StructuredLSTMVictoriaZayatsElectricalEngineeringDepartmentUniversityofWashingtonvzayats@uw.eduMariOstendorfElectricalEngineeringDepartmentUniversityofWashingtonostendor@uw.eduAbstractThispaperpresentsanovelapproachformod-elingthreadeddiscussionsonsocialmediausingagraph-structuredbidirectionalLSTM(long-shorttermmemory)whichrepresentsbothhierarchicalandtemporalconversationstructure.InexperimentswithataskofpredictingpopularityofcommentsinRedditdiscussions,theproposedmodeloutperformsanode-independentarchitecturefordifferentsetsofinputfeatures.Analysesshowabene-ﬁttothemodeloverthefullcourseofthedis-cussion,improvingdetectioninbothearlyandlatestages.Further,theuseoflanguagecueswiththebidirectionaltreestateupdateshelpswithidentifyingcontroversialcomments.1IntroductionSocialmediaprovidesaconvenientandwidelyusedplatformfordiscussionsamongusers.Whenthecomment-responselinksarepreserved,thosecon-versationscanberepresentedinatreestructurewherecommentsrepresentnodes,therootistheoriginalpost,andeachnewreplytoapreviouscom-mentisaddedasachildofthatcomment.Someexamplesofpopularserviceswithtree-likestruc-turesincludeFacebook,Reddit,Quora,andStack-Exchange.Figure1showsanexampleconversa-tiononReddit,wherebiggernodesindicatehigherupvotingofacomment.1InserviceslikeTwitter,1Thetoolhttps://whichlight.github.io/reddit-network-viswasusedtoobtainthisvisualiza-tion.Figure1:VisualizationofasamplethreadonReddit.tweetsandtheirretweetscanalsobeviewedasform-ingatreestructure.Whentimestampsareavail-ablewithacontribution,thenodesofthetreecanbeorderedandannotatedwiththatinformation.Thetreestructureisusefulforseeinghowadiscussionunfoldsintodifferentsubtopicsandshowingdiffer-encesinthelevelofactivityindifferentbranchesofthediscussion.Predictingpopularityofcommentsinsocialme-diaisataskofgrowinginterest.Popularityhasbeendeﬁnedintermsofthevolumeofthere-sponse,butwhenthesocialmediaplatformhasamechanismforreaderstolikeordislikecom-ments(O,upvote/downvote),thenthedifferenceinpositive/negativevotesprovidesamoreinformativescoreforpopularityprediction.Thisdeﬁnitionof l D o w n o a d e d f r o m h t t p : / / d i r e c t

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 107–119, 2018. Redattore di azioni: Ivan Titov.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 107–119, 2018. Redattore di azioni: Ivan Titov. Lotto di invio: 6/2017; Lotto di revisione: 9/2017; Pubblicato 2/2018. C(cid:13)2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. EvaluatingtheStabilityofEmbedding-basedWordSimilaritiesMariaAntoniakCornellUniversitymaa343@cornell.eduDavidMimnoCornellUniversitymimno@cornell.eduAbstractWordembeddingsareincreasinglybeingusedasatooltostudywordassociationsinspeciﬁccorpora.However,itisunclearwhethersuchembeddingsreﬂectenduringpropertiesoflan-guageoriftheyaresensitivetoinconsequentialvariationsinthesourcedocuments.Weﬁndthatnearest-neighbordistancesarehighlysen-sitivetosmallchangesinthetrainingcorpusforavarietyofalgorithms.Forallmethods,includingspeciﬁcdocumentsinthetrainingsetcanresultinsubstantialvariations.Weshowthattheseeffectsaremoreprominentforsmallertrainingcorpora.Werecommendthatusersneverrelyonsingleembeddingmodelsfordistancecalculations,butratheraverageovermultiplebootstrapsamples,especiallyforsmallcorpora.1IntroductionWordembeddingsareapopulartechniqueinnaturallanguageprocessing(PNL)inwhichthewordsinavocabularyaremappedtolow-dimensionalvectors.Embeddingmodelsareeasilytrained—severalimple-mentationsarepubliclyavailable—andrelationshipsbetweentheembeddingvectors,oftenmeasuredviacosinesimilarity,canbeusedtoreveallatentseman-ticrelationshipsbetweenpairsofwords.Wordem-beddingsareincreasinglybeingusedbyresearchersinunexpectedwaysandhavebecomepopularinﬁeldssuchasdigitalhumanitiesandcomputationalsocialscience(Hamiltonetal.,2016;Heuser,2016;Phillipsetal.,2017).Embedding-basedanalysesofsemanticsimilaritycanbearobustandvaluabletool,butweﬁndthatstandardmethodsdramaticallyunder-representthevariabilityofthesemeasurements.Embeddingalgo-rithmsaremuchmoresensitivethantheyappeartofactorssuchasthepresenceofspeciﬁcdocuments,thesizeofthedocuments,thesizeofthecorpus,andevenseedsforrandomnumbergenerators.Ifusersdonotaccountforthisvariability,theirconclusionsarelikelytobeinvalid.Fortunately,wealsoﬁndthatsimplyaveragingovermultiplebootstrapsamplesissufﬁcienttoproducestable,reliableresultsinallcasestested.NLPresearchinwordembeddingshassofarfo-cusedonadownstream-centeredusecase,wheretheendgoalisnottheembeddingsthemselvesbutperformanceonamorecomplicatedtask.Forexam-ple,wordembeddingsareoftenusedasthebottomlayerinneuralnetworkarchitecturesforNLP(Ben-gioetal.,2003;Goldberg,2017).Theembeddings’trainingcorpus,whichisselectedtobeaslargeaspossible,isonlyofinterestinsofarasitgeneralizestothedownstreamtrainingcorpus.Incontrast,otherresearcherstakeacorpus-centeredapproachanduserelationshipsbetweenem-beddingsasdirectevidenceaboutthelanguageandcultureoftheauthorsofatrainingcorpus(Bolukbasietal.,2016;Hamiltonetal.,2016;Heuser,2016).Embeddingsareusedasiftheyweresimulationsofasurveyaskingsubjectstofree-associatewordsfromqueryterms.Unlikethedownstream-centeredapproach,thecorpus-centeredapproachisbasedondirecthumananalysisofnearestneighborstoembed-dingvectors,andthetrainingcorpusisnotsimplyanoff-the-shelfconveniencebutratherthecentralobjectofstudy. l D o w n o a d e d f r o m h t t p : / / d i r e c t . M

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 91–106, 2018. Redattore di azioni: Alexander Clark.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 91–106, 2018. Redattore di azioni: Alexander Clark. Lotto di invio: 7/2017; Lotto di revisione: 10/2017; Pubblicato 2/2018. 2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. C (cid:13) TowardsEvaluatingNarrativeQualityInStudentWritingSwapnaSomasundaran1,MichaelFlor1,MartinChodorow2HillaryMolloy3BinodGyawali1LauraMcCulla11EducationalTestingService,660RosedaleRoad,Princeton,NJ08541,USA2HunterCollegeandtheGraduateCenter,CUNY,NewYork,NY10065,USA3EducationalTestingService,90NewMontgomeryStreet,SanFrancisco,CA94105,USA{ssomasundaran,mﬂor,hmolloy,bgyawali,LMcCulla}@ets.orgmartin.chodorow@hunter.cuny.eduAbstractThisworklaysthefoundationforautomatedassessmentsofnarrativequalityinstudentwriting.Weﬁrstmanuallyscoreessaysfornarrative-relevanttraitsandsub-traits,andmeasureinter-annotatoragreement.Wethenexplorelinguisticfeaturesthatareindicativeofgoodnarrativewritingandusethemtobuildanautomatedscoringsystem.Experimentsshowthatourfeaturesaremoreeffectiveinscoringspeciﬁcaspectsofnarrativequalitythanastate-of-the-artfeatureset.1IntroductionNarrative,whichincludespersonalexperiencesandstories,realorimagined,isamediumofexpressionthatisusedfromtheveryearlystagesofachild’slife.Narrativesarealsoemployedinvariouscapac-itiesinschoolinstructionandassessment.Forex-ample,theCommonCoreStateStandards,aned-ucationalinitiativeintheUnitedStatesthatdetailsrequirementsforstudentknowledgeingradesK-12,employsliterature/narrativesasoneofitsthreelanguageartsgenres.Withtheincreasedfocusonautomatedevaluationofstudentwritingineduca-tionalsettings(Adams,2014),automatedmethodsforevaluatingnarrativeessaysatscalearebecomingincreasinglyimportant.Automatedscoringofnarrativeessaysisachal-lengingarea,andonethathasnotbeenexploredex-tensivelyinNLPresearch.Previousworkonauto-matedessayscoringhasfocusedoninformational,argumentative,persuasiveandsource-basedwritingconstructs(StabandGurevych,2017;NguyenandLitman,2016;Farraetal.,2015;Somasundaranetal.,2014;BeigmanKlebanovetal.,2014;ShermisandBurstein,2013).Allo stesso modo,operationalessayscoringengines(AttaliandBurstein,2006;Elliot,2003)aregearedtowardsevaluatinglanguageproﬁ-ciencyingeneral.Inthiswork,welaytheground-workandpresenttheﬁrstresultsforautomatedscor-ingofnarrativeessays,focusingonnarrativequality.Oneofthechallengesinnarrativequalityanal-ysisisthescarcityofscoredessaysinthisgenre.Wedescribeadetailedmanualannotationstudyonscoringstudentessaysalongmultipledimensionsofnarrativequality,suchasnarrativedevelopmentandnarrativeorganization.UsingascoringrubricadaptedfromtheU.S.CommonCoreStateStan-dards,weannotated942essayswrittenfor18differ-entessay-promptsbystudentsfromthreedifferentgradelevels.Thisdatasetprovidesavarietyofstorytypesandlanguageproﬁciencylevels.Wemeasuredinter-annotatoragreementtounderstandreliabilityofscoringstoriesfortraits(e.g.,development)aswellassub-traits(e.g.,plotdevelopmentandtheuseofnarrativetechniques).Anumberoftechniquesforwritinggoodstoriesaretargetedbythescoringrubrics.Weimplementedasystemforautomaticallyscoringdifferenttraitsofnarratives,usinglinguisticfeaturesthatcapturesomeofthosetechniques.Weinvestigatedtheeffec-tivenessofeachfeatureforscoringnarrativetraitsandanalyzedtheresultstoidentifysourcesoferrors.Themaincontributionsofthisworkareasfol-lows:(1)Tothebestofourknowledge,thisistheﬁrstdetailedannotationstudyonscoringnarra-tiveessaysfordifferentaspectsofnarrativequality. l D o w n o a d e d f r o m h t t p : / / d i r e c t

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 77–89, 2018. Redattore di azioni: Patrick Pantel.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 77–89, 2018. Redattore di azioni: Patrick Pantel. Lotto di invio: 6/2017; Lotto di revisione: 10/2017; Pubblicato 2/2018. 2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. C (cid:13) EventTimeExtractionwithaDecisionTreeofNeuralClassiﬁersNilsReimers†,NazaninDehghani‡∗,IrynaGurevych††UbiquitousKnowledgeProcessingLab(UKP)andResearchTrainingGroupAIPHESDepartmentofComputerScience,TechnischeUniversit¨atDarmstadt‡SchoolofElectricalandComputerEngineering,UniversityofTehranwww.ukp.tu-darmstadt.deAbstractExtractingtheinformationfromtextwhenaneventhappenedischallenging.Documentsdonotonlyreportoncurrentevents,butalsoonpasteventsaswellasonfutureevents.Often,therelevanttimeinformationforaneventisscatteredacrossthedocument.Inthispaperwepresentanovelmethodtoauto-maticallyanchoreventsintime.Toourknowl-edgeitistheﬁrstapproachthattakestempo-ralinformationfromthecompletedocumentintoaccount.Wecreatedadecisiontreethatappliesneuralnetworkbasedclassiﬁersatitsnodes.Weusethistreetoincrementallyinfer,inastepwisemanner,atwhichtimeframeaneventhappened.WeevaluatetheapproachontheTimeBank-EventTimeCorpus(Reimersetal.,2016)achievinganaccuracyof42.0%com-paredtoaninter-annotatoragreement(IAA)of56.7%.Foreventsthatspanoverasingledayweobserveanaccuracyimprovementof33.1pointscomparedtothestate-of-the-artCAEVOsystem(Chambersetal.,2014).Withoutre-training,weapplythismodeltotheSemEval-2015Task4onautomatictimelinegenerationandachieveanimprovementof4.01pointsF1-scorecomparedtothestate-of-the-art.Ourcodeispublicallyavailable.11IntroductionKnowingwhenaneventhappenedisusefulforalotofusecases.Examplesareintheﬁeldsoftime-awareinformationretrieval,textsummarization,automatedtimelinegeneration,andautomaticknowledgebasepopulation.Manyfactsinaknowledgebaseare∗Duringauthor’sinternshipintheresearchtraininggroupAIPHESatUKPLab,TUDarmstadt.1https://github.com/ukplab/tacl2017-event-time-extractiononlytrueforacertaintimeperiod,forexamplethepresidencyofaperson.Hence,thepopulationofaknowledgebasecanhighlybeneﬁtfromhighqualityeventandeventtime2extraction(Surdeanu,2013).Inherenttoeventsistheconnectiontotime.Allan(2002)deﬁnesaneventas“somethingthathappensatsomespeciﬁctimeandplace”.Thechallengesforautomaticeventtimeextractionaremanifold.Thetemporalinformationinnewsarticleswhichstateswhenaneventhappenedis,inmostcases,notinthesameorinneighboringsentenceswiththeevent(Reimersetal.,2016).Itcanbementionedfarbeforetheeventorfaraftertheevent.Evenworse,formorethan60%ofevents,thespeciﬁcdayatwhichtheeventhappenedisnotmentioned.However,fromtheworldknowledgeandcausalrelations,thereadercaninferalotoftemporalinformationaboutthoseeventsandcanofteninferthattheeventhappenedbeforeoraftersomespeciﬁcpointintime.Inthispaperwedescribeanewclassiﬁerforauto-maticeventtimeextraction.WeusetheTimeBank-EventTimeCorpus(Reimersetal.,2016)totrainandevaluateourproposedarchitecture.Incontrasttoothercorporaontemporalrelations,theannota-tionoftheTimeBank-EventTimeCorpusdoesnotmakerestrictionswhere,andinwhichform,tempo-ralinformationforaneventmustbeprovided.Theannotatorswereallowedtotakethewholedocumentintoaccountandwereaskedtoanswer,tothebestoftheirability,thequestionatwhichdateortimeperiodtheeventhappened.Theeventtimeannotationforsomesampleeventsisshowninthefollowing:•Hewas[sent]1980-05-26intospaceonMay26,2Wewillrefertothetemporalinformationwhenaneventhappenedaseventtime. l D o w n o a d e d f r o m h t t p : / / d i r e c t

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 33–48, 2018. Redattore di azioni: Regina Barzilay.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 33–48, 2018. Redattore di azioni: Regina Barzilay. Lotto di invio: 5/2016; Lotto di revisione: 10/2016; Pubblicato 1/2018. 2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. C (cid:13) JointSemanticSynthesisandMorphologicalAnalysisoftheDerivedWordRyanCotterellDepartmentofComputerScienceJohnsHopkinsUniversityryan.cotterell@jhu.eduHinrichSch¨utzeCISLMUMunichinquiries@cislmu.orgAbstractMuchlikesentencesarecomposedofwords,wordsthemselvesarecomposedofsmallerunits.Forexample,theEnglishwordquestionablycanbeanalyzedasquestion+able+ly.However,thisstructuraldecompositionoftheworddoesnotdirectlygiveusasemanticrepresentationoftheword’smeaning.Sincemorphologyobeystheprincipleofcompositionality,thesemanticsofthewordcanbesystematicallyderivedfromthemeaningofitsparts.Inthiswork,weproposeanovelprobabilisticmodelofwordformationthatcapturesboththeanalysisofawordwintoitsconstituentsegmentsandthesynthesisofthemeaningofwfromthemean-ingsofthosesegments.Ourmodeljointlylearnstosegmentwordsintomorphemesandcomposedistributionalsemanticvectorsofthosemorphemes.WeexperimentwiththemodelonEnglishCELEXdataandGermanDErivBase(Zelleretal.,2013)data.WeshowthatjointlymodelingsemanticsincreasesbothsegmentationaccuracyandmorphemeF1bybetween3%and5%.Additionally,weinvestigatedifferentmodelsofvectorcompo-sition,showingthatrecurrentneuralnetworksyieldanimprovementoversimpleadditivemodels.Finally,westudythedegreetowhichtherepresentationscorrespondtoalinguist’snotionofmorphologicalproductivity.1IntroductionInmostlanguages,wordsdecomposefurtherintosmallerunits,termedmorphemes.Forexample,theEnglishwordquestionablycanbeanalyzedasquestion+able+ly.Thisstructuraldecompositionoftheword,Tuttavia,byitselfisnotasemanticrep-resentationoftheword’smeaning;1wefurtherre-quireanaccountofhowtosynthesizethemeaningfromthedecomposition.Fortunately,words—justlikephrases—toalargeextentobeytheprincipleofcompositionality:thesemanticsofthewordcanbesystematicallyderivedfromthemeaningofitsparts.2Inthiswork,weproposeanoveljointprob-abilisticmodelofwordformationthatcapturesbothstructuraldecompositionofawordwintoitscon-stituentsegmentsandthesynthesisofw’smeaningfromthemeaningofthosesegments.Morphologicalsegmentationisastructuredpre-dictiontaskthatseekstobreakawordupintoitsconstituentmorphemes.Theoutputsegmentationhasbeenshowntoaidadiversesetofapplications,suchasautomaticspeechrecognition(Aﬁfyetal.,2006),keywordspotting(Narasimhanetal.,2014),machinetranslation(CliftonandSarkar,2011)andparsing(SeekerandC¸etino˘glu,2015).Incontrasttomuchofthispriorwork,wefocusonsupervisedsegmentation,i.e.,weprovidethemodelwithgoldsegmentationsduringtrainingtime.Insteadofsur-1Therearemanydifferentlinguisticandcomputationaltheo-riesforinterpretingthestructuraldecompositionofaword.Forexample,un-oftensigniﬁesnegationanditseffectonsemanticscanthenbemodeledbytheoriesbasedonlogic.Thisworkad-dressesthequestionofstructuraldecompositionandsemanticsynthesisinthegeneralframeworkofdistributionalsemantics.2Morphologicalresearchintheoreticalandcomputationallinguisticsoftenfocusesonnoncompositionalorlesscom-positionalphenomena—simplybecausecompositionalderiva-tionposesfewerinterestingresearchproblems.Itisalsotruethat—justasmanyfrequentmultiwordunitsarenotcompletelycompositional—manyfrequentderivations(e.g.,refusal,ﬁt-ness)arenotcompletelycompositional.Anindicationthatnon-lexicalizedderivationsareusuallycompositionalisthefactthatstandarddictionarieslikeOUPeditors(2010)listderivationalafﬁxeswiththeircompositionalmeaning,withoutahedgethattheycanalsooccuraspartofonlypartiallycompositionalforms.SeealsoHaspelmathandSims(2013),§5.3.6. l D o w n o a d e d f r o m h t t p : / / d i r e c t

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 17–31, 2018. Redattore di azioni: Ani Nenkova.

Operazioni dell'Associazione per la Linguistica Computazionale, vol. 6, pag. 17–31, 2018. Redattore di azioni: Ani Nenkova. Lotto di invio: 7/17; Lotto di revisione: 11/2017; Pubblicato 1/2018. 2018 Associazione per la Linguistica Computazionale. Distribuito sotto CC-BY 4.0 licenza. C (cid:13) MultipleInstanceLearningNetworksforFine-GrainedSentimentAnalysisStefanosAngelidisandMirellaLapataInstituteforLanguage,CognitionandComputationSchoolofInformatics,UniversityofEdinburgh10CrichtonStreet,EdinburghEH89ABs.angelidis@ed.ac.uk,mlap@inf.ed.ac.ukAbstractWeconsiderthetaskofﬁne-grainedsenti-mentanalysisfromtheperspectiveofmulti-pleinstancelearning(MIL).Ourneuralmodelistrainedondocumentsentimentlabels,andlearnstopredictthesentimentoftextseg-ments,i.e.sentencesorelementarydiscourseunits(EDUs),withoutsegment-levelsupervi-sion.Weintroduceanattention-basedpolar-ityscoringmethodforidentifyingpositiveandnegativetextsnippetsandanewdatasetwhichwecallSPOT(asshorthandforSegment-levelPOlariTyannotations)forevaluatingMIL-stylesentimentmodelslikeours.Experimen-talresultsdemonstratesuperiorperformanceagainstmultiplebaselines,whereasajudge-mentelicitationstudyshowsthatEDU-levelopinionextractionproducesmoreinformativesummariesthansentence-basedalternatives.1IntroductionSentimentanalysishasbecomeafundamentalareaofresearchinNaturalLanguageProcessingthankstotheproliferationofuser-generatedcontentintheformofonlinereviews,blogs,internetforums,andsocialmedia.Aplethoraofmethodshavebeenpro-posedintheliteraturethatattempttodistillsenti-mentinformationfromtext,allowingusersandser-viceproviderstomakeopinion-drivendecisions.Thesuccessofneuralnetworksinavarietyofap-plications(Bahdanauetal.,2015;LeandMikolov,2014;Socheretal.,2013)andtheavailabilityoflargeamountsoflabeleddatahaveledtoanin-creasedfocusonsentimentclassiﬁcation.Super-visedmodelsaretypicallytrainedondocuments(JohnsonandZhang,2015UN;JohnsonandZhang,2015B;Tangetal.,2015;Yangetal.,2016),sen-tences(Kim,2014),orphrases(Socheretal.,2011;[Rating:??]IhadaverymixedexperienceatTheStand.Theburgerandfriesweregood.Thechocolateshakewasdivine:richandcreamy.Thedrive-thruwashorrible.Ittookusatleast30minutestoorderwhentherewereonlyfourcarsinfrontofus.Wecomplainedaboutthewaitandgotahalf–heartedapology.Iwouldgobackbecausethefoodisgood,butmyonlyhesitationisthewait.Summary+Theburgerandfriesweregood+Thechocolateshakewasdivine+Iwouldgobackbecausethefoodisgood–Thedrive-thruwashorrible–Ittookusatleast30minutestoorderFigure1:AnEDU-basedsummaryofa2-out-of-5starreviewwithpositiveandnegativesnippets.Socheretal.,2013)annotatedwithsentimentla-belsandusedtopredictsentimentinunseentexts.Coarse-graineddocument-levelannotationsarerel-ativelyeasytoobtainduetothewidespreaduseofopiniongradinginterfaces(e.g.,starratingsac-companyingreviews).Incontrast,theacquisitionofsentence-orphrase-levelsentimentlabelsre-mainsalaboriousandexpensiveendeavordespiteitsrelevancetovariousopinionminingapplica-tions,e.g.,detectingorsummarizingconsumeropin-ionsinonlineproductreviews.Theusefulnessofﬁner-grainedsentimentanalysisisillustratedintheexampleofFigure1,wheresnippetsofopposingpo-laritiesareextractedfroma2-starrestaurantreview.Although,asawhole,thereviewconveysnegativesentiment,aspectsofthereviewer’sexperiencewereclearlypositive.Thisgoeslargelyunnoticedwhenfocusingsolelyonthereview’soverallrating.Inthiswork,weconsidertheproblemofsegment-levelsentimentanalysisfromtheperspectiveofMultipleInstanceLearning(MIL;Keeler,1991). l D o w n o a d e d f r o m h t t p : / / d i r e c t

CORRIGENDUM: MEASURING UNCERTAINTY

CORRIGENDUM: MEASURING UNCERTAINTY AND ITS IMPACT ON THE ECONOMY Andrea Carriero, Todd E. Clark, and Massimiliano Marcellino* Original article: Carriero, Andrea, Todd E. Clark, and Massimiliano Marcellino, “Measuring Uncertainty and Its Impact on the Economy,” this REVIEW 100:5 (2018), 799–815. 10.1162/rest_a_00693 Abstract—Carriero, Clark, and Marcellino (2018, CCM2018) used a large BVAR model with a factor structure to stochastic volatility to produce an estimate of time-varying

UNRAVELING AMBIGUITY AVERSION∗

UNRAVELING AMBIGUITY AVERSION∗ Ilke Aydogan† Lo¨ıc Berger‡ Valentina Bosetti§ Abstract We report the results of two experiments designed to better understand the mechanisms driving decision-making under ambiguity. We elicit individual prefer- ences over diﬀerent sources of uncertainty, entailing diﬀerent degrees of complexity, from subjects with diﬀerent sophistication levels. We show that (1) ambiguity aversion is robust to sophistication, but the strong relationship previously reported between

MEASURING “GROUP COHESION”

MEASURING “GROUP COHESION” TO REVEAL THE POWER OF SOCIAL RELATIONSHIPS IN TEAM PRODUCTION SIMON GÄCHTER, CHRIS STARMER AND FABIO TUFANO* University of Nottingham and University of Leicester** 30 novembre 2022 We introduce “group cohesion” to study the economic relevance of social relationships in team production. We operationalize measurement of group cohesion, adapting the “oneness scale” from psychology. A series of experiments, including a pre-registered replication,

Alcohol, violence and injury-induced mortality:

Alcohol, violence and injury-induced mortality: Evidence from a modern-day prohibition* Kai Barron1, Charles D.H. Parry2,4, Debbie Bradshaw2,3, Rob Dorrington3, Pam Groenewald2, Ria Laubscher2, and Richard Matzopoulos2,3 1WZB Berlin 2South African Medical Research Council 3University of Cape Town 4Stellenbosch University Abstract This paper evaluates the impact of a sudden and unexpected nation-wide alcohol sales ban in South Africa. We find that this policy causally reduced injury-induced

Assortative Matching of Exporters and Importers*

Assortative Matching of Exporters and Importers* Yoichi Sugita† Kensuke Teshima‡ Enrique Seira§ July 2021 Abstract This paper studies how exporting and importing ﬁrms match based on their ca- pability by investigating the change in such exporter–importer matching during trade liberalization. During the recent liberalization on the Mexico-US textile/apparel trade, exporters and importers often switch their main partners as well as change trade vol- umes. Noi

Impulse Purchases, Gun Ownership, and Homicides: Evidence from a Firearm Demand

Impulse Purchases, Gun Ownership, and Homicides: Evidence from a Firearm Demand Shock1 Christoph Koenig2 David Schindler3 July 23, 2021 Astratto: Do ﬁrearm purchase delay laws reduce aggregate homicide levels? Using variation from a 6-month countrywide gun demand shock in 2012/2013, we show that U.S. states with legislation preventing immediate handgun purchases experienced smaller increases in handgun sales. Our ﬁndings indicate that this is likely driven

THE DYNAMIC EFFECTS OF TAX AUDITS

THE DYNAMIC EFFECTS OF TAX AUDITS Arun Advani, William Elming, and Jonathan Shaw* Abstract—We study the effects of audits on long run compliance behavior using a random audit program covering more than 53,000 tax returns. We ﬁnd that audits raise reported tax liabilities for ﬁve years after audit, effects are longer-lasting for more stable sources of income, and only individuals found to have made errors

The Review of Economics and Statistics

The Review of Economics and Statistics VOL. CV JULY 2023 NUMBER 4 LONG-TERM CARE HOSPITALS: A CASE STUDY IN WASTE Liran Einav, Amy Finkelstein, and Neale Mahoney* Abstract—There is substantial waste in U.S. healthcare but little consensus on how to combat it. We identify one source of waste: long-term care hos- pitals (LTCHs). Using the entry of LTCHs into hospital markets in an event study

DISCRIMINATION, NARRATIVES, AND FAMILY HISTORY: AN EXPERIMENT

DISCRIMINATION, NARRATIVES, AND FAMILY HISTORY: AN EXPERIMENT WITH JORDANIAN HOST AND SYRIAN REFUGEE CHILDREN Kai Barron, Heike Harmgart, Steffen Huck, Sebastian O. Schneider, and Matthias Sutter* Abstract—We measure the prevalence of discrimination between Jordanian host and Syrian refugee children attending school in Jordan. Using a simple sharing experiment, we ﬁnd only a small degree of out-group discrimination. Tuttavia, Jordanian children with Palestinian roots do not