dc.contributor.author | Sainz Jiménez, Oscar | |
dc.contributor.author | López de Lacalle Lecuona, Oier | |
dc.contributor.author | Labaka Intxauspe, Gorka | |
dc.contributor.author | Barrena Madinabeitia, Ander | |
dc.contributor.author | Agirre Bengoa, Eneko | |
dc.date.accessioned | 2024-10-15T17:49:30Z | |
dc.date.available | 2024-10-15T17:49:30Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing :1199-1212 (2021) | es_ES |
dc.identifier.uri | http://hdl.handle.net/10810/69965 | |
dc.description.abstract | Relation extraction systems require large amounts of labeled examples which are costly to annotate. In this work we reformulate relation extraction as an entailment task, with simple, hand-made, verbalizations of relations produced in less than 15 min per relation. The system relies on a pretrained textual entailment engine which is run as-is (no training examples, zero-shot) or further fine-tuned on labeled examples (few-shot or fully trained). In our experiments on TACRED we attain 63% F1 zero-shot, 69% with 16 examples per relation (17% points better than the best supervised system on the same conditions), and only 4 points short to the state-of-the-art (which uses 20 times more training data). We also show that the performance can be improved significantly with larger entailment models, up to 12 points in zero-shot, allowing to report the best results to date on TACRED when fully trained. The analysis shows that our few-shot systems are specially effective when discriminating between relations, and that the performance difference in low data regimes comes mainly from identifying no-relation cases. | es_ES |
dc.description.sponsorship | Oscar Sainz is funded by a PhD grant from the Basque Government (PRE_2020_1_0246). This work is based upon work partially supported via the IARPA BETTER Program contract No. 2019-19051600006 (ODNI, IARPA), and by the Basque Government (IXA excellence research group IT1343-19). | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | ACL | es_ES |
dc.rights | info:eu-repo/semantics/openAccess | es_ES |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/es/ | * |
dc.title | Label Verbalization and Entailment for Effective Zero and Few-Shot Relation Extraction | es_ES |
dc.type | info:eu-repo/semantics/conferenceObject | es_ES |
dc.rights.holder | (c)2021 The Association for Computational Linguistics, licensed on a Creative Commons Attribution 4.0 International License. | es_ES |
dc.relation.publisherversion | https://doi.org/10.18653/v1/2021.emnlp-main.92 | es_ES |
dc.identifier.doi | 10.18653/v1/2021.emnlp-main.92 | |
dc.departamentoes | Lenguajes y sistemas informáticos | es_ES |
dc.departamentoeu | Hizkuntza eta sistema informatikoak | es_ES |