Miranda Martija, Imanol (2024-07-09)
[EN] The emergence of Transformer architectures, pretrained models and multimodal data problems have generated new challenges to solve. One of the most popular in recent years is the visual-linguistic task Visual Question ...