TRALIMA/ITZULIK-Kongresuak; TRALIMA/ITZULIK-Congresos
http://hdl.handle.net/10810/35746
2024-03-29T07:03:37Z

Statistics and visualisations of theatre corpora using corpus analysis software
http://hdl.handle.net/10810/58453
2022-11-22T19:24:25Z
Sanjurjo-González, Hugo; Andaluz Pinedo, Olaia
Corpus linguistics is a powerful quantitative methodology that relies on frequency data and statistical procedures (Han 2019). According to Gries (2013), scientific quantitative research has three main goals: description, explanation and prediction of data. Within this frame, statistics makes sense of quantitative data by means of analysis and useful visualisations (Brezina 2018).
Many techniques, such as the statistical identification of collocations or keywords, have been designed for monolingual corpora. While most of these can also be applied to other types of corpora, such as parallel and comparable ones, a dedicated set of statistics that addresses the structural singularities of text types such as theatre plays appears to be missing.
In this study, we propose a range of adaptations of statistics and visualisations that apply and interrelate theatre-specific filters. The division of dramatic texts into structural units is a distinctive feature of this genre (Andaluz-Pinedo and Sanjurjo-González in press). Utterances, speakers, stage directions and dialogues are intrinsic parts of these texts and must be taken into account when developing useful and descriptive statistical procedures. One example of such an adaptation is quantitative analysis based on units such as characters, utterances, stage directions and dialogues, rather than on the text data as a whole.
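As an illustration only, unit-based counting of this kind might be sketched as follows. The input format (lines of the form "SPEAKER: utterance" and stage directions in square brackets), the toy play and the function name `unit_statistics` are all assumptions made for this sketch; they are not the authors' implementation or the format used by ACTRES Corpus Manager.

```python
import re
from collections import Counter, defaultdict

# Hypothetical toy format: "SPEAKER: utterance" lines and "[stage direction]" lines.
PLAY = """\
[A kitchen. Morning.]
ANNA: Good morning. Did you sleep well?
BRUNO: Not really.
[Bruno sits down.]
ANNA: Coffee will help. Coffee always helps.
"""

TOKEN = re.compile(r"\w+")


def unit_statistics(text):
    """Count utterances and tokens per speaker, and tokens in stage directions."""
    utterances = Counter()            # speaker -> number of utterances
    tokens = defaultdict(Counter)     # speaker -> token frequencies
    direction_tokens = Counter()      # token frequencies in stage directions
    n_directions = 0
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("[") and line.endswith("]"):
            n_directions += 1
            direction_tokens.update(t.lower() for t in TOKEN.findall(line))
            continue
        speaker, sep, speech = line.partition(":")
        if not sep:
            continue  # skip lines that fit neither assumed pattern
        speaker = speaker.strip()
        utterances[speaker] += 1
        tokens[speaker].update(t.lower() for t in TOKEN.findall(speech))
    return {
        "utterances": utterances,
        "tokens": tokens,
        "n_directions": n_directions,
        "direction_tokens": direction_tokens,
    }


stats = unit_statistics(PLAY)
for speaker, count in stats["utterances"].items():
    total = sum(stats["tokens"][speaker].values())
    print(f"{speaker}: {count} utterances, {total} tokens, "
          f"{total / count:.1f} tokens per utterance")
```

Keeping separate frequency tables per speaker and for stage directions means that any statistic normally computed over the whole text could, in principle, be recomputed over one structural unit at a time.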
As Anthony (2013) states, “the functionality offered by software tools largely dictates what corpus linguistics research methods are available to a researcher”. To improve this functionality for the analysis of theatre corpora, further work includes integrating this approach into existing corpus analysis software that processes theatre play-texts, such as ACTRES Corpus Manager (Sanjurjo-González 2017).
38º Congreso Internacional AESLA (2021)
2021-01-01T00:00:00Z

El hombre de La Mancha: de Broadway a Madrid
http://hdl.handle.net/10810/51713
2022-11-22T19:13:14Z
Merino Álvarez, Raquel
Written by Dale Wasserman, Man of La Mancha is a musical play based on the work of Cervantes. It premiered in New York in 1965 and in Madrid in 1966, in a Spanish translation by playwright José López Rubio. With a cast featuring Nati Mistral in one of the main roles, the Madrid production traveled to Argentina and Mexico, following the typical transatlantic circuits of Spanish-language theater culture. It thus began a journey that continues to this day and that will be examined throughout this session.
Invited lecture given by Raquel Merino Álvarez at the Observatorio de la Lengua Española y las Culturas Hispánicas en los Estados Unidos, Instituto Cervantes at Harvard, on 25 February 2021 (15:00-16:30 Boston | 21:00-22:30 Madrid), online via Zoom. Available online on the YouTube channel (Observatorio. Instituto Cervantes at Harvard).
2021-02-25T00:00:00Z

Bideo-jokoen lokalizazioa azkertuz
http://hdl.handle.net/10810/49024
2022-11-25T19:04:10Z
Zorrakin Goikoetxea, Itziar
The changes a video game must undergo to be commercialized in a market other than the one in which it was originally published are known as localization. Although translation is the most important part of the localization process, it is not the only one: the names of the characters, as well as their physical appearance, may also be modified. Starting from the classic Pac-Man and continuing with games such as Final Fantasy, Spyro or Jak and Daxter, we will see some of the most characteristic examples of video game localization. We will analyse the different localization levels (box and docs, partial localization and total localization) and the relationship between the translation and the original text (domestication vs foreignization). We will learn about the experience some developers have had with video game localization and, finally, we will analyse the current situation and trends.
Video lecture given at "Jokoteknia III: Bideojoko Jardunaldia 2020" on 28 November 2020 at Azkue Fundazioa, Bilbao. With subtitles in Spanish and English.
2020-11-28T00:00:00Z

OpenTagger: A flexible and user-friendly linguistic tagger
http://hdl.handle.net/10810/48683
2022-11-22T19:03:11Z
Sanjurjo-González, Hugo; Andaluz Pinedo, Olaia
Linguistic annotation adds valuable information to a corpus. Annotated corpora are highly useful for linguists since they increase the range of linguistic phenomena that may be registered, categorised and retrieved. In addition, they are also significant for machines, as Natural Language Processing applications involve working with well-annotated data (e.g. Imran, Mitra and Castillo 2016) and some machine learning classifiers employ annotated data to test or train new language annotation tools, among other uses. In this regard, Pustejovsky and Stubbs (2012) report on stages for building annotated corpora to train machine learning algorithms.
This paper describes OpenTagger, a new linguistic tagger that allows users to attach any type of information to the paragraphs, sentences or words that compose a text. OpenTagger is characterised by its high usability and flexibility. It is a web application that allows users to annotate texts manually, using either their own predefined tag set or a newly created one. It thus answers any need for a tailor-made annotation system. The tag set may include nested categories, and multiple layers of annotation are possible. The annotation process is straightforward and provides two options: i) selecting text and then tagging it; ii) selecting a tag and annotating as much text as needed. OpenTagger also includes a search box to query the text and retrieve relevant sections for tagging. In sum, the open character of this tool and its user-friendliness allow the benefits of annotation to be extended to a wider variety of research questions.
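To make the ideas of nested categories and multiple annotation layers concrete, the following is a minimal sketch of a stand-off annotation model. It is purely illustrative: the class names, the slash-separated tag paths and the character-offset representation are assumptions made for this example and do not describe OpenTagger's actual data model or interface.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Annotation:
    start: int   # character offset, inclusive
    end: int     # character offset, exclusive
    tag: str     # path in a nested tag set, e.g. "speech_act/question"
    layer: str   # name of the annotation layer


@dataclass
class AnnotatedText:
    text: str
    tagset: set                      # hypothetical user-defined tag set
    annotations: list = field(default_factory=list)

    def tag_span(self, start, end, tag, layer):
        """Option i (assumed): select a span of text, then tag it."""
        if tag not in self.tagset:
            raise ValueError(f"unknown tag: {tag}")
        if not 0 <= start < end <= len(self.text):
            raise ValueError("span outside text")
        self.annotations.append(Annotation(start, end, tag, layer))

    def tag_matches(self, query, tag, layer):
        """Option ii (assumed): select a tag, then annotate every match of a query."""
        if not query:
            raise ValueError("empty query")
        count = 0
        pos = self.text.find(query)
        while pos != -1:
            self.tag_span(pos, pos + len(query), tag, layer)
            count += 1
            pos = self.text.find(query, pos + len(query))
        return count

    def spans(self, tag_prefix, layer=None):
        """Return annotated strings for a tag or any of its nested sub-tags."""
        return [
            self.text[a.start:a.end]
            for a in self.annotations
            if (a.tag == tag_prefix or a.tag.startswith(tag_prefix + "/"))
            and (layer is None or a.layer == layer)
        ]


doc = AnnotatedText(
    "Is it late? It is late.",
    tagset={"speech_act", "speech_act/question", "speech_act/statement", "time"},
)
doc.tag_span(0, 11, "speech_act/question", "pragmatics")
doc.tag_span(12, 23, "speech_act/statement", "pragmatics")
n_time = doc.tag_matches("late", "time", "semantics")
print(doc.spans("speech_act"))
print(doc.spans("time", layer="semantics"))
```

Storing annotations as offsets alongside the unchanged text lets spans from different layers overlap freely, and querying by tag prefix retrieves a category together with its nested sub-categories.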
OpenTagger differs from other well-known taggers such as NooJ (Silberztein 2005) in its simplicity and web access, as it is not specialised for grammar construction or other complex processes. Potential users range from novice linguistics researchers to experts. Finally, further integration with the corpus analysis software ACTRES Corpus Manager (ACM; Sanjurjo-González 2017) is planned. OpenTagger will make the process of building and querying custom annotated corpora with ACM more straightforward.
2020-01-01T00:00:00Z