CLC

(approved by the Board of Directors on 07/11/2017, minutes no. 77)

The Linguistic Research Centre on Corpora (CLC) of Università degli Studi Internazionali di Roma – UNINT brings together professors belonging to the degree courses in the linguistic area of ​​the Department of International Humanities and Social Sciences active in the field of corpus linguistics research and specialised in analysis of written and oral texts, interpreted and translated speech, L2 interaction and learning varieties. The development of research with the use of technologies applied to linguistic analysis has an immediate application in the teaching of various courses: interpretation, translation and language, terminology and applied linguistics (sociolinguistics, contact linguistics, textual linguistics, pragmatics).

Scientific Director

Laura Mori
Department of International Humanities and Social Sciences

Objectives

The CLC intends to support the research activities carried out by its members, promoting scientific dialogue within the Department and the creation of new synergies from an international perspective.

The CLC aims to disseminate the scientific results achieved by the members and to encourage the exchange of knowledge regarding in particular the development of applied research protocols, the use of programs dedicated to linguistic analysis on corpora, the development of linguistic databases and of terminological repertoires.

The CLC presents itself as a sharing space for UNINT lecturers and undergraduates/graduates in constant connection with the outside world for the advancement of research and the improvement of teaching practices.

Fields of research

The research activities conducted by the members of the CLC are oriented towards scientific development in the following thematic areas:

  • corpus-based sociolinguistics (specialist and sectoral varieties);
  • corpus-based acquisitional linguistics (L2 varieties);
  • corpus-based discourse and conversation analysis;
  • translation on parallel and/or comparable corpora;
  • translation memories and databases;
  • corpus-based terminology and phraseology.

University Research Committee

The members of the CLC have scientific publications to their credit that refer to their activities in the field of linguistic research on corpora. The current configuration of the CLC involves the involvement of tenured and contract lecturers, experts on the subject belonging to the degree courses in the linguistic area of Department of International Humanities and Social Sciences.


Furthermore the CLC includes external, national and European members, who collaborate within research projects developed by UNINT.

Lorenzo Blini

Michael Sherman Boyd

Marina Brancaccio

Stefania Cerrito

Claudia Coppola

Manuela Frontera

Najla Kalach

Fiorenza Mileto

Laura Mori

Lucilla Pizzoli

Federica Politanò

Fabio Proia

Claudio Russello

Annalisa Sandrelli

Lucja Biel 
UNIVERSITY OF WARSAW

Giuditta Caliendo 
UNIVERSITÉ DE LILLE

Paolo Cavanese
UNIVERSITÉ DE GENÈVE

Sandro Caruana 
UNIVERSITY OF MALTA

Sara Castagnoli 
UNIVERSITÀ DI MACERATA

Chiara Degano 
UNIVERSITÀ DEGLI STUDI DI ROMA TRE

Gatis Dilans 
VENTSPILS UNIVERSITY COLLEGE

Eman El Sherbiny Ismail
UNIVERSITÄT POTSDAM

Vittorio Ganfi 
UNIVERSITÀ DEGLI STUDI DI MODENA E REGGIO EMILIA

Claudio Fantinuoli 
JOHANNES GUTENBERG-UNIVERSITÄT MAINZ

Annarita Felici 
UNIVERSITÉ DE GENÈVE

Adriano Ferraresi 
UNIVERSITY OF BOLOGNA

Rudy Loock 
UNIVERSITY OF LILLE-CHARLES 
DE GAULLE

Mikhail Mikahilov 
UNIVERSITY OF TAMPERE

Valentina Piunno
UNIVERSITÀ DEGLI STUDI DI BERGAMO

Benedikt Szmrecsanyi 
KATHOLIEKE UNIVERSITEIT LEUVEN

Vilelmini Sosoni 
IONIAN UNIVERSITY OF CORFÙ

Rubén González Vallejo 
UNIVERSITÀ DEGLI STUDI DI MACERATA

Area research projects and groups

Over the last few years UNINT has financed the following research projects and supported the activities of research groups that fall within the objectives pursued by the CLC:

The research group represents the continuation of the Eurolect Observatory Project (EOP), directed by the lecturer Laura Mori, and which took place in two phases (2013-2016 and 2017-2020).

The UNINT Eurolect Observatory research group, coordinated by the lecturer Laura Mori, was born within the Department of Interpreting and Translation (FIT) and was financed by the University Research Fund of Università degli Studi Internazionali di Roma – UNINT.

The first phase (2013-2016) of research activities on 11 languages ​​- Dutch, English, Finnish, French, German, Greek, Italian, Latvian, Maltese, Polish, Spanish - saw the participation of European academics from various Italian and foreign universities (Institute for the Languages ​​of Finland, Ionian University, Ghent University, Paris Diderot University, University of Malta, University of Warsaw, Università di Roma ‘Tor Vergata’, University of Tampere, Ventspils University College).

The research group focused on the analysis of corpora of EU directives and national legislative transposition texts written in the 11 languages ​​under study in order to:

  • verify and/or confirm the hypothesis of the existence of legal language varieties of the European Union (the so-called eurolects), which originated and established themselves within the linguistic diasystems of some member countries, or in parts of them;
  • highlight the linguistic differences between the Eurolects and the respective national legal language varieties in order to offer functional elements for reflection on linguistic and translation variation;
  • provide reference data to the entities involved or interested (Language Services of the EU institutions; national, regional and autonomous chambers);
  • produce useful results for improving the editorial quality of legal texts (legal drafting), both at a national and supranational level.

GEROM is the final product of an Italian-German project born and implemented within the Fachbereich Translations-, Sprach- und Kulturwissenschaft (FTSK) of the Johannes Gutenberg-Universität Mainz/Germersheim and the Department of Interpreting and Translation (FIT) of Università degli Studi Internazionali di Roma – UNINT. The project was divided into two phases, between 2013 and 2015. In the first phase, the project coordinators brought together students and undergraduates in two working groups to evaluate the contribution of the terminology to the Italian-German dialogue. In the second phase, a Round Table was organised which allowed discussion with language professionals, researchers and university professors specialised in terminology and corpus linguistics.

Learn more

Dub-Talk Project – 2013/2015 – and TV-Talk – 2016/2018 – The project aims to study the characteristics of television and cinema speech and its translation into dubbing, the most widespread in Italy. The collaboration between Università di Pisa (Department of Philology, Literature and Linguistics) and Università degli Studi Internazionali di Roma – UNINT, which began in 2013, aims to broaden the scope of this project, setting up a common parallel corpus, consisting of transcripts of original dialogue and dubbed dialogue from US and British TV series and films.

Learn more

In 2023, the Research Group on Terminology (UNINTerm) was established within the Linguistic Research Centre on Corpora (CLC). Given the close connection between terminology/terminography and corpus linguistics, the Group's activities fall fully within the scope of investigation of the CLC, which also includes among its thematic areas terminology and phraseology extracted with corpus-based and corpus-driven methodologies. The activities of UNINTerm aim to extend the field of research to the languages ​​included in the educational offer of UNINT degree courses and to make use of interdisciplinary contributions from different scientific areas, internal or external to the University.

Learn more

Activities

The Linguistic Research Centre on Corpora annually organises a series of corpus linguistics research seminars (Corpus Linguistics Workshops) available in the "Seminars" section below. 

The seminars constitute a moment of in-depth analysis of methodological issues, presentation of mixed analysis approaches (qualitative and quantitative), use of software for the description of linguistic variation through the use of synchronic and diachronic corpora and databases. 

The seminars, held both by members of the CLC and by invited speakers, are open to all people interested in the potential of corpora for linguistic research. 

The seminars aim to promote reflection and exchange within the scientific community and represent an opportunity for growth for undergraduate and graduate students who intend to make use of technological resources for the analysis of linguistic data from a sociolinguistic, pragmatic, translation and terminological perspective.

Seminars​

All the events of the Research Centre

Digital archive

To date, the following corpora, digital catalogues and portals have been collected within Università degli Studi Internazionali di Roma – UNINT, as research products of financed group projects (also with the collaboration of undergraduates and graduates of the linguistic area) by the University Research Committee.

Corpora

Corpus MULPOLDIS (Multilingual Multimodal Political Discourse): starting from 2020, the CLC has concentrated its activity on the creation of a digital space for the archiving of corpora derived from master's thesis works in textual linguistics and pragmatics, linguistics of the languages ​​of study, interpretation and translation which involved a collection of oral texts.

Learn more

Eurolect Observatory Multilingual Corpus (Eurolect Observatory Project): the multilingual corpus includes a total of 11 languages. UNINT financed the establishment of sub-corpus A and B of the following languages: English, French, Italian, Maltese, Polish and sub-corpus C (scientific manager: lecturer Laura Mori) Sub-corpus A: 660 directives in 8 linguistic versions (1/1/1999 – 31/12/2008) concerning all EU policies; Sub-corpus B: national measures implementing the directives of sub-corpus A in Belgium, England, France, Germany, Italy, Malta, Poland, Spain. Sub-corpus C: rules of national origin in English, German, Italian, Spanish.

Learn more

Dub-Talk Corpus (Dub-Talk Project): parallel corpus of transcriptions of dubbed TV series and films (scientific director: lecturer Annalisa Sandrelli).

TV-talk Corpus (TV-Talk Project): corpus in the combinations English > Italian and Italian > English including different genres and new translation methods, including voice-over (symil-synch) and subtitling (scientific director: lecturer Annalisa Sandrelli).

Digital catalogues

Focus on Eurolects: As of 2021, a database has been set up to provide an overview of studies, conducted as part of individual or group research projects, dealing with one or more legal language varieties used in EU legislation (“Eurolects ”). In most cases they analyse and compare data from legislative corpora (e.g. EU directives and national transposition measures) or judgments issued by the EU Court of Justice and/or national courts.

The titles of the publications are divided into two sections, depending on the method and approach adopted by the author or authors. The first section (“Monolingual studies”) includes studies examining the EU and the national legal language used in a particular EU Member State (e.g. German Eurolect on the one hand, the German variety in use in the Federal Republic of Germany on the other). The works that extend their field of investigation to two or more Eurolects are listed in the second section (“Multilingual studies”). All titles are grouped by language, indicated by the corresponding ISO 639-1 code, and presented in chronological order. The acronym EOP in brackets indicates the works that were published by the participants in the first and second phases of the Eurolect Observatory Project (2013-2016 and 2017-2020). The last two sections are dedicated respectively to degree and doctoral theses, both published and unpublished, and to book reviews.

Learn more

Portals

GEROM – German-Italian Terminology Portal (GEROM Project): lexical and textual platform, bilingual and dynamic, to support the analysis and translation of texts on topics of politics, society and culture of German and Italian speaking countries (responsible scientific: lecturer Fabio Proia)

Learn more

Share
Share