Corpora and Databases

Credit points: 3
Contact hours: 60
Type: Seminar Classes


Dr. Darja Fišer, Assistant Professor
Dr. Špela Vintar, Full Professor

Course Description

Objectives and competences

  1. Competence in using computational and technical tools related to translation;
  2. Competence in terminological research and the use of software tools for storing term data and constructing term banks.

Content (Syllabus outline)

The emphasis is on computer-aided search of terminological resources and on specialised tools for terminology storage and management. The course involves hands-on training in corpus querying and building as well as advanced methods of extracting knowledge from corpora.

- overview of language and translation tools
- term bases and corpora on the web
- introduction to corpus linguistics
- web corpora
- the IJS-ELAN and Trans parallel corpora
- regular expressions
- building a parallel corpus
- corpus encoding standard
- corpus tools: WordSmith and other concordance software
- alignment with ATRIL Déjà Vu
- querying parallel corpora with ParaConc
- databases: MS Access
- terminology management: TRADOS MultiTerm
- computational terminology and localisation
- methods of automatic term recognition


  1. None.


  1. Registration in the first year of the MA programme in Translation.


  1. Project work, seminar assignment, written assignment.
