Multi-dimensional analysis (MDA) is a method developed by corpus linguist Douglas Biber used for empirical research of text variation. The aim of MDA is to capture the variation based on the function that variant language features have in texts. In contrast to earlier approaches, the goal of MDA is not the a priori identification of linguistic features that are typical of a particular communication domain; MDA, on the contrary, uses the co-occurrence of linguistic features as the starting point for interpretation. From the features that co-occur frequently in texts, it is then possible to infer what function these features collectively fulfill.
MDA has been used as a research method for modeling register variation of many languages. The research procedure consists of the following steps:
In addition to describing language variation, MDA results can be used to determine the main registers in a given language (see register classification, which functions as a complement to txtype/genre classification).
Based on the analysis of the Koditex corpus, a model with 8 dimensions was created:
The naming of the dimensions is primarily based on information about which linguistic features contribute most to their establishment (see the inventory of prominent features), and on the position of texts within a given dimension (see the MDAvis tool).
Czech MDA was conducted at Charles University by researchers from the Institute of the Czech National Corpus; it was supported from the ERDF project Language Variation in the CNC no. CZ.02.1.01/0.0/0.0/16_013/0001758.