CorpLingCz | Czech National Corpus

The K-Centre provides advice on all topics related to corpus linguistics or Czech language. Our corpus linguistics expertise includes data formats, annotation, metadata encoding, corpus querying, corpus linguistics methodology and statistical methods, but we can also provide external pointers to other centres regarding any aspect of Czech language including language resources and natural language processing.

We offer the following on-line services:

helpdesk with Q&A: a virtual platform for active user support and feedback;
comprehensive web documentation, manuals and tutorials;
bibliography of CNC-based research outputs;
corpus-based exercises for language teaching at primary and secondary schools (Czech-only).

In addition, the following services can be arranged on demand via the helpdesk:

workshops and training events on various topics;
provision of linguistic data in the form of corpus-derived packages while respecting the limitations that result from agreements with text providers, copyright law and other regulations;
corpus hosting that includes technical processing, quality checks, and public access to the hosted corpus with related services.

Please do not hesitate to contact us, we are ready to help you with your language-related requests!