Our language resources

The Norwegian Blog Corpus (2026)
  • The project creates a large collection of authentic, up-to-date texts to address the lack of openly available data representing informal Norwegian. The corpus is compiled at https://github.com/omikke/NBC and contains 18 million words from recent blog texts written in Norway by more than 800 bloggers between 2010 and 2022. It supports usage-based and data-driven research on Norwegian, offering new data and a wide range of search possibilities. Forthcoming at https://www.uit.bloggkorpus.no/
  • Team: Olaf Mikkelsen, Anna Endresen, Taras Andrushko

SnakKIs: The first AI-powered learner chatbot for workplace Norwegian (2026)
  • This innovative tool is designed to help L2 learners of Norwegian master complex multi-word constructions in practical, real-world contexts, with a particular emphasis on workplace-specific language (yrkesnorsk). What makes SnakKIs unique is its ability to engage users through dynamic, purpose-driven conversations, generating specific talking points (in Norwegian, a "snakkis") tailored to their individual needs.
  • Team: Anna Endresen, Jorunn Juliussen Ingilæ, Olaf Mikkelsen, Taras Andrushko

The Norwegian Constructicon (2024-present), also called Språknett
  • The project is building a digital database of Norwegian multi-word language patterns (constructions). So far, 2000 constructions have been collected and made openly available in the preliminary interface at https://constructicon.github.io/norwegian/ . A new interface, where the data will be made available, is being built at https://www.spraknett.no/
  • Team: Anna Endresen, Olaf Mikkelsen, Taras Andrushko

The Russian Constructicon (2016-present)
  • The project has built and continues to expand an open-access electronic resource that offers a searchable database of over 4000 Russian constructions accompanied by thorough descriptions of their properties.
  • The resource is available at https://constructicon.github.io/russian/

Construxercise! Hands-on learning of Russian constructions (2022; 2025-2026)
  • The project created the first full-fledged educational resource that implements the innovative constructionist approach to foreign language teaching and offers a system of construction-based exercises for both classroom and self-guided study of L2 Russian. It provides over 180 exercises on 58 discourse constructions.
  • Available at https://constructicon.github.io/construxercise-rus/
  • Current team: Anna Endresen, Valentina Zhukova
  • Alumni: Taras Andrushko, Zoia Butenko, George Lonshakov, Daria Demidova, Natalia Kalanova, Elena Bjørgve, David Henrik Lavén, Laura A. Janda, and Tatiana Perevoshchikova

The Ukrainian Constructicon (2023-present)
  • The project is building a searchable digital repository of prominent Ukrainian phrase and sentence patterns, i.e. multi-word grammatical constructions. Available at https://constructicon.github.io/ukrainian/
  • Current team: Yuliia Palii, Laura A. Janda
  • Alumni: Anna Endresen, Valentina Zhukova, Zoia Butenko

The Persian Constructicon (2023)