Language and Personalization

LAP

Language and Personalization

This work package is a new construct, made up by the combination of the previous work packages on personalization (PERS) and natural language processing (LANG). The work in the work package will initially follow these two directions somewhat separately, for later to be tighter combined.

The purpose for this work package is to develop personalization techniques and Scandinavian language processing capabilities to provide personalized content generation and:

  1. Develop truly explainable, fair and transparent personalization techniques
  2. Enable proactivity in customer relations
  3. Provide an individualized experience that provably respects privacy concerns
  4. Develop individualized content
  5. Develop large-scale Scandinavian language models
  6. Enable human-like content creation and conversations

Personalization and contextualization have been successfully employed in diverse applications over the past decade, and currently see an extended usage, for instance in proactive interaction with customers and individualization of news stories. LAP will contribute to developing such systems while ensuring that the system usage will be ethical and respecting users’ requirements for privacy, fairness and accountability. 

Building Scandinavian language models requires the compilation of large-scale reusable language resources, including general-purpose corpora from public sources (e.g., news and social media) as well as industry- and domain-specific text collections. We will address the scarcity of the latter by pre-training on the former and developing transfer learning methods. These large-scale language models will then be utilized in real-life scenarios by formulating a number of specific summarization, explanation, and conversational tasks based on our partners’ use-cases. LAP will develop appropriate evaluation methodology with user-oriented evaluation measures and objectives. It will thus contribute to providing measurable quantification of the amount of domain-specific training material needed in order to provide a language service that is of sufficiently high quality.


People

People

person-portlet

Projects

Projects

Projects

Short description: Based on a very large corpus consisting of newspaper articles from most Norwegian newspapers, NorwAI will create the largest Norwegian language model built so far, enabling new opportunities within areas as chatbots and text summarization.

Time perspective: 2020-

Involved partners: 

Logo Retriever

Logo Schibsted

Logo NR

Logo UiO

Logo DNB

Logo UiS

Stories

Stories

New Language Models in NorwAI

New Language Models in NorwAI

Photo. Jon Atle Gulla

The NorwAI center is determined to provide new Norwegian language models that are significantly larger and better than what is available to-day and can easily be employed in advanced Norwegian NLP applications for industrial use, says center director professor Jon Atle Gulla.

Read more