AI Sweden is now releasing GPT-SW3, the first large Nordic language model. It is available as an open model for businesses and organizations to use in their products and services. The model provides Sweden with a crucial resource that strengthens our ability to use AI for increased competitiveness and a stronger welfare system. Developed by AI Sweden in collaboration with WASP and RISE, GPT-SW3 is built specifically for Swedish and the other Nordic languages.
GPT-SW3 can be used for a variety of tasks such as building solutions for text analysis, classification, text generation, text processing, content moderation, and chat functionalities.
As an open model, GPT-SW3 addresses several obstacles that both the private and public sectors face with other available large language models. These obstacles include ambiguity about training data, unclear licensing terms, and models that are accessible only as cloud services.
There is a significant need for a model trained on Swedish text with clear licensing terms that can be fine-tuned to meet specific requirements and run on dedicated hardware, both for legal reasons and business competitiveness. GPT-SW3 fulfills all these needs, thus opening the doors to a multitude of new applications within Swedish businesses and the public sector.
Various studies on generative AI consistently highlight significant productivity gains. A 2019 report by DIGG estimated that AI could contribute an annual value of 140 billion SEK to the public sector. That estimate predates the widespread adoption of generative AI.
Many organizations in both the private and public sectors are already working with generative AI, including language models. For example, Region Halland is exploring how language models can help summarize medical records, predict adverse events, code journals, extract information from free text, and create discharge messages for patients.
GPT-SW3 is a generative pre-trained model. By analyzing vast amounts of text, the language model learns statistical patterns in language. These patterns can then be used to generate text: given an input text of any length, a GPT model produces a statistically plausible continuation.
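The idea of a statistically plausible continuation can be illustrated with a deliberately tiny sketch. This is not how GPT-SW3 is implemented: it assumes a toy word-pair (bigram) counter in place of a neural network, and the corpus and function names are invented for illustration. It only shows the principle of learning which words tend to follow which, then extending a prompt with the most likely next word.

```python
from collections import Counter, defaultdict


def train_bigram_model(text):
    """Count, for each word, which words follow it in the training text."""
    words = text.lower().split()
    followers = defaultdict(Counter)
    for current, following in zip(words, words[1:]):
        followers[current][following] += 1
    return followers


def continue_text(model, prompt, max_new_words=5):
    """Greedily extend the prompt with the most frequent follower of its last word."""
    words = prompt.lower().split()
    if not words:
        return ""
    for _ in range(max_new_words):
        candidates = model.get(words[-1])
        if not candidates:
            break  # the last word was never seen with a follower
        words.append(candidates.most_common(1)[0][0])
    return " ".join(words)


# Hypothetical toy corpus; a real model is trained on vastly more text.
corpus = "the model reads text . the model writes text . the model reads text ."
model = train_bigram_model(corpus)
print(continue_text(model, "the", max_new_words=3))  # prints "the model reads text"
```

A large language model replaces the word-pair counts with a neural network that conditions on the whole preceding text, and it typically samples from the predicted distribution rather than always picking the single most likely word.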
However, it's important to note that GPT-SW3 is not a ready-to-use product or service. To harness its capabilities, developers must build something using GPT-SW3 as their foundation.
Developers can access different versions of GPT-SW3 on Hugging Face, along with the license and terms of use.
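As a rough sketch of what a first experiment might look like, the function below loads a model through the Hugging Face transformers library and generates a continuation. The repository name is an assumption and should be checked against the listing on Hugging Face. The sketch also assumes that transformers and PyTorch are installed and that the license and terms of use have been accepted where access requires it.

```python
# Assumed repository name; verify it against the model listing on Hugging Face.
DEFAULT_MODEL_ID = "AI-Sweden-Models/gpt-sw3-126m"


def generate_continuation(prompt, model_id=DEFAULT_MODEL_ID, max_new_tokens=40):
    """Load a causal language model and return the prompt plus a generated continuation."""
    # Imported here so that defining the function does not require the library.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Convert the prompt to token ids and let the model extend the sequence.
    input_ids = tokenizer(prompt, return_tensors="pt")["input_ids"]
    output_ids = model.generate(input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the model weights on first use):
# print(generate_continuation("Träd är fina för att"))
```

Starting with the smallest available version keeps the download and memory requirements low while the surrounding application is being built.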
GPT-SW3 is the result of a collaboration between AI Sweden, WASP, and RISE with funding from Vinnova.