GPT-SW3 validation project

Can a large-scale Swedish generative language model reduce the threshold for using advanced language models to solve text processing tasks across a variety of sectors and use cases?

Final Seminar

On September 26, 2024, a final seminar for the validation project was held at AI Sweden's office in Stockholm. Watch the recording (in Swedish):

Challenges

Several projects, especially “Language Models for Swedish Authorities,” have shown a tangible need for powerful Swedish language models in both the public and private sectors. However, individual actors find it hard to adapt and deploy models for their own organizational needs because of difficulty attracting skilled staff, poor data readiness, and difficulty acquiring suitable hardware.

AI Sweden believes that foundational models are a promising way to tackle these challenges. They have the potential to transform traditional ways of solving text-processing needs.

Project purpose

This project tests the hypothesis that foundational models can reduce the threshold for using advanced language models to solve text-processing tasks in the private and public sectors. 

It investigates the possibility of using a large-scale Swedish generative language model to create a general solution for text-processing tasks that can be used by many different stakeholders. The project is based on the large-scale generative language model GPT-SW3, which AI Sweden is developing in collaboration with RISE, WASP WARA Media & Language, and NVIDIA. The project will make GPT-SW3 available via an API and a user-friendly web-based interface, develop solutions for text-processing tasks (e.g. through prompting and p-tuning), and validate the use of the model across various use cases, each with a clearly identified need owner from the public sector, industry, or academia.

The project has two key innovations: the API and web-based interface for the model, and the validation of the model on a large number of different text-processing tasks.

How? 

The project brings together stakeholders from the public sector, industry, and academia. AI Sweden and RISE will develop the API solution as the first step of the project. In the next step, the project focuses on different use cases in the stakeholders’ organizations. 

These use cases focus on text-processing applications such as text generation, summarization, categorization, and information extraction.
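As a rough illustration of how a task such as categorization can be framed for a generative model through prompting, the sketch below assembles a few-shot prompt and parses the model's answer. It is not taken from the project: the function names, the "Text"/"Category" labels, and the example messages are all hypothetical, and no call to the GPT-SW3 API is shown because its interface is not described here.

```python
from typing import List, Optional, Tuple


def build_few_shot_prompt(
    instruction: str,
    examples: List[Tuple[str, str]],
    query: str,
    input_label: str = "Text",        # hypothetical field name
    output_label: str = "Category",   # hypothetical field name
) -> str:
    """Assemble a few-shot prompt: instruction, worked examples, open query."""
    parts = [instruction.strip(), ""]
    for text, label in examples:
        parts.append(f"{input_label}: {text}")
        parts.append(f"{output_label}: {label}")
        parts.append("")
    # Leave the final output label open so the model completes it.
    parts.append(f"{input_label}: {query}")
    parts.append(f"{output_label}:")
    return "\n".join(parts)


def parse_label(completion: str, allowed: List[str]) -> Optional[str]:
    """Map the first non-empty line of a completion to an allowed label.

    Returns None if the completion is empty or names an unknown label.
    """
    for line in completion.splitlines():
        candidate = line.strip().rstrip(".").lower()
        if candidate:
            break
    else:
        return None
    for label in allowed:
        if candidate == label.lower():
            return label
    return None


if __name__ == "__main__":
    # Made-up example data, for illustration only.
    prompt = build_few_shot_prompt(
        "Classify each message.",
        [("The bus was late again.", "Transport"),
         ("My bin was not emptied.", "Waste")],
        "When is the next train?",
    )
    print(prompt)
    # A completion returned by some model would then be parsed like this:
    print(parse_label(" transport.\nText: ...", ["Transport", "Waste"]))
```

The same template pattern could be adapted to summarization or information extraction by changing the instruction and the field labels; validating the parsed answer against a fixed label set is one simple way to guard against free-form output.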

The work will take place iteratively: AI Sweden and RISE validate and improve the API solution with the help of results from the various use cases. The stakeholders are responsible for their respective use cases with the support of AI Sweden and RISE.

Expected outcomes

If this project succeeds, it will drastically reduce the threshold for using advanced language models to solve text-processing tasks. This will lead to disruptive change and faster adoption of AI-driven Swedish language technology. Stakeholders will no longer need to invest in costly processes for collecting training data for their specific needs, nor will they have to purchase and operate expensive hardware. The results and resources produced within this project will be applicable in virtually all sectors and industries, and the aim is for the technology to be used as broadly as possible.

  • A lower threshold for using advanced language models to solve text-processing tasks in the public and private sectors

  • Cost and time savings for stakeholders

  • Better services for citizens and customers

  • Greater AI competence in the field of foundation models, which is valuable for Swedish competitiveness and societal benefit

Handbook for prompting

This handbook gives an overall explanation of what a language model is, introduces prompt engineering, and demonstrates its potential by walking through some simple applications. It can be used as a tool for creating effective prompts on your own and, we hope, for developing and improving specific applications.

The handbook is a work in progress and will be updated during the course of the project.

Check out the handbook for prompting

Status Update: GPT-SW3: a foundational model for Swedish NLP

Presentation at the reference group meeting "Offentlig sektor och tillämpad språkteknologi" at AI Sweden
Magnus Sahlgren, AI Sweden

June 8, 2022

Status update: GPT-SW3 – En svensk basmodell för texthantering / valideringsprojekt (GPT-SW3: a Swedish foundation model for text processing / validation project)

Presentation at the reference group meeting “Offentlig sektor och tillämpad språkteknologi” at AI Sweden
Magnus Sahlgren, AI Sweden

December 15, 2022

Contact

Ariel Ekgren
Senior Research Scientist
+46 (0)70-793 13 94

Ongoing projects NLU

A shared digital assistant for the public sector

This is a collaboration between Swedish authorities, municipalities, regions, and industry, coordinated by AI Sweden. The objective is to promote national cooperation on AI for text tasks and create...

DeployAI

The project DeployAI will build, deploy, and launch a fully operational AI-on-demand platform (AIoDP) promoting trustworthy, ethical, and transparent European AI solutions for use in the industry...

EuroLingua-GPT

AI Sweden and the Fraunhofer Institute for Intelligent Analysis and Information Systems are together training a new series of large language models (LLMs) for all of the official European languages.

FormAI

FormAI explores the use of generative AI coupled with formal verification for developing secure and trustworthy software in the automotive domain.

Multimodal language model

AI Sweden's language team is taking the next major step by initiating the development of Sweden's first large multimodal language model. The new model is expected, just like GPT-SW3, to become an...

Next generation infrastructure

Within the Next generation infrastructure project, AI Sweden is developing next-generation infrastructure for the training, deployment, and iterative improvement of foundation models by addressing...

TrustLLM

TrustLLM will develop European large language models (LLMs) on an unprecedented scale, trained on the largest amount of text so far in European AI, covering a range of underrepresented languages, and...