{"id":11265,"date":"2025-02-25T15:00:00","date_gmt":"2025-02-25T14:00:00","guid":{"rendered":"https:\/\/haimagazine.com\/uncategorized\/new-polish-language-model-pllum\/"},"modified":"2025-06-26T15:10:25","modified_gmt":"2025-06-26T13:10:25","slug":"new-polish-language-model-pllum","status":"publish","type":"post","link":"https:\/\/haimagazine.com\/en\/ai-news\/new-polish-language-model-pllum\/","title":{"rendered":"\ud83d\udd12 New Polish language model: PLLuM"},"content":{"rendered":"<p>On February 24, 2025, a conference was held at the Ministry of Digital Affairs where the family of Polish language models, PLLuM (<em>Polish Large Language Model<\/em>), was officially presented. The conference marked the culmination of the first phase of a project developed by a consortium of six scientific institutions, led by the Wroc\u0142aw University of Technology under the auspices of the government (with a budget of 14.5 million zlotys). The goal was to build comprehensive Polish LLMs in order to support public administration and academic communities in the digitization process, as well as Polish business in terms of innovation.<\/p><p class=\"has-text-align-center\"> <img loading=\"lazy\" decoding=\"async\" width=\"800\" height=\"377\" class=\"wp-image-8950\" style=\"width: 800px;\" src=\"https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/droga.png\" alt=\"\" srcset=\"https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/droga.png 1464w, https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/droga-300x141.png 300w, https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/droga-1024x482.png 1024w, https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/droga-768x361.png 768w, https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/droga-600x282.png 600w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><br\/>Stages of Development of PLLuM, Ministry of Digital Affairs<\/p><p>The PLLuM family utilizes from 8 to 70 billion parameters depending on the specific model. The smaller ones are designed for simple tasks, such as creating a bot to manage websites. Larger models are better suited for areas where contextual coherence in understanding the Polish language is needed, for example in scientific research or processing official documentation. <strong>All versions are based on ethically sourced data<em>\u2014<\/em>under licenses, the amended copyright law, and EU regulations. <\/strong>Additionally, scientific models (which aren&#8217;t approved for commercial use) use publicly available data sets such as Common Crawl. This makes them even more efficient.<\/p><blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\"><p class=\"has-text-align-center\"><em>\u201cTogether with the Bielik model, they can promote artificial intelligence developed in Poland, supporting each other in a better training process and further data acquisition and opening, which is necessary for #AIMadeInPoland to keep improving\u2014for public administration, business and society.\u201d <\/em><br\/>Ministry of Digital Affairs<em>.<\/em><\/p><\/blockquote><p>An interesting moment of the conference was the confirmation that one of the PLLuM models (8&#215;7-nc-chat) outperforms in <a href=\"https:\/\/huggingface.co\/spaces\/sdadas\/plcc\" target=\"_blank\" rel=\"noopener\"><mark style=\"background-color:#82D65E\" class=\"has-inline-color has-dark-gray-color\">competency tests<\/mark><\/a> benchmarks of counterparts like GPT-4-turbo or DeepSeek R1-Llama-70B. This confirms that the success of the latter was not accidental and that massive investments aren&#8217;t necessary to create efficient LLMs that greatly understand the nuances of our language and culture. Similarly, in the area of security, the Polish model (12b-chat) performs well when it comes to resistance to disruptions (e.g., imprecise user queries).<\/p><p>The Ministry has announced that for 2025, 19 million PLN has been secured for further development of the project and research. The HIVE consortium, which is in charge of implementing the project (this time led by NASK), has expanded by two new partners: the Central Informatics Center and the Academic Computer Center Cyfronet AGH. Currently, the inclusion of private companies in the consortium is legally limited, but efforts are underway to change the regulations to allow this.<\/p><p>The following implementations of models in the public administration are planned for 2025<strong>: <\/strong><\/p><ul class=\"wp-block-list\"><li><strong>Smart administrative assistant<\/strong> <em>\u2013<\/em> it&#8217;s designed to help employees navigate through the maze of bureaucratic regulations (tests are already underway at the Ministry of Digital Affairs).<\/li>\n\n<li><strong>Assistant for the mObywatel app <\/strong><em>\u2013<\/em> especially in the context of its expanding functionalities.<\/li>\n\n<li>Work will also begin on <strong>developing solutions in education support<\/strong> for teachers using AI.<\/li><\/ul><p>It&#8217;s also important that the implementations themselves (e.g., in mObywatel) will be funded with additional resources. This allows the consortium to invest the budget in further training the models and working on their safety.<\/p><p>Everyone can now test PLLuM on the page <mark style=\"background-color:#82D65E\" class=\"has-inline-color has-contrast-color\"><a href=\"https:\/\/pllum.clarin-pl.eu\/\" target=\"_blank\" rel=\"noopener\">https:\/\/pllum.clarin-pl.eu\/<\/a><\/mark> (the chat uses 2 models: 12b and 8x7b). The entire model library is available at <a href=\"https:\/\/huggingface.co\/CYFRAGOVPL\" target=\"_blank\" rel=\"noopener\"><mark style=\"background-color:#82D65E\" class=\"has-inline-color has-dark-gray-color\">https:\/\/huggingface.co\/CYFRAGOVPL<\/mark><\/a><em>\u2014<\/em>anyone with the necessary computing power and skills can launch the model that best meets the needs of their research or commercial project.<\/p><p>We asked the PLLuM model what it wishes for itself and for us:<\/p><p class=\"has-text-align-center\"> <img loading=\"lazy\" decoding=\"async\" width=\"800\" height=\"367\" class=\"wp-image-8954\" style=\"width: 800px;\" src=\"https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/belik.png\" alt=\"\" srcset=\"https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/belik.png 1178w, https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/belik-300x138.png 300w, https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/belik-1024x469.png 1024w, https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/belik-768x352.png 768w, https:\/\/haimagazine.com\/wp-content\/uploads\/2025\/02\/belik-600x275.png 600w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><\/p><p>The entire conference is available for viewing at: <a href=\"https:\/\/www.youtube.com\/watch?v=m9gyLQTX820&amp;t=2s\" target=\"_blank\" rel=\"noopener\"><mark style=\"background-color:#82D65E\" class=\"has-inline-color has-contrast-color\">https:\/\/www.youtube.com\/watch?v=m9gyLQTX820&amp;t=2s<\/mark><\/a><\/p><p><\/p>","protected":false},"excerpt":{"rendered":"<p>PLLuM has been unveiled! As a result of the work by a scientific consortium on a project funded by the Ministry of Digital Affairs, a fully native LLM has been developed with the mission to support officials, businesses and citizens in the AI revolution.<\/p>\n","protected":false},"author":10,"featured_media":8956,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"rank_math_lock_modified_date":false,"footnotes":""},"categories":[782],"tags":[83,447,715,728],"popular":[],"difficulty-level":[36],"ppma_author":[352],"class_list":["post-11265","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-news","tag-ai","tag-ai-4","tag-artificial-intelligence","tag-polish-language-model","difficulty-level-easy"],"acf":[],"authors":[{"term_id":352,"user_id":10,"is_guest":0,"slug":"seweryn-jakubiec","display_name":"Seweryn Jakubiec","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/9f6a221b4ee0d45f9cb264964464c87cc2036e4466dc908a6ec21be51baff707?s=96&d=mm&r=g","first_name":"Seweryn","last_name":"Jakubiec","user_url":"","job_title":"","description":"Senior Product Manager w bran\u017cy IT, obserwator \u015bwiata tech i AI, muzyk-amator, wielbiciel kot\u00f3w rasy Devon Rex"}],"_links":{"self":[{"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/posts\/11265","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/comments?post=11265"}],"version-history":[{"count":1,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/posts\/11265\/revisions"}],"predecessor-version":[{"id":11266,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/posts\/11265\/revisions\/11266"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/media\/8956"}],"wp:attachment":[{"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/media?parent=11265"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/categories?post=11265"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/tags?post=11265"},{"taxonomy":"popular","embeddable":true,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/popular?post=11265"},{"taxonomy":"difficulty-level","embeddable":true,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/difficulty-level?post=11265"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/haimagazine.com\/en\/wp-json\/wp\/v2\/ppma_author?post=11265"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}