Dr Agnieszka Karlińska

Agnieszka Karlinska
Research interests: text analysis, NLP, particularly legal NLP, LLM, data-centric AI, digital humanities, sociolinguistics

Agnieszka Karlińska conducts research in automatic text processing and analysis, situated at the intersection of computer science and computational linguistics, with occasional forays into digital humanities and computational social sciences. In her Ph.D. thesis, she examined gender bias in forensic psychiatric assessment. At NASK, she is involved in both developing tools for detecting harmful content, such as hate speech, and building safe and reliable large language models. Her work focuses particularly on evaluating LLMs’ tendency to generate toxic and discriminatory content and mitigating biases at various stages of model development, with a special emphasis on data selection and alignment processes.

Selected Publications

Articles

Karolina Seweryn, Anna Kołos, Agnieszka Karlińska, Katarzyna Lorenc, Katarzyna Dziewulska, Maciej Chrabaszcz, Aleksandra Krasnodebska, Paula Betscher, Zofia Cieślińska, Katarzyna Kowol, Julia Moska, Dawid Motyka, Paweł Walkowiak, Bartosz Żuk, Arkadiusz Janz, "PLLuM-Align: Polish Preference Dataset for Large Language Model Alignment", Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, , 2025, 23890–23919.
Anna Kołos, Katarzyna Lorenc, Emilia Wiśnios, Agnieszka Karlińska, "Behind Closed Words: Creating and Investigating the forePLay Annotated Dataset for Polish Erotic Discourse", Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Volume 1: Long Papers, 2025, 2416–2432.
Maja Sawicka, Agnieszka Karlińska, "Agency Attributions under a Normative Crisis: Corpus Analysis of Emerging Frameworks of Meaning during the COVID-19 Pandemic in Poland", East European Politics and Societies, , 2025,
Agnieszka Karlińska, Piotr Miłkowski, Paulina Czwordon-Lis, Bartłomiej Koptyra, Jan Kocoń, "Comprehensive Sentiment Analysis of Polish Book Reviews Using Large and Small Language Models", 24th IEEE International Conference on Data Mining Workshops, ICDMW, 2024, 453-462.
Sławomir Mandes, Agnieszka Karlińska, "W stronę nowej metodologii analizy treści. Podobieństwa i różnice pomiędzy modelowaniem tematycznym i jakościową analizą treści", Przegląd Socjologii Jakościowej, 20(4), 2024, 118-143.
Anna Kołos, Inez Okulska, Kinga Głąbińska, Agnieszka Karlinska, Emilia Wiśnios, Paweł Ellerik, Andrzej Prałat, "BAN-PL: A Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl Web Service", In: Calzolari N, Kan M-Y, Hoste V, Lenci A, Sakti S, Xue N, eds. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia: ELRA and ICCL, 2024, 2107–2118.
Agnieszka Karlinska, Cezary Rosiński, Marek Kubis, Patryk Hubar, Jan Wieczorek, "Using Bibliodata LODification to Create Metadata-Enriched Literary Corpora in Line with FAIR Principles", In: Calzolari N, Kan M-Y, Hoste V, Lenci A, Sakti S, Xue N, eds. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia: ELRA and ICCL, 2024, 17271–17284.