Dr Agnieszka Karlińska

Research interests: text analysis, NLP (particularly legal NLP), LLMs, data-centric AI, digital humanities, sociolinguistics

Agnieszka Karlińska conducts research in automatic text processing and analysis, situated at the intersection of computer science and computational linguistics, with occasional forays into digital humanities and computational social sciences. In her Ph.D. thesis, she examined gender bias in forensic psychiatric assessment. At NASK, she is involved in both developing tools for detecting harmful content, such as hate speech, and building safe and reliable large language models. Her work focuses particularly on evaluating LLMs’ tendency to generate toxic and discriminatory content and mitigating biases at various stages of model development, with a special emphasis on data selection and alignment processes.

agnieszka.karlinska@nask.pl

Selected Publications

Articles

Agnieszka Karlińska, Piotr Miłkowski, Paulina Czwordon-Lis, Bartłomiej Koptyra, Jan Kocoń, "Comprehensive Sentiment Analysis of Polish Book Reviews Using Large and Small Language Models", 24th IEEE International Conference on Data Mining Workshops, ICDMW, 2024, 453–462.

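Sławomir Mandes, Agnieszka Karlińska, "W stronę nowej metodologii analizy treści. Podobieństwa i różnice pomiędzy modelowaniem tematycznym i jakościową analizą treści" [Towards a New Methodology of Content Analysis: Similarities and Differences between Topic Modeling and Qualitative Content Analysis], Przegląd Socjologii Jakościowej, 20(4), 2024, 118–143.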

Anna Kołos, Inez Okulska, Kinga Głąbińska, Agnieszka Karlińska, Emilia Wiśnios, Paweł Ellerik, Andrzej Prałat, "BAN-PL: A Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl Web Service", In: Calzolari N, Kan M-Y, Hoste V, Lenci A, Sakti S, Xue N, eds. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia: ELRA and ICCL, 2024, 2107–2118.

Agnieszka Karlińska, Cezary Rosiński, Marek Kubis, Patryk Hubar, Jan Wieczorek, "Using Bibliodata LODification to Create Metadata-Enriched Literary Corpora in Line with FAIR Principles", In: Calzolari N, Kan M-Y, Hoste V, Lenci A, Sakti S, Xue N, eds. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia: ELRA and ICCL, 2024, 17271–17284.
