Dr Agnieszka Karlińska
Research interests: text analysis, NLP (particularly legal NLP), LLMs, data-centric AI, digital humanities, sociolinguistics
Agnieszka Karlińska conducts research in automatic text processing and analysis, situated at the intersection of computer science and computational linguistics, with occasional forays into digital humanities and computational social sciences. In her Ph.D. thesis, she examined gender bias in forensic psychiatric assessment. At NASK, she is involved in both developing tools for detecting harmful content, such as hate speech, and building safe and reliable large language models. Her work focuses particularly on evaluating LLMs’ tendency to generate toxic and discriminatory content and mitigating biases at various stages of model development, with a special emphasis on data selection and alignment processes.
Selected Publications
Articles
Anna Kołos, Inez Okulska, Kinga Głąbińska, Agnieszka Karlińska, Emilia Wiśnios, Paweł Ellerik, Andrzej Prałat, "BAN-PL: A Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl Web Service", In: Calzolari N, Kan M-Y, Hoste V, Lenci A, Sakti S, Xue N, eds. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia: ELRA and ICCL, 2024, 2107–2118.
Agnieszka Karlińska, Cezary Rosiński, Marek Kubis, Patryk Hubar, Jan Wieczorek, "Using Bibliodata LODification to Create Metadata-Enriched Literary Corpora in Line with FAIR Principles", In: Calzolari N, Kan M-Y, Hoste V, Lenci A, Sakti S, Xue N, eds. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia: ELRA and ICCL, 2024, 17271–17284.