Resilience of Named Entity Recognition Models Against Adversarial Attacks

Authors

  • Paweł Walkowiak Wroclaw University of Science and Technology

Abstract

Adversarial Attacks are actions that aims to mislead models by introducing subtle and often imperceptible changes in model's input. Providing resilience for such kind of risk is key for all Natural Language Processing (NLP) task specific models. Current state of the art solution for one of NLP task Named Entity Recognition (NER) is usage of transformer based solutions. Previous solution where based on Conditional Random Fields (CRF).This research aims to investigate and compare the robustness of both transformer-based and CRF-based NER models against adversarial attacks. By subjecting these models to carefully crafted perturbations, we seek to understand how well they can withstand attempts to manipulate their input and compromise their performance. This comparative analysis will provide valuable insights into the strengths and weaknesses of each architecture, shedding light on the most effective strategies for enhancing the security and reliability of NER systems.

Additional Files

Published

2025-07-09

Issue

Section

Cryptography and Cybersecurity