Author
Listed:
- Huan Zhang
(School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China)
- Long Zhou
(School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China)
- Miaomiao Gu
(Institute of Machine Intelligence, University of Shanghai for Science and Technology, Shanghai 200093, China)
Abstract
Continual Learning for Named Entity Recognition (CL-NER) is a crucial task in recognizing emerging concepts when constructing real-world natural language processing applications. It involves sequentially updating an existing NER model with new entity types while retaining previously learned information. However, current CL methods are struggling with a major challenge called catastrophic forgetting. Owing to the semantic shift of the non-entity type, the issue is further intensified in NER. Most existing CL-NER methods rely on knowledge distillation through the output probabilities of previously learned entities, resulting in excessive stability (recognition of old entities) at the expense of plasticity (recognition of new entities). Some recent works further extend these methods by improving the distinction between old entities and non-entity types. Although these methods result in overall performance improvements, the preserved knowledge does not necessarily ensure the retention of task-related information for the oldest entities, which can lead to significant performance drops. To address this issue while maintaining overall performance, we propose a method called Confident Soft-Label Imitation (ConSOLI) for continual learning in NER. Inspired by methods that balance stability and plasticity, ConSOLI incorporates a soft-label distillation process and confident soft-label imitation learning. The former helps to gather the task-related knowledge in the old model and the latter further preserves the knowledge from diluting in the step-wise continual learning process. Moreover, ConSOLI demonstrates significant improvements in recognizing the oldest entity types, achieving Micro-F1 and Macro-F1 scores of up to 8.72 and 9.72, respectively, thus addressing the challenge of catastrophic forgetting in CL-NER.
Suggested Citation
Huan Zhang & Long Zhou & Miaomiao Gu, 2024.
"Reduced Forgetfulness in Continual Learning for Named Entity Recognition Through Confident Soft-Label Imitation,"
Mathematics, MDPI, vol. 12(24), pages 1-22, December.
Handle:
RePEc:gam:jmathe:v:12:y:2024:i:24:p:3964-:d:1545736
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:24:p:3964-:d:1545736. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.