华东师范大学 | 用于小样本命名实体识别的具有自适应边距的元学习三元组网络

Meta-Learning Triplet Network with Adaptive Margins for Few-Shot Named Entity Recognition

Chengcheng Han, Renyu Zhu, Jun Kuang, FengJiao Chen, Xiang Li, Ming Gao, Xuezhi Cao, Wei Wu

East China Normal University & Meituan Inc

用于小样本命名实体识别的具有自适应边距的元学习三元组网络

要点：
1.元学习方法已广泛用于少样本命名实体识别（NER），尤其是基于原型的方法。然而，Other(O)类很难用原型向量表示，因为类中一般有大量样本，语义杂乱。

2.为了解决这个问题，我们提出了 MeTNet，它只为实体类型而不是 O 类生成原型向量。我们设计了一个改进的三元组网络，将样本和原型向量映射到一个更容易分类的低维空间，并为每个实体类型提出自适应边距。边距作为一个半径，在低维空间中控制一个大小自适应的区域。基于这些区域，我们提出了一种新的推理过程来预测查询实例的标签。

一句话总结：

在域内和跨域设置中进行了大量实验，以展示 MeTNet 优于其他最先进方法的优势。特别是，文章发布了从知名电子商务平台提取的中文小样本 NER 数据集 FEW-COMM。据目前所知，这是第一个中国的小样本 NER 数据集。所有数据集和代码都在 https://github.com/hccngu/MeTNet上提供。[机器学习+人工翻译]

Meta-learning methods have been widely used in few-shot named entity recognition (NER), especially prototype-based methods. However, the Other(O) class is difficult to be represented by a prototype vector because there are generally a large number of samples in the class that have miscellaneous semantics. To solve the problem, we propose MeTNet, which generates prototype vectors for entity types only but not O-class. We design an improved triplet network to map samples and prototype vectors into a low-dimensional space that is easier to be classified and propose an adaptive margin for each entity type. The margin plays as a radius and controls a region with adaptive size in the low-dimensional space. Based on the regions, we propose a new inference procedure to predict the label of a query instance. We conduct extensive experiments in both in-domain and cross-domain settings to show the superiority of MeTNet over other state-of-the-art methods. In particular, we release a Chinese few-shot NER dataset FEW-COMM extracted from a well-known e-commerce platform. To the best of our knowledge, this is the first Chinese few-shot NER dataset. All the datasets and codes are provided at https://github.com/hccngu/MeTNet.

https://arxiv.org/pdf/2302.07739.pdf

内容中包含的图片若涉及版权问题，请及时与我们联系删除

华东师范大学 | 用于小样本命名实体识别的具有自适应边距的元学习三元组网络

评论列表

评论