Talk 1

Title:FGN: Fusion Glyph Network for Chinese Named Entity Recognition

Speaker: Jia Shi


As pictographs, Chinese characters contain latent glyph information, which is often overlooked. In this paper, we propose the FGN1, Fusion Glyph Network for Chinese NER. Except for encoding glyph information with a novel CNN, this method may extract interactive information between character distributed representation and glyph representation by a fusion mechanism. The major innovations of FGN include: (1) a novel CNN structure called CGSCNN is proposed to capture glyph information and interactive information between the neighboring graphs. (2) we provide a method with sliding window and attention mechanism to fuse the BERT representation and glyph representation for each character. This method may capture potential interactive knowledge between context and glyph. Experiments are conducted on four NER datasets, showing that FGN with LSTM-CRF as tagger achieves new state-of-the-art performance for Chinese NER. Further, more experiments are conducted to investigate the influences of various components and settings in FGN.

Supervisor: Zhangyu Cao

Talk 2

Title: Learning in the Frequency Domain

Speaker: Ji Li


Deep neural networks have achieved remarkable success in computer vision tasks. Existing neural networks mainly operate in the spatial domain with fixed input sizes. For practical applications, images are usually large and have to be downsampled to the predetermined input size of neural networks. Even though the downsampling operations reduce computation and the required communication bandwidth, it removes both redundant and salient information obliviously, which results in accuracy degradation. Inspired by digital signal processing theories, we analyze the spectral bias from the frequency perspective and propose a learning-based frequency selection method to identify the trivial frequency components which can be removed without accuracy loss. The proposed method of learning in the frequency domain leverages identical structures of the wellknown neural networks, such as ResNet-50, MobileNetV2, and Mask R-CNN, while accepting the frequency-domain information as the input. Experiment results show that learning in the frequency domain with static channel selection can achieve higher accuracy than the conventional spatial downsampling approach and meanwhile further reduce the input data size. Specifically for ImageNet classification with the same input size, the proposed method achieves 1.60% and 0.63% top-1 accuracy improvements on ResNet-50 and MobileNetV2, respectively. Even with half input size, the proposed method still improves the top-1 accuracy on ResNet-50 by 1.42%. In addition, we observe a 0.8% average precision improvement on Mask R-CNN for instance segmentation on the COCO dataset.

Supervisor: Ji Li


Time:16:00  September 30, 2021

Address:MingLi Buliding C1102

Chair: Li Gou