Knowledge Agora



Scientific Article details

Title Multimodal Data Processing Framework for Smart City: A Positional-Attention Based Deep Learning Approach
ID_Doc 39150
Authors Ma, QX; Nie, YF; Song, JY; Zhang, T
Title Multimodal Data Processing Framework for Smart City: A Positional-Attention Based Deep Learning Approach
Year 2020
Published
DOI 10.1109/ACCESS.2020.3041447
Abstract In the past few years, edge computing has brought tremendous convenience to the development of smart cities, releasing computation pressure to edge compute nodes. However, a series of problems, such as the explosive growth of smart devices and limited spectrum resources, still greatly limit the application of edge computing. Different types of end devices generate and collect multimodal information, and substantial data is transmitted to upper nodes. Multimodal machine learning methods process data at edge nodes, and only high-level features are uploaded to the cloud in order to save bandwidth. In this article, we propose a novel multimodal data processing framework based on multiple attention mechanisms. Two distinct attention mechanisms are used to capture inter and intra-modality dependencies and align different modalities together. We conduct experiments on image captioning, a core research hotspot in multimodal machine learning. A unified hierarchical structure extracts features from images and natural language. Matching attention aligns visual and textual information. Besides, we propose a new attention mechanism, positional attention, which finds the relationship of each element within one sensory modality. The hierarchical structure realizes parallel computation in the training phase and speeds up the training of the model. Experiments and analysis demonstrate significant improvements over baselines, proving the effectiveness of our method.
Author Keywords Smart cities; Edge computing; Cloud computing; Servers; Deep learning; Data processing; Task analysis; Attention mechanism; convolutional neural network; smart city; multimodal machine learning
Index Keywords Index Keywords
Document Type Other
Open Access Open Access
Source Science Citation Index Expanded (SCI-EXPANDED)
EID WOS:000597180800001
WoS Category Computer Science, Information Systems; Engineering, Electrical & Electronic; Telecommunications
Research Area Computer Science; Engineering; Telecommunications
PDF https://ieeexplore.ieee.org/ielx7/6287639/6514899/09274421.pdf
Similar atricles
Scroll