Hu Mingxuan, He Min, Su Wei et Chehri Abdellah. (2020). A TextCNN and WGAN-gp based deep learning frame for unpaired text style transfer in multimedia services. Multimedia Systems,
Le texte intégral n'est pas disponible pour ce document.
URL officielle: http://dx.doi.org/doi:10.1007/s00530-020-00714-0
Résumé
With the rapid growth of big multimedia data, multimedia processing techniques are facing some challenges, such as knowledge understanding, semantic modeling, feature representation, etc. Hence, based on TextCNN and WGAN-gp (improved training of Wasserstein GANs), a deep learning framework is suggested to improve the efficiency of discriminating the specific style features and the style-independent content features in unpaired text style transfer for multimedia services. To redact a sentence with the requested style and preserve the style-independent content, the encoder-decoder framework is usually adopted. However, lacking of same-content sentence pairs with different style for training, some works fail to capture the original content and generate satisfied style properties accurately in the transferred sentences. In this paper, we adopt TextCNN to extract the style features in the transferred sentences, and align the style features with the target style label by the generator (encoder and decoder). Meanwhile, WGAN-gp is utilized subtly to preserve the content features of original sentences. Experiments demonstrate that the performances of our framework on automatic evaluation and human evaluation are much better than the former works. Thus, it provides an effective method for unpaired text style transfer in multimedia services.
Type de document: | Article publié dans une revue avec comité d'évaluation |
---|---|
Version évaluée par les pairs: | Oui |
Date: | 23 Novembre 2020 |
Sujets: | Sciences naturelles et génie > Génie Sciences naturelles et génie > Génie > Génie informatique et génie logiciel Sciences naturelles et génie > Sciences appliquées |
Département, module, service et unité de recherche: | Départements et modules > Département des sciences appliquées > Module d'ingénierie |
Mots-clés: | big multimedia data, TextCNN, WGAN-gp, unpaired text style transfer, multimedia services, big multimédias, transfert de style de texte non apparié, services multimédias |
Déposé le: | 27 avr. 2021 18:21 |
---|---|
Dernière modification: | 27 avr. 2021 18:21 |
Éditer le document (administrateurs uniquement)