Composition of weighted finite transducers in MapReduce

Bilal Elghadyry; Faissal Ouardi; Sébastien Verel

doi:10.1186/s40537-020-00397-4

Article Dans Une Revue International Journal of Big Data Intelligence Année : 2021

Composition of weighted finite transducers in MapReduce

(1, 2) , (1) , (2)

1
2

Bilal Elghadyry

Fonction : Auteur
PersonId : 819195
ORCID : 0000-0002-8034-5855

Université Mohammed V de Rabat [Agdal]

Laboratoire d'Informatique Signal et Image de la Côte d'Opale

Faissal Ouardi

Fonction : Auteur
PersonId : 762401
ORCID : 0000-0002-7636-5001

Université Mohammed V de Rabat [Agdal]

Sébastien Verel

Fonction : Auteur
PersonId : 2072
IdHAL : sebastien-verel
ORCID : 0000-0003-1661-4093
IdRef : 11806259X

Laboratoire d'Informatique Signal et Image de la Côte d'Opale

Résumé

Weighted finite-state transducers have been shown to be a general and efficient representation in many applications such as text and speech processing, computational biology, and machine learning. The composition of weighted finite-state transducers constitutes a fundamental and common operation between these applications. The NP-hardness of the composition computation problem presents a challenge that leads us to devise efficient algorithms on a large scale when considering more than two transducers. This paper describes a parallel computation of weighted finite transducers composition in MapReduce framework. To the best of our knowledge, this paper is the first to tackle this task using MapReduce methods. First, we analyze the communication cost of this problem using Afrati et al. model. Then, we propose three MapReduce methods based respectively on input alphabet mapping, state mapping, and hybrid mapping. Finally, intensive experiments on a wide range of weighted finite-state transducers are conducted to compare the proposed methods and show their efficiency for large-scale data.

Mots clés

Finite transducers MapReduce Composition Communication cost

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

bmc_article.pdf (378.18 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte
Licence : CC BY - Paternité

Sébastien Verel : Connectez-vous pour contacter le contributeur

https://ulco.hal.science/hal-03507182

Soumis le : mardi 18 avril 2023-09:05:08

Dernière modification le : mardi 25 avril 2023-11:23:56

Archivage à long terme le : mercredi 19 juillet 2023-18:14:47

Dates et versions

hal-03507182 , version 1 (18-04-2023)

Identifiants

HAL Id : hal-03507182 , version 1
DOI : 10.1186/s40537-020-00397-4

Citer

Bilal Elghadyry, Faissal Ouardi, Sébastien Verel. Composition of weighted finite transducers in MapReduce. International Journal of Big Data Intelligence, 2021, 8 (22), ⟨10.1186/s40537-020-00397-4⟩. ⟨hal-03507182⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LITTORAL LISIC

29 Consultations

17 Téléchargements

Composition of weighted finite transducers in MapReduce

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager