Please use this identifier to cite or link to this item: https://hdl.handle.net/10321/5838
Title: Speech to speech translation with translatotron : a state of the art review
Authors: Kala, Jules R. 
Adetiba, Emmanuel
Abayom, Abdultaofeek 
Dare, Oluwatobi E. 
Ifijeh, Ayodele H. 
Keywords: Translatotron;BLEU;Cascade;Speech-to-speech
Issue Date: 9-Feb-2025
Source: Kala, J.R. et al. 2025. Speech to speech translation with translatotron: a state of the art review. arXiv preprint arXiv:2502.05980.
Abstract: 
A cascade-based speech-to-speech translation has been considered a benchmark
for a very long time, but it is plagued by many issues, like the time taken to
translate a speech from one language to another and compound errors. These
issues are because a cascade-based method uses a combination of methods such
as speech recognition, speech-to-text translation, and finally, text-to-speech trans
lation. Translatotron, a sequence-to-sequence direct speech-to-speech translation
model was designed by Google to address the issues of compound errors associated
with cascade model. Today there are 3 versions of the Translatotron model: Trans
latotron 1, Translatotron 2, and Translatotron3. The first version was designed as a
proof of concept to show that a direct speech-to-speech translation was possible, it
was found to be less effective than the cascade model but was producing promising
results. Translatotron2 was an improved version of Translatotron 1 with results sim
ilar to the cascade model. Translatotron 3 the latest version of the model is better
than the cascade model at some points. In this paper, a complete review of speech
to-speech translation will be presented, with a particular focus on all the versions
of Translatotron models. We will also show that Translatotron is the best model
to bridge the language gap between African Languages and other well-formalized
languages.
URI: https://hdl.handle.net/10321/5838
Appears in Collections:Research Publications (Systems Science)

Files in This Item:
File Description SizeFormat
Kala_Adetiba et al_2025.pdf32.76 MBAdobe PDFView/Open
Show full item record

Page view(s)

19
checked on Mar 13, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.