Training for Italian and German works, but Portuguese and Russian do not. #244

frixos25 · 2025-02-12T19:56:01Z

I am trying to train a model in German, Italian, Portuguese, and Russian, but only Italian and German produced successful results. The training for Portuguese and Russian resulted in unusable model files. The phoneme extractor works fine in all these languages. What can I do to successfully train the model for Portuguese and Russian?
I also modifed the files portuguese.py and portuguese_bert.py with the link model_id = 'neuralmind/bert-base-portuguese-cased' and the same in the other languages. The phonemes seems to be accurate.

yukiarimo · 2025-02-19T18:33:24Z

Will update you when implemented. I’m currently working on rewriting everything, so you can check out my YunaTTS repo

gdurifw · 2025-02-21T17:27:03Z

I am trying to train a model in German, Italian, Portuguese, and Russian, but only Italian and German produced successful results. The training for Portuguese and Russian resulted in unusable model files. The phoneme extractor works fine in all these languages. What can I do to successfully train the model for Portuguese and Russian? I also modifed the files portuguese.py and portuguese_bert.py with the link model_id = 'neuralmind/bert-base-portuguese-cased' and the same in the other languages. The phonemes seems to be accurate.

hi, can you explain how to do the training to add the Italian language?
Is preprocess_text.py enough or do we need to create a new structure starting from es_phonemizer?
Ciao :)

Marcello-Bentivoglio · 2025-03-12T08:49:31Z

Hello,
I am also interested in how to train the model on an Italian dataset. We have a dataset that includes a set of 1,000 audio files, and we have modified the repository's code to ensure it follows the guidelines of other languages. Are we heading in the right direction? @frixos25 Could you give us some advice on how you did it?

Thank you very much in advance.

yukiarimo · 2025-03-12T13:09:45Z

If language does not exist you need to just implement it (find phoneme logical somewhere)
Otherwise, we can just simply transliterate into English and train a new speaker (this works)!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training for Italian and German works, but Portuguese and Russian do not. #244

Training for Italian and German works, but Portuguese and Russian do not. #244

frixos25 commented Feb 12, 2025

yukiarimo commented Feb 19, 2025

gdurifw commented Feb 21, 2025

Marcello-Bentivoglio commented Mar 12, 2025

yukiarimo commented Mar 12, 2025

Training for Italian and German works, but Portuguese and Russian do not. #244

Training for Italian and German works, but Portuguese and Russian do not. #244

Comments

frixos25 commented Feb 12, 2025

yukiarimo commented Feb 19, 2025

gdurifw commented Feb 21, 2025

Marcello-Bentivoglio commented Mar 12, 2025

yukiarimo commented Mar 12, 2025