Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training for Italian and German works, but Portuguese and Russian do not. #244

Open
frixos25 opened this issue Feb 12, 2025 · 4 comments
Open

Comments

@frixos25
Copy link

I am trying to train a model in German, Italian, Portuguese, and Russian, but only Italian and German produced successful results. The training for Portuguese and Russian resulted in unusable model files. The phoneme extractor works fine in all these languages. What can I do to successfully train the model for Portuguese and Russian?
I also modifed the files portuguese.py and portuguese_bert.py with the link model_id = 'neuralmind/bert-base-portuguese-cased' and the same in the other languages. The phonemes seems to be accurate.

@yukiarimo
Copy link

Will update you when implemented. I’m currently working on rewriting everything, so you can check out my YunaTTS repo

@gdurifw
Copy link

gdurifw commented Feb 21, 2025

I am trying to train a model in German, Italian, Portuguese, and Russian, but only Italian and German produced successful results. The training for Portuguese and Russian resulted in unusable model files. The phoneme extractor works fine in all these languages. What can I do to successfully train the model for Portuguese and Russian? I also modifed the files portuguese.py and portuguese_bert.py with the link model_id = 'neuralmind/bert-base-portuguese-cased' and the same in the other languages. The phonemes seems to be accurate.

hi, can you explain how to do the training to add the Italian language?
Is preprocess_text.py enough or do we need to create a new structure starting from es_phonemizer?
Ciao :)

@Marcello-Bentivoglio
Copy link

Hello,
I am also interested in how to train the model on an Italian dataset. We have a dataset that includes a set of 1,000 audio files, and we have modified the repository's code to ensure it follows the guidelines of other languages. Are we heading in the right direction? @frixos25 Could you give us some advice on how you did it?

Thank you very much in advance.

@yukiarimo
Copy link

  1. If language does not exist you need to just implement it (find phoneme logical somewhere)
  2. Otherwise, we can just simply transliterate into English and train a new speaker (this works)!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants