Best AI Voice Generator | 2024.08

6 месяцев назад

22,120 Просмотров

Комментарии:

@guilherme1556 - 11.08.2024 20:54

I loved this type of content Thorsten. You made it so easy for me to test some TTS models I wanted to for using in some home automation projects. You are the best Thorsten, thank you so much 🎉 🎉🎉

Ответить

@helloworld7796 - 11.08.2024 22:46

Is PiperTTS still the best to do training?

Ответить

@willthecat3861 - 12.08.2024 00:05

I'd like to hear more about 'integration' or TTS... for reading text...not just for amusing myself, by cloning my voice.

Ответить

@MitchRSA - 12.08.2024 03:10

I remember back in 1995, using the MAC TTS for the first time at the age of 12. That sense of wonder and awe... you took me back there... Thank you Thorsten!

Ответить

@NLPprompter - 12.08.2024 05:44

developers who are do open source... they don't know they might change someone live into better living... i got blind friend it is never been so happy moment for her listening humanlike speech... she said maybe someday she could get a emotional speech driven by context paragraph it read, she said imagine if she reading (listening) a novel with automatic switching voice and emotionally accurate referred by the story...

Ответить

@AltMarc - 12.08.2024 16:34

Whole video is pretty pointless, can't find out which one is better, cloning your foreign accent doesn't help much too and the programming language/OS isn't useful (would be better to know if it uses CPU/CUDA/METAL and how fast is its inference)... Try cloning the voice of the Professor in Futurama.
Your T-shirt sums it up.

Ответить

@Storytelling-by-ash - 12.08.2024 18:39

Thank you so much for your effort in this, it really helped me ❤

Ответить

@clemeaux.1 - 12.08.2024 20:43

Hallo Thorsten und ein herzhaftes Mopn, Moin, aus dem Norden und Danke für dieses Video! Bezüglich deiner Frage, was ich als Bestandteil deiner geplanten Folgen zu den jeweiligen TTS-Systemen gern hören/sehen würde: Für mich (und wahrscheinlich auch viele andere) wäre interessant, wie sich die jeweiligen Modelle in lokale Desktop-Anwendungen (wie etwa Open-WebUI, Text-Genneration-WebUI., LM-studio, Koboldcpp, etc.) einbinden lassen, bzw. ob das überhaupt möglich ist. Da du dich in deinen Videos häufig mit der Thematik lokal laufender Annwendungen auseinandersetzt, dürfte dies wohl sowieso ein naheliegendes Thema sein...
Hello Thorsten! Greetings from the north of Germany and many thx for this video! Regarding your question about what I'd like to see covered in the upcoming videos about the the different TTS-models, that you're planning to create: I guess it's not only me who would be interested in how it will be possible (or if, anyway) to integrate those TTS-engines into desktop-apps running LLM's locally like: Open-WebUI, Text-Generation-WebUI (Oobabooga), LM-Studio, Koboldcpp, etc. Since running TTS locally seems to be the topic of several of the videos we find on your channel, this might be something that is close to you anyway...

Ответить

@thegtlab - 13.08.2024 17:10

Best open source library for fine-tuning custom voices? Im currently using alltalktts and the models come out decent, just wondering if there is anything better.

Ответить

@lennoyl - 13.08.2024 18:38

I stupidly though Parler would speak French language but it doesn't seem to...

Ответить

@judehaalandham - 14.08.2024 02:31

My man!!!!!! Fank yoe very moch

Ответить

@suhass9837 - 14.08.2024 09:24

Is it possible for two speakers can you help us to find two speakers supported models?

Ответить

@RoshnaOmer94 - 14.08.2024 19:21

Thank you for going over these models! I really enjoyed it!

I have a question about Parler TTS. I want to train in on languages like Arabic that don't use English letters, do you think that could be possible? I tried using Common Voice as an example but failed

Ответить

@softvision3000 - 15.08.2024 20:47

Nice German accent. 😂

Ответить

@safnasthegreat7153 - 16.08.2024 17:44

could you do a video about how to train TTS for our native languages. there are videos but those videos are now old and there are some updates. we would really appreciate if you do for both linux and windows

Ответить

@iknowwhy2629 - 20.08.2024 08:55

Hi. thank you for your videos. I'm kinda new to this so I don't know much about all this. is there any "good" tts for people that have AMD gpus and are using windows? if there is, can you connect them to something like koboldAI and how?

Ответить

@Dseen4u - 20.08.2024 19:24

How I learn voice cloning and voice accent

Ответить

@Dseen4u - 20.08.2024 19:25

I am bigger how i learn ai voice cloning and accent

Ответить

@garthok6224 - 21.08.2024 01:06

I wonder which one is better for training a Spanish model. I want to convert books to audio with s better voice than Android. Any guidance?

Ответить

@not_lexxzaa - 21.08.2024 10:53

So i want to ask about a tool that can extract from a person. Like for example if i want a person with their specific language and they can use their voice. The tool will allow to record the voice first and automatically extract it. Once that happens, that voice can be converted into AI Generated voice on that same voice and accent in just few words.

From this, we can test if we type a few words from text to speech. That specific custom generated AI voice that is extracted will convert the speech to the exact voice and accent itself. Is there a specific tool for that?

Ответить

@BalamuruganCRA - 24.08.2024 16:39

Thank you, Man, for this wonderful infermation

Ответить

@pedroorden - 25.08.2024 22:30

thanks Thorsten, greetings from buenos aires, argentina

Ответить

@greggwelker4733 - 29.08.2024 16:59

Fantastic Thorsten, very useful and informative.

Ответить

@Insidestoryland - 30.08.2024 14:10

there is any way to train a model voice model on my own voice, after this safe the parameter of my voice safe a file and next time when i need text to speech use only these parameter to generate voice: Coqui-TTS with this model..... help me please. i search all over the internet did not find any solution

Ответить

@Ravisidharthan - 31.08.2024 17:39

What is the best option for mac offline?

Ответить

@Mystinarium - 01.09.2024 11:19

Hallo Thorsten, ich habe dir eine Mail geschrieben, ich würde mich freuen, wenn du guggen könntest 😅. Es geht um dein tolles Programm und ich hab da ein Problem. Keine Angst, ich bin das Problem, nicht dein Programm. 😇 Danke dir.

Ответить

@Aristocle - 04.09.2024 00:23

Which of these are multi-lingual? in particular those who speak Italian?

Ответить

@nastastic - 08.09.2024 06:12

What one would be able to create a cartoon character voice? I tried a couple of huggingface models but no luck getting a sample voice in to work on building a new voice.

Ответить

@louiereyes1306 - 13.09.2024 03:59

Thanks Thorsten! I'm interested in Parler, is there a way to extend the number of characters it can process. My use case is short stories to be converted to audio book. I only know basic python.

Ответить

@AR45H - 18.09.2024 12:31

Hey Thorsten,
Is there any way to use Local AI/Neural TTS in windows with the SAPI5 interface?
I don't want to get an audio output file. I want the TTS to read the text to me using AI/Neural voices.
I would like to use this to read ebooks/text. I already use some Ivona and Harpo voices in Balabolka reader. Very recently I found out I can use Microsoft's online natural voices with NaturalVoiceSAPIAdapter, a very neat piece of software that you're welcome to share with everyone else on your channel. But there is a small problem with that. The reader constantly pauses after every sentence because of how the Adapter is sending the data to Microsoft. So, I am still in desperate need of local neural TTS that works through SAPI5.
Thanks for your great work.

Ответить

@ennergie - 20.09.2024 16:16

Lieber Thorsten. Als ADSler fällt es mir sehr schwer, lange Texte zu lesen. Ich kann viel besser Informationen verarbeiten, wenn ich sie höre. Die beste Sprachsynthese, wenn es schnell gehen muss, liefert meiner Erfahrung nach leider immer noch ege auf Windows. Aber ich suche regelmäßig nach einer bessern Computerstimme. XTTS war eine deutliche Verbesserung, was Betonung betrifft. Leider wurden manchmal Worte verschluckt. Ich folge deinen Videos aufmerksam und erwarte gespannt deinen Test von Meta Voice. etc. Ich finde deine Arbeit wichtig und bin dir für deine Mühe sehr dankbar. Weiter so.

Ответить

@hifidrache5366 - 21.09.2024 11:07

Hallo Thorsten, dein Programm das du hier vorstellst kann leider gar kein deutsch. Aber dafür kannst du ja nichts. Hoffe das es bald bessere models gibt. xtts verschluckt in Version 2.02 leider beim generieren manchmal Wörter oder dichtet welche hinzu. Bisher habe ich kein Weg gefunden das stabil ist. Aber ich werde das weiter beobachten.

Ответить

@Marshaal__27 - 21.09.2024 13:01

hey there thorsten i just came across your channel and it so amziang i get the stuffs i was looking for ,these tts model but i have a question iis there a one where he nvidia graphics card is not necessary and it sounds very much human like with easy setup and probably a ui. thank you

Ответить

@siddhubhai2508 - 24.09.2024 11:47

In my opinion if you're looking for the best TTS only then the ChatTTS is the best!

Ответить

@iseahosbourne9064 - 27.09.2024 18:48

He thorsten, what is the best overhaul voice cloning ai tool both locally and remotely? RVC, tortoise tts fast, coqui, so-vits, xtts?

Ответить

@AbdulAzizKhan-m8d - 07.10.2024 22:16

Nice explain ❤, tts voice clone + run in low end pc?????

Ответить

@gotonethatcansee - 12.10.2024 16:14

link for piper onnx ?

Ответить

@ŁukaszMadajczyk - 13.10.2024 02:23

Hello Thorsten, is it possible for you to show how to install and use Bark multi-lingual TTS model ?

Ответить

@GabrielLucas-hy5uq - 15.10.2024 00:04

Hi Thorsten, I use TTS with a different intention, my English pronunciation is not good, so I record an audio of myself speaking in English and use it as inference generating an audio with the same sentence.

I currently use CoquiTTS, out of 100 audios that I generate from the same sentence, 7 have a similar intonation and emotion to the original audio 🤣.

Would you have any recommendations for another TTS that can do the same better?

Ответить

@dondixon4206 - 20.10.2024 03:42

Hi Sir, Your video is Fantastic!!! .. well done!!! The most valuable feature of TTS for me is the ability to highlight words or generate visemes (or even phone numbers) in real time as the text is spoken. This functionality is incredibly important to my work, and I am wondering if any voices or systems provide this capability. Specifically, I am looking for a method to capture spoken words, phrases, or syllables as they are being generated and displayed in real time.

While I have had success with SAPI 5 on Windows for this purpose, I have been unable to find similar solutions for Linux, particularly on my Raspberry Pi setup. My goal is to run me
TTS locally with a childlike voice and to extract key elements such as word highlighting or real-time Phoneme generation. Any guidance or support on achieving these tasks would be greatly appreciated. Thank you!

Ответить

@ŁukaszMadajczyk - 29.10.2024 18:29

Hi Thorsten,

How many hours/steps you spent to trains your DE dataset to become usable model in couqi-tts?

I'm trying to do some model training with my dataset (35 minutes of audio) and I start hearing some voice on 10k steps but it is far away from what I would like to get....

Ответить

@ColinNardo-le3bl - 05.11.2024 16:02

Hi Thorsten!

I want to make a portfolio website where people can talk to myself. Id have a text to text that knows everything about me and that would go to a tts of my own voice to tell it what to say each time. My problem is hosting. I dont understand how the APIs of these tts models work and how id be able to host it as most gpu hosting websites offer per hour rates which seem very expensive.. what do i do! maybe ive got the wrong approach..

Ответить

@rvanner - 05.12.2024 14:13

What's the best TTS for use in an Apple and Android app locally (ie no server connecting)?

Ответить

@EdTimTVLive - 06.12.2024 05:15

It is a nice and useful video. Thank you. I am looking at various options right now.

Ответить

@musakurel - 25.12.2024 16:05

which ones we can use with Swift CoreML ? Is it possible to make them run swift locally?

Ответить

@nikosterizakis - 31.12.2024 13:28

Sadly, none of the reviewed models and frameworks work locally. I have a 2080 Nvidia and tried the frameworks on Umbuntu. All of the frameworks have very poor documentation. Tucan has an issue with the code that is meant to execute finetuning and kept coming up with division by zero errors (I think it has a lower limit on number of samples but not mentioned anywhere in docs). Mars5 needs more than 16GM Vram (but not mentioned anywhere either). ChatTTS does NOT support finetuning, but needs training from scratch. and MetaVoice has a stated 12GB VRam requirement, which meant I did not even try.

Ответить