AI Voice Cloning — Creators Need to Wake Up.

Andrew Best
6 min readDec 11, 2023

--

How cloning your own voice using AI could 2X your productivity and income.

But let’s be honest — AI generated voices have largely been just a fun gimmick so far.

There has been the occasional YouTube success story with “faceless channels”, where the creator uses a robotic AI generated voice and b-roll video footage, to produce vast amounts of videos in a short amount of time.

The reality, however, has been that most of these experiments have flopped — bigtime.

Efficiency is great, but if the quality isn’t good enough for people to actually watch, then what good is producing 100 videos a day if they don’t get any views.

For most people, as soon as they hear the robotic AI computer voice, they click away from the video. This destroys the average “watch time”, and the YouTube algorithm will quickly realize the poor performance and bury the video deep into the abyss, never to be seen again.

Do you really want to watch a video written by ChatGPT and read by a robotic AI?

Of course you don’t.

And nobody else does either.

Why is cloning your own voice better than using a professional AI voice?

Most content creators still don’t know how impressive the quality of professional voice cloning is.

The biggest reason to clone your own voice is that you can train your AI voice clone to be indistinguishable from your own voice.

Professional AI voices still sound too robotic — even the paid ones.

Honestly, I don’t think your audience will care if you use a truly high quality AI clone of your own voice, especially if they know that you wrote your own content.

Far too many content creators have been cutting corners by getting ChatGPT to produce their actual content — Not enough content creators have leveraged the power of AI voice cloning.

They have this backwards.

Voice cloning is where the real leverage and productivity gains will come from in 2024 and beyond.

Once you have your perfect AI voice clone, the amount of leverage you have for content creation is incredible. You can focus your energy into research and writing great scripts. You won’t need to waste your time setting up a microphone, recording, and editing all the mistakes.

Recording your voice takes so much time and energy, and if an AI can do it as well as you can, then you are just wasting your time and energy.

And money.

What is the best free AI voice cloning software?

This was the first question I asked in my quest to clone my own voice.

I’ll spoil the surprise. There really isn’t anything free that is worthwhile. You’ll see some companies advertise these as “free”, but then pull the bait-and-switch tactic when you get on their website. You might get a mini free trial, or something like that, but don’t waste your time going down this rabbit hole.

I wrote on a previous post about how expensive it is to use these AI API’s.

Training voices is expensive, and no company can offer much for free because they are also paying a lot of money to use the API’s.

What is the best software to clone my own voice?

If you are serious about getting any results with this, then you need to get the absolute best quality.

There is no point in making an AI voice clone that is kind of cool, but isn’t good enough to replace you.

The point of making an AI voice clone is to actually use it to replace your “real voice”. If it is pretty good, but not as good as the “real you”, then people won’t like to listen to it as much. Then the whole exercise is a waste of time and money.

I’ve looked around and compared what is out there. Eleven Labs is the clear winner.

It is a very reasonable price as well. You need to get their creator package for the professional voice cloning ability. It is 22 bucks a month. And the first money is only $11.00

It is actually really cheap when you think about how much time you can save with this, and how much extra money you can earn from using this properly.

How long does it take to train my own AI voice clone?

I’ve seen some services saying that all you need to do is upload yourself reading a couple sentences, and then they’ll make an AI voice clone.

Nonsense.

These are instant voice clones, not professional voice clones.

To me, a voice clone is a voice that sounds like the real you. Not a gimmicky thing that “sort of sounds like you”.

If you really want to make an AI voice that sounds like the real you, then you will need at least 30 minutes of high quality audio of you speaking

The more the high quality input the model has to train on, the better the result will be.

Don’t cut corners on this.

If your voice clone isn’t as good as your real voice, then don’t use it.

Note: These models are getting better and better, so it is possible that they will be able to shorten the amount of audio you need to train the models well.

Also, you can upload audio from work you’ve previously made, like YouTube videos.

Voice Cloning Options at Eleven Labs

Multilingual AI voice clones — a huge bonus.

Did you know you could speak 29 languages fluently?

The amazing thing is that your AI voice clone can.

I speak Mandarin Chinese, and I’ve tested it to see how good the software is. It’s awesome.

These are the languages you can translate your voice into — 11 Labs.

Here are some other advantages of cloning your own voice

  • If you clone your own voice, it will be uniquely yours to use. No one else will be allowed to use it, so it won’t have a generic feel.
  • You can train the model on the words you actually use. This is a huge one. Have you ever used a TTS (Text To Speech) model and asked it to read a script? There are always a few words it can’t pronounce properly. These errors scream “FAKE” and ruin the entire audio clip. If you record yourself saying these words, the AI will say them exactly how you say them.
  • You can train keep improving the voice by giving it more and more of your own recordings to use as training data.
  • If you listen to an entire script made on your own voice, and you don’t like one little part of it, you can just record yourself saying that part and edit it into the video. If you use a generic AI voice, you don’t have that option because you won’t sound like the voice.
  • If YouTube and other companies start cracking down on AI generated content (which they have been and will continue to do more and more), then at least having your own voice cloned will be far better than being caught using a generic voice. (This is an educated guess on my part, but it makes total sense)

Sign up for Eleven Labs and start creating your own professional AI voice clone.

Sign up for my email newsletter if you are interested in AI and Growth Marketing.

Also, check out my website: AI Growth Guys for more info on how to grow your business with AI.

Note: Some links in this post may be affiliate links, where I earn a small commission if you choose to make a purchase through these links.

--

--

Andrew Best

AI Educator | ChatGPT & Prompt Engineering Expert | 10M+ Podcast Downloads | Co-founder of 88Herbs | AI Course Instructor at Udemy | www.aigrowthguys.com