Real-Time Translating Earbuds

Amanda White
5 min readOct 21, 2022

--

The development of technology over the past hundred years has enabled connection across the world that would have otherwise been unthinkable. We are now able to access information from other countries, share personal experiences and meet new friends from different backgrounds all because of the connectivity of the internet and social media. And even though we are more connected than ever before, we are still limited by the ability to understand one another.

Imagery of pieces of multi-colored paper with the word thank you in several different languages
https://learnenglishteens.britishcouncil.org/study-break/magazine-zone/languages

There are approximately 7,100 languages spoken across the world today. In response, many countries require students to learn popular languages, like English, along with their native language in order to bridge the language gap across the world. However, learning just one additional language can be very difficult and can require life-long commitment in order to gain fluency.

Luckily, scientists have taken another approach and have worked to develop technology that translates speech in several languages. After years of prototyping, several companies have emerged with real-time translating devices conveniently encapsulated into common wireless earbuds. A translating company that claims to have developed the most precise and accurate earbuds is Timekettle with their newest model, the Timekettle WT2 Edge.

Image of the Timekettle WT2 Edge earbuds and charging case. A design similar to common earbuds.
https://www.timekettle.co/blogs/news/these-real-time-in-ear-translator-earphones-help-you-fluently-speak-in-as-many-as-40-different-languages

How it works

These earbuds work in a five stage process. The first stage is called ‘input conditioning’ in which the device utilizes voice activation detection technology in conjunction with background noise removal in order to carefully listen to the user’s individual speech. The second stage is called ‘language identification’ which utilizes machine learning to determine what language is being spoken by the user. The third stage is called ‘automatic speech recognition’ which utilizes a process that converts recorded speech into strings of phonemes and language modeling to develop specific words. The fourth step, called ‘natural language processing’, then directly translates the specific words from one language to another. Lastly, the fifth stage called ‘speech synthesis’ utilizes a process exactly opposite to the third stage in order to convert those words to speech in the opposing earbud.

The Timekettle WT2 Edge seems to lead the industry in translating earbud technologies for many reasons. This model has the ability to translate 40 languages and up to 93 accents with a 95% accuracy. You can utilize this technology in three modes which allow for 1) two person conversations, 2) a translating speaker for large audiences, and 3) brief ‘walkie-talkie’ translation for 6–30 people when utilized with video conferencing tools. This model also offers offline translation for a small selection of common languages making connectivity more accessible.

Future Advancements and their Effect on Society

The development of this technology is very exciting as it can have an extraordinary effect on how we communicate with others in the future. In the future we can expect to see these kind of devices translate more languages with increased accuracy and expand into less common languages and dialects. We can also expect to see devices with longer battery lives and even implementation into common earbud devices, cell phones and video conferencing technologies. I believe that one day these might gain enough popularity and demand that we will use the technology as everyday tools.

Imagery of researchers giving presentation to large room of people
https://blogs.illinois.edu/view/6397/273317

This technology is specifically exciting for me as a second-generation immigrant. I have family members that explicitly speak Chinese while I primarily speak English. So having a tool that would allow for deeper conversation with such ease is especially exciting. I am also currently pursuing a Masters of Science in Mechanical Engineering and in that field research is shared across the world. One barrier to sharing that research, however, is language barriers. The utilization of this real-time translating technology has the potential to break down those barriers and allow myself and other scientists to share ground-breaking research with ease.

Aside from the positive impact this technology may have on society, there is potentially negative impacts that we need to consider. This technology has the potential to connect people from vastly different backgrounds with ease, however, by relying more on technology for translation, people may be effected in various ways. First, utilizing this simple technology may discourage people from learning and studying other languages. People may develop the mindset of ‘why spend years of my life memorizing words and practicing grammar if I can just put on my translating earbuds?’ This could also result in a loss of jobs for translators across the world as it will become cheaper to purchase translating technology once rather than hire hourly employees.

Imagery of professional translator
https://www.entrepreneur.com/en-au/news-and-trends/what-translators-do-others-learn/339036

Conclusion

Overall, this up-and-coming technology has the potential to change the way we communicate with people around the world. Companies continue to research ways to improve performance of these real-time translating earbuds and translate more and more languages. This technology has the potential to benefit the lives of every person on Earth. However, we must be aware of the potential negative impacts to real people that the development of these kinds of technologies may produce.

Sources

--

--