Tyler Weitzman, Co-Founder & Head of AI at Speechify – Interview Series

Tyler Weitzman is the Co-Founder, Head of Artificial Intelligence & President at Speechify, the #1 text-to-speech app in the world, with over 100,000 5-star reviews. Weitzman is a graduate of Stanford University, where he received a BS in mathematics and an MS in Computer Science in the Artificial Intelligence track. He has been selected by Inc. Magazine as a Top 50 Entrepreneur, and he has been featured in Business Insider, TechCrunch, LifeHacker, and CBS, among other publications. Weitzman’s Master’s degree research focused on artificial intelligence and text-to-speech, and his final paper was titled: “CloneBot: Personalized Dialogue-Response Predictions.”

You started coding when you were only 9 years old. What initially attracted you to computer science?

I was pretty obsessed as a kid with Dragon Ball Z, and I wanted to learn to animate myself. I learned Adobe Flash and Photoshop and put my own animations of Goku on a fan website I built. Soon after, I started learning about methods and algorithms, and when I found out I could actually program for a living, that was pretty exciting. I had thought it was just a hobby, like playing games.

You then started building iPhone apps when you were only 12 years old. What were some of these apps?

One app was called Black SMS, which allows people to send encrypted text messages to each other. Another app was called Frontback, which lets users take selfies and photos of what’s in front of them at the same time.

Could you discuss your research at Stanford University and how it was centered around natural language processing and speech synthesis?

My research spanned multiple uses for transformer networks, including language generation models for chat, part-of-speech tagging, punctuation prediction, and text-to-speech. Optimizing neural network inference for mobile CPUs was a major focus, and that directly translated to the offline voices available on Speechify, which work even in airplane mode.

Could you share the genesis story behind Speechify?

I’m blind in one eye and my brother Cliff is dyslexic. We’ve used audiobooks and text-to-speech audio technology for as long as we can remember to get through school and, when we were young, for reading books like Harry Potter. As we got older and started to use more technology products, we started to realize there was an opportunity to build better text-to-speech apps on web and mobile, with better voices thanks to advancements in AI and a better user experience. So we decided to go for it with Speechify.

What are some of the different machine learning technologies that are used at Speechify?

We’ve adopted cutting-edge techniques for advanced generative architectures: transformers/conformers, large-scale pretraining, distributed training, gradient accumulation, auto-encoded latent spaces, diffusion, adversarial networks, and language modeling. We employ supporting techniques for feature processing surrounding phonemization, pitch, and emotion, to better model speech specifically.

What are some of the challenges behind building a text-to-speech app?

One key challenge is building high quality voices that sound like real humans rather than robots. Our goal is for people to not be able to tell the difference between how our voices sound and how humans sound, so that our users are comfortable listening to content on Speechify for long periods of time. A second challenge is distributing our AI models to millions of users. It’s one thing to build high quality AI voices and another to make sure millions of users across the world actually find out about them and use them.

Speechify is the #1 app in its category in the App Store. What do you attribute this success to?

We believe we’ve built the best products on the market for people who want to listen to the reading they need to consume, whether it’s students with homework, professionals who are reading for work, or leisure readers who just want to be entertained. We have the best selection of voices, including celebrities like Snoop Dogg, and the best user interface for people to easily add and access the content that they want to consume. And our user experience is seamless across the Speechify ecosystem: you can start listening to an article on your computer and then simply zap it to keep listening on your phone.

What are some of the biggest use cases for this app?

Speechify’s generative AI solves real problems for students who want to get through lots of homework faster, real people with dyslexia and ADHD who have trouble reading, seniors with low vision, professionals who want to read more and be more productive, writers who want to listen to their work, auditory learners, and countless others.

What is your vision for the future of AI?

We want AI, and specifically AI text-to-speech voices, to eliminate barriers to learning regardless of your income level, learning differences, geography, or language. We see AI as a tool for social good to elevate the quality of life humans can live by improving their education.

Thank you for the great interview. Readers who wish to learn more should visit Speechify.
