Kokoro TTS

Pricing model
Free
Upvote 0
Kokoro TTS is a state-of-the-art text-to-speech solution that transforms text into realistic, human-like speech. It provides a range of customizable voices across various languages, leveraging advanced AI technology and utilizing NVIDIA GPU acceleration for rapid processing. Users such as content creators, businesses, educators, and developers can utilize Kokoro TTS to create top-notch voiceovers, automated communications, or audio content efficiently, helping to conserve time and resources while ensuring consistency and accessibility for an international audience.

Similar neural networks:

Freemium
Upvote 0
Listnr is an online AI-powered voice generator and text-to-speech tool enabling users to produce lifelike voiceovers from text, featuring over 900 voices across more than 142 languages. It allows users to create perfectly timed, human-like voiceovers for ads, e-learning, product demonstrations, presentations, audiobooks, and YouTube videos. Furthermore, Listnr offers developers straightforward and dependable APIs, and allows users to create a podcast from text, publish it on a customized page, and share it on all major platforms.
Paid
Upvote 0
Replica Studios offers an AI Voice Actor Library featuring over 40 voices for use in games, films, and various creative works. Their AI system is trained to replicate the speech patterns, pronunciation, and emotional expressions of actual voice actors. This library is expanding quickly, and Replica supports indie creators and animation studios by enabling them to achieve natural-sounding performances efficiently and as needed. They are also committed to ethical considerations and the security of AI voices, providing tools to ensure voices are utilized positively.
Freemium
Upvote 1
Descript is an audio and video editing software offering transcription, screen recording, publishing, and AI features such as lifelike voice cloning with Overdub, free voice templates, privacy-centric options, the capacity to edit real recordings mid-sentence, create multiple voices, share with trusted collaborators, and access a premium stock voice library. It also delivers a 44.1KHz broadcast-quality speech synthesizer and live Overdubbing capabilities.