KittenTTS

Pricing model
Open Source
Upvote 0
KittenTTS is an ultra-lightweight open-source text-to-speech model that converts written text into natural-sounding speech with impressive quality, all while requiring minimal computational resources. Unlike most speech conversion AI models that demand powerful hardware, KittenTTS operates efficiently on almost any device, including older computers, Raspberry Pi, and even browsers, thanks to its tiny size of 25 MB and design with 15 million parameters. This AI model provides several realistic voices in real-time without needing an internet connection or GPUs, making it ideal for developers creating privacy-focused applications, edge computing projects, accessibility tools, or any scenarios where resource efficiency is vital. Combining high output quality, incredible speed on CPU-only systems, and an open-source Apache 2.0 license, KittenTTS represents a breakthrough in AI-powered voice conversion where larger models simply cannot function.

Similar neural networks:

GitHub
Upvote 0
TTS Voice Wizard is a software that allows users to transform their speech into text and then reconvert it to speech using Microsoft Azure Voice Recognition and TTS. Additionally, it transmits OSC messages to VRChat to exhibit text on an avatar. The software offers numerous customization features, including over 100 voice options, support for more than 20 languages, and the capability to display song titles, artists, and progress above the user.
Paid
Upvote 0
Resemble's AI voice generator is a comprehensive toolset for generating lifelike voices swiftly. It includes features such as text-to-speech, speech-to-speech, neural audio editing, language dubbing, emotional expression, real-time voice cloning, localization, and Resemble Fill. Additionally, it offers a versatile API and compatibility with popular tools, allowing developers to quickly create production-ready integrations.
Paid
Upvote 0
This robust online voice generator provides a wide selection of over 130 AI voices in various accents and tones, allowing you to effortlessly find the ideal voice for your videos, presentations, commercial branding, e-learning materials, and more. Utilizing cutting-edge AI algorithms and deep learning, Murf’s AI voices are remarkably lifelike, avoiding any robotic or monotonous tones. Moreover, Murf's user-friendly interface, modern design, and premium features enable you to create authentic-sounding voice overs in a matter of minutes! Give Murf a try today and discover the capabilities of AI-generated speech.