Found 22 results for “Voice Cloning”
MyVocal.ai is a tool that provides voice synchronization and voice cloning features. Users can synchronize their own voice with popular music and complete voice cloning in a relatively short time.
Lovo is an AI voice generation and text-to-speech tool that supports converting text into natural speech, suitable for audio content production, voiceover, and various creative scenarios, helping reduce manual recording costs and time investment.
Translate.video is an AI translation tool for video content, supporting video translation, subtitle translation, dubbing, AI voice conversion, recording, and text generation to help distribute video content in multiple languages.
Inworld AI is an AI character development platform that can create virtual characters with personality, memory, and contextual awareness, and supports configurations such as safety, narrative control, and multimodality, making it suitable for real-time interactive applications.
HeyGen is an online AI video generation tool that supports creating talking avatar videos and provides customizable avatars and voiceover features, suitable for content production scenarios such as training, teaching, explanations, and marketing.
Murf AI is an AI voice generation tool that converts text into natural, lifelike human speech, suitable for creating podcasts, video voiceovers, presentation narration, and other audio content.
NarrationBox is an AI voice generation tool that offers more than 700 AI narrator voices for creating audio content such as podcasts, audiobooks, and dubbing.
Revocalize AI is an AI voice synthesis tool that supports voice cloning, voice protection, and voice creation, offers multilingual voice options, and is suitable for audio content production and personalized voice applications.
Vidboard AI is an AI video generation tool that can create video presentations featuring AI hosts based on text and human photos, and supports more than 125 languages, making it suitable for product introductions and business presentations.
LALAL.AI is an audio separation tool that can extract vocals or multiple instrument tracks from songs, supports high-quality audio processing, and is suitable for music editing, practice, and asset creation.
Uberduck is an open-source community for AI voice generation and synthesis. The platform offers more than 5,000 voices to help users create AI dubbing and speech, and you can even use your own custom voice clone for synthesis.
Moyin Workshop is a professional AI voiceover tool with more than 800 voices and over 1,000 styles, meeting a wide range of needs from video dubbing to audiobooks. Moyin Workshop offers rich features, including speech rate adjustment, polyphonic character selection, and pause control, ensuring realistic and natural text-to-speech results. Users can easily download lossless audio files and enjoy a convenient voiceover experience.
Qimiaoyuan is an AI digital human short-video and livestreaming solution launched by Mobvoi. With this digital avatar creation and livestreaming platform, users can create their own digital avatars and conduct livestreaming activities through them. The Qimiaoyuan platform currently has more than 100 digital humans and more than 1,000 3D digital assets, providing users with a wide range of choices.
ElevenLabs is an AI text-to-speech platform that provides realistic voice synthesis solutions for developers, creators, and enterprises. Its core products include text-to-speech (supporting 29+ languages including Chinese and 10,000+ voices), AI dubbing, voice cloning, music generation, and more.
iFlytek Zhiwen is an intelligent document AI assistant launched by iFLYTEK based on the Spark large model, designed to improve the creation and presentation efficiency of Word and PPT. The tool supports features such as intelligent rehearsal and AI Presenter to help users optimize the entire process from content creation to speech delivery.
KreadoAI is an AIGC digital marketing video creation platform focused on using artificial intelligence technology to simplify and optimize the video content creation process. Users only need to enter text or keywords, and Kreado AI can create video content with real or virtual people.
Wondercraft is a versatile AI audio content creation platform that uses generative AI voice technology to allow users to quickly convert text content into podcasts, audiobooks, ads, and other audio formats.
SoundView is an AI video localization tool that supports video dubbing and video translation. SoundView integrates multilingual translation, speech synthesis, speech recognition, and large-model technology to simplify and accelerate the creation of product marketing videos. SoundView supports dubbing and subtitle editing in 100 languages, increasing video production efficiency by 10 times and reducing video translation costs by 90%.
JoyPix is an AI creation tool focused on digital humans and speech synthesis. Users can create personalized virtual avatars by uploading photos, with support for voice conversations with virtual avatars.
Wujie Future is an AI application and elastic computing network platform focused on providing users with strong computing power support and a wide range of AI application services. Wujie Future offers multiple types of GPU resources, allowing users to choose suitable resources based on their needs for AI application training and deployment.
Xmov Nebula is an embodied intelligent 3D digital human open platform launched by Xmov Technology, dedicated to upgrading AI from “having a brain” to “having a body” to enable natural expression and interaction. Based on text input, Xmov Nebula can generate a 3D digital human’s voice, expressions, and movements in real time, supporting multimodal generation, low-cost operation, low-latency interaction, and multi-terminal adaptation.
Yunmu Tongsheng is a new-generation professional AI video translation tool with original-voice-level quality, suitable for short drama overseas expansion, cross-border e-commerce, and other fields. Its AI voice cloning with 98% voice restoration, precise audio-video synchronization algorithms, and AI vocal separation model can fully preserve background music and emotional details, making translated videos as natural as the original.
