Found 16 results for “AI audio tools”
Harmonai is a community-driven open-source generative audio project dedicated to providing AI tools for music creation, enabling more people to participate in sound and music generation practice.
Soundraw is an AI music generator built for creators that can quickly generate background music based on user-set parameters such as genre, mood, instruments, and duration. Users can select music styles such as pop, hip-hop, and classical, and adjust tempo, volume, and instrument combinations to generate music clips that meet their needs.
AI models for transcribing and understanding speech
IBM Watson Text to Speech
Xunfei Zhizuo is a one-stop AIGC content creation platform launched by iFLYTEK, providing services such as text-to-speech and virtual digital human video production based on artificial intelligence technology. Users can easily achieve rapid generation of audio and video content and create high-quality media works without professional skills.
AI text-to-speech generation tool
AI real-time voice changing tool
Uberduck is an open-source community for AI voice generation and synthesis. The platform offers more than 5,000 voices to help users create AI dubbing and speech, and you can even use your own custom voice clone for synthesis.
Moyin Workshop is a professional AI voiceover tool with more than 800 voices and over 1,000 styles, meeting a wide range of needs from video dubbing to audiobooks. Moyin Workshop offers rich features, including speech rate adjustment, polyphonic character selection, and pause control, ensuring realistic and natural text-to-speech results. Users can easily download lossless audio files and enjoy a convenient voiceover experience.
beatoven.ai is an AI music generation platform designed to provide royalty-free background music for video, podcast, and game creators. Users only need to enter their music ideas to quickly generate music in more than 250 styles. The platform supports personalized customization, including music length, style, mood, and instrument selection, to meet different creative needs.
ElevenLabs is an AI text-to-speech platform that provides realistic voice synthesis solutions for developers, creators, and enterprises. Its core products include text-to-speech (supporting 29+ languages including Chinese and 10,000+ voices), AI dubbing, voice cloning, music generation, and more.
Deepgram is a platform that provides advanced AI speech recognition and natural language processing technology. Its core products are powerful Speech-to-Text (STT) and Text-to-Speech (TTS) APIs, enabling developers to quickly integrate voice transcription and understanding capabilities into their own applications and services.
Wondercraft is a versatile AI audio content creation platform that uses generative AI voice technology to allow users to quickly convert text content into podcasts, audiobooks, ads, and other audio formats.
Tingnao AI is an AI-powered intelligent voice assistant focused on speech-to-text and real-time recording summaries, offering audio/video transcription, real-time recording-to-text, AI summaries, chapter overview, and other features. Users can freely drag text to view audio/video progress and enjoy a convenient intelligent recording experience.
LangLang Voiceover is an intelligent text-to-speech tool that provides voice synthesis services. It supports more than 30 languages, including Chinese, English, German, and French, as well as more than 10 emotional styles such as happy, sad, and excited. The platform is feature-rich and easy to use, supporting SSML tags to enable advanced functions such as polyphonic character handling and multi-speaker dubbing.
Vemus is Tencent Music's first one-stop AI music creation tool, offering zero-threshold multimodal music creation so everyone can make music. It compresses "songwriting" into three steps: enter a sentence, an image, or a humming clip, and AI automatically completes lyric writing, composition, arrangement, and singing within seconds, with instant switching across styles such as pop, Chinese style, and electronic.
