NaviAI - AI Tools Directory | Discover the Best AI Tools

Found 16 results for “AI audio tools”

Harmonai is a community-driven open-source generative audio project dedicated to providing AI tools for music creation, enabling more people to participate in sound and music generation practice.

Soundraw | Audio & Video

Soundraw is an AI music generator built for creators. It quickly generates background music from user-set parameters such as genre, mood, instruments, and duration. Users can select a style such as pop, hip-hop, or classical, then adjust tempo, volume, and instrument combinations to produce clips that fit their needs.

AssemblyAI | Audio & Video

AI models for transcribing and understanding speech

IBM Watson Text to Speech | Audio & Video

IBM Watson Text to Speech

Xunfei Zhizuo (讯飞智作) | Audio & Video

Xunfei Zhizuo is a one-stop AIGC content creation platform from iFLYTEK that provides AI-based services such as text-to-speech and virtual digital human video production. Users can quickly generate audio and video content and produce high-quality media without professional skills.

Voicemaker | Audio & Video

AI text-to-speech generation tool

MetaVoice | Audio & Video

AI real-time voice changing tool

Uberduck | Audio & Video

Uberduck is an open-source community for AI voice generation and synthesis. The platform offers more than 5,000 voices for creating AI dubbing and speech, and users can also synthesize speech with their own custom voice clone.

Moyin Workshop (魔音工坊) | Audio & Video

Moyin Workshop is a professional AI voiceover tool with more than 800 voices and over 1,000 styles, covering needs from video dubbing to audiobooks. Its features include speech rate adjustment, pronunciation selection for polyphonic characters, and pause control, which help make text-to-speech output sound realistic and natural. Users can also download lossless audio files.

beatoven.ai | Audio & Video

beatoven.ai is an AI music generation platform that provides royalty-free background music for video, podcast, and game creators. Users enter a description of the music they want and can quickly generate tracks in more than 250 styles. The platform supports customization of music length, style, mood, and instrument selection to meet different creative needs.

ElevenLabs | Audio & Video

ElevenLabs is an AI text-to-speech platform that provides realistic voice synthesis for developers, creators, and enterprises. Its core products include text-to-speech (with 10,000+ voices and support for 29+ languages, including Chinese), AI dubbing, voice cloning, music generation, and more.

Deepgram | Audio & Video

Deepgram is a platform that provides advanced AI speech recognition and natural language processing technology. Its core products are powerful Speech-to-Text (STT) and Text-to-Speech (TTS) APIs, enabling developers to quickly integrate voice transcription and understanding capabilities into their own applications and services.

Wondercraft | Audio & Video

Wondercraft is a versatile AI audio content creation platform that uses generative AI voice technology to allow users to quickly convert text content into podcasts, audiobooks, ads, and other audio formats.

Tingnao AI (听脑AI) | Other

Tingnao AI is an AI-powered voice assistant focused on speech-to-text and real-time recording summaries. It offers audio/video transcription, real-time recording-to-text, AI summaries, chapter overviews, and other features. Users can drag through the transcript text to move to the corresponding position in the audio or video.

LangLang Voiceover (琅琅配音) | Audio & Video

LangLang Voiceover is a text-to-speech tool that provides voice synthesis services. It supports more than 30 languages, including Chinese, English, German, and French, as well as more than 10 emotional styles such as happy, sad, and excited. It also supports SSML tags, which enable advanced functions such as pronunciation control for polyphonic characters and multi-speaker dubbing.
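For readers unfamiliar with SSML, here is a minimal sketch of what markup for polyphonic characters and multi-speaker dubbing can look like, built with Python's standard library. It uses the generic W3C SSML elements `<speak>`, `<voice>`, and `<phoneme>`. The voice names, the `x-pinyin` alphabet label, and the pinyin-with-tone-number notation are assumptions for illustration only; the tags and values that LangLang Voiceover actually accepts may differ.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C SSML recommendation.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_ssml(turns, lang="zh-CN"):
    """Build an SSML string from a list of (voice_name, parts) turns.

    Each part is either a plain string or a (text, pronunciation) tuple.
    Tuples are wrapped in <phoneme> to pin down the reading of a
    polyphonic character. Voice names and the "x-pinyin" alphabet are
    hypothetical placeholders, not values from any specific service.
    """
    speak = ET.Element(
        "speak", {"version": "1.1", "xmlns": SSML_NS, "xml:lang": lang}
    )
    for voice_name, parts in turns:
        voice = ET.SubElement(speak, "voice", {"name": voice_name})
        last = None  # most recent <phoneme> child, if any
        for part in parts:
            if isinstance(part, str):
                # Plain text goes before the first child or after the last one.
                if last is None:
                    voice.text = (voice.text or "") + part
                else:
                    last.tail = (last.tail or "") + part
            else:
                text, pronunciation = part
                last = ET.SubElement(
                    voice, "phoneme",
                    {"alphabet": "x-pinyin", "ph": pronunciation},
                )
                last.text = text
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    # Two speakers; 行 is read "hang2" in 银行 (bank), not "xing2".
    print(build_ssml([
        ("narrator", ["银", ("行", "hang2"), "开门了。"]),
        ("guest", ["好的，我们走吧。"]),
    ]))
```

Each `<voice>` element corresponds to one speaker's turn, and `<phoneme>` overrides the default reading only for the wrapped character, leaving the rest of the sentence to the engine.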

Vemus (未音) | Audio & Video

Vemus is Tencent Music's first one-stop AI music creation tool, offering multimodal music creation with no barrier to entry so that anyone can make music. It reduces songwriting to three steps: the user enters a sentence, an image, or a hummed clip, and the AI completes lyric writing, composition, arrangement, and singing within seconds, with instant switching across styles such as pop, Chinese traditional style, and electronic.