NaviAI - AI Tools Directory | Discover the Best AI Tools

Found 45 results for “AI Audio”

MyVocal.ai is a tool that provides voice synchronization and voice cloning features. Users can synchronize their own voice with popular music and complete voice cloning in a relatively short time.

LovoAudio & Video

Lovo is an AI voice generation and text-to-speech tool that supports converting text into natural speech, suitable for audio content production, voiceover, and various creative scenarios, helping reduce manual recording costs and time investment.

latitude.ioGaming & Fun

latitude.io focuses on AI game development and design, providing AI-generated text, images, audio, and related capabilities for game production to help integrate artificial intelligence technology into the game creation workflow.

YouWhisperAudio & Video

YouWhisper is a machine-learning-based video production and editing tool for users who need to quickly process video footage, offering multiple editing options to help create higher-quality video content.

MubertAudio & Video

Mubert is an AI music generation tool that provides royalty-free tracks for content creators and app developers, and can generate music by style, mood, use case, and duration.

Rezi.aiWriting & Text

Rezi.ai is an AI resume generation tool that provides resume templates, content writing assistance, and keyword optimization features to help job seekers complete resumes faster and improve screening match rates.

Adobe Speech EnhancerAudio & Video

Adobe Speech Enhancer is an AI audio enhancement tool for improving the quality of voice recordings. It can reduce background noise and highlight voices, making ordinary spoken recordings sound clearer and closer to a studio effect.

PlaylistableAudio & Video

Playlistable is an AI playlist generation tool that creates personalized playlists based on mood, occasion, and music preferences, and supports listening through Spotify integration.

Nuro.videoAudio & Video

Nuro.video is an AI video editing tool that can automatically transcribe, analyze, and organize long raw video footage into finished videos with titles, transitions, and animations.

RipXAudio & Video

Hit'n'Mix is an audio processing tool that can be used to remove vocals, create tracks, mix, and repair audio. Try it free for 21 days and download now.

Novels AIAudio & Video

Novels AI is a tool for generating personalized audio adventure stories, allowing users to customize characters and plot choices and experience AI-driven immersive story content in audiobook form.

PodsqueezeAudio & Video

Podsqueeze is an AI content repurposing tool for podcast creators that can generate supporting content such as show descriptions, timestamps, and newsletters around podcast audio, helping improve post-production organization efficiency.

KrispAudio & Video

Krisp is an AI-based noise cancellation app mainly used to improve the quality of online meetings and voice communication. It supports Mac and Windows, and offers voice productivity-related features and a free version.

Moises AppAudio & Video

Moises App is an AI music tool that supports adjusting song key and speed, separating vocals and instruments, and provides mastering and audio extraction features.

VoicefulAudio & Video

Voiceful provides game character voice generation and speech synthesis demos, and supports integration into Unity via SDK, making it suitable for development and testing scenarios that require character voice capabilities.

FineShare FineVoiceAudio & Video

FineShare FineVoice is an AI real-time voice changer that supports instant voice adjustment and personalized processing during meetings, live streams, chats, and gaming.

Steve AIAudio & Video

Steve AI is an AI video creation tool for social media operations and content marketing scenarios, capable of quickly converting scripts, blogs, or text into short videos and animated content.

HarmonaiAudio & Video

Harmonai is a community-driven open-source generative audio project dedicated to providing AI tools for music creation, enabling more people to participate in sound and music generation practice.

FolkTalkAudio & Video

FolkTalk is an AI video dubbing platform that supports multilingual dubbing, helping creators and organizations distribute video content to audiences in different languages while preserving the original expression style as much as possible.

SplashmusicAudio & Video

Splashmusic is a project that lets everyone enjoy the fun of music creation. It provides easy-to-use music production tools, allowing users to easily create, record, and share their own music works. Whether you are a music enthusiast or a professional musician, Splashmusic can meet your needs.

SonifyAudio & Video

Sonify is a provider of tools and solutions focused on combining audio and data, with a core direction of turning data into sound to help users understand, analyze, and experience information more intuitively through hearing.

SpeechEasyAudio & Video

SpeechEasy provides high-quality text-to-speech services

超级创作者Audio & Video

This is a website that provides AI-powered rapid short video creation, but currently does not have any content.

Lately.aiWriting & Text

Lately.ai is a tool that combines AI content repurposing with social media management. It can break down long videos, audio, or text into short-form content suitable for social media distribution, while assisting with multi-account management and performance analysis.

TavusAudio & Video

Tavus is an AI personalized video generation tool for product, marketing, and sales teams. It can mass-produce customized videos for different audiences based on templates, and use voice variables to deliver communication content that better matches the recipient.

SoundrawAudio & Video

Soundraw is an AI music generator built for creators that can quickly generate background music based on user-set parameters such as genre, mood, instruments, and duration. Users can select music styles such as pop, hip-hop, and classical, and adjust tempo, volume, and instrument combinations to generate music clips that meet their needs.

AssemblyAIAudio & Video

AI models for transcribing and understanding speech

IBM Watson文字转语音Audio & Video

IBM Watson Text to Speech

讯飞智作Audio & Video

Xunfei Zhizuo is a one-stop AIGC content creation platform launched by iFLYTEK, providing services such as text-to-speech and virtual digital human video production based on artificial intelligence technology. Users can easily achieve rapid generation of audio and video content and create high-quality media works without professional skills.

VoicemakerAudio & Video

AI text-to-speech generation tool

MetaVoiceAudio & Video

AI real-time voice changing tool

D-IDAudio & Video

AI digital human talking-head video generation tool

万兴播爆Audio & Video

Wondershare Virbo is an AI digital human talking-video marketing tool launched by Wondershare Technology, focused on providing video creators and cross-border e-commerce practitioners with a full-chain AIGC creation experience. The software uses advanced AI technology to allow users to quickly generate HD videos containing digital human characters, dynamic scenes, and precise backgrounds through simple text input or voice files.

UberduckAudio & Video

Uberduck is an open-source community for AI voice generation and synthesis. The platform offers more than 5,000 voices to help users create AI dubbing and speech, and you can even use your own custom voice clone for synthesis.

魔音工坊Audio & Video

Moyin Workshop is a professional AI voiceover tool with more than 800 voices and over 1,000 styles, meeting a wide range of needs from video dubbing to audiobooks. Moyin Workshop offers rich features, including speech rate adjustment, polyphonic character selection, and pause control, ensuring realistic and natural text-to-speech results. Users can easily download lossless audio files and enjoy a convenient voiceover experience.

beatoven.aiAudio & Video

beatoven.ai is an AI music generation platform designed to provide royalty-free background music for video, podcast, and game creators. Users only need to enter their music ideas to quickly generate music in more than 250 styles. The platform supports personalized customization, including music length, style, mood, and instrument selection, to meet different creative needs.

ElevenLabsAudio & Video

ElevenLabs is an AI text-to-speech platform that provides realistic voice synthesis solutions for developers, creators, and enterprises. Its core products include text-to-speech (supporting 29+ languages including Chinese and 10,000+ voices), AI dubbing, voice cloning, music generation, and more.

DeepgramAudio & Video

Deepgram is a platform that provides advanced AI speech recognition and natural language processing technology. Its core products are powerful Speech-to-Text (STT) and Text-to-Speech (TTS) APIs, enabling developers to quickly integrate voice transcription and understanding capabilities into their own applications and services.

KreadoAIAudio & Video

KreadoAI is an AIGC digital marketing video creation platform focused on using artificial intelligence technology to simplify and optimize the video content creation process. Users only need to enter text or keywords, and Kreado AI can create video content with real or virtual people.

WondercraftAudio & Video

Wondercraft is a versatile AI audio content creation platform that uses generative AI voice technology to allow users to quickly convert text content into podcasts, audiobooks, ads, and other audio formats.

听脑AIother

Tingnao AI is an AI-powered intelligent voice assistant focused on speech-to-text and real-time recording summaries, offering audio/video transcription, real-time recording-to-text, AI summaries, chapter overview, and other features. Users can freely drag text to view audio/video progress and enjoy a convenient intelligent recording experience.

琅琅配音Audio & Video

LangLang Voiceover is an intelligent text-to-speech tool that provides voice synthesis services. It supports more than 30 languages, including Chinese, English, German, and French, as well as more than 10 emotional styles such as happy, sad, and excited. The platform is feature-rich and easy to use, supporting SSML tags to enable advanced functions such as polyphonic character handling and multi-speaker dubbing.

星流AIImage & Design

Xingliu AI is a one-stop AI design and creation tool launched by LiblibAI, providing intelligent creative services such as AI image generation, AI audio generation, and AI video generation. Xingliu AI also offers fast intelligent image editing services, such as HD upscaling, intelligent outpainting, and erasing, helping users process images efficiently.

SoundViewAudio & Video

SoundView is an AI video localization tool that supports video dubbing and video translation. SoundView integrates multilingual translation, speech synthesis, speech recognition, and large-model technology to simplify and accelerate the creation of product marketing videos. SoundView supports dubbing and subtitle editing in 100 languages, increasing video production efficiency by 10 times and reducing video translation costs by 90%.

Vemus未音Audio & Video

Vemus is Tencent Music's first one-stop AI music creation tool, offering zero-threshold multimodal music creation so everyone can make music. It compresses "songwriting" into three steps: enter a sentence, an image, or a humming clip, and AI automatically completes lyric writing, composition, arrangement, and singing within seconds, with instant switching across styles such as pop, Chinese style, and electronic.