Found 45 results for “AI Audio”
MyVocal.ai is a tool that provides voice synchronization and voice cloning features. Users can synchronize their own voice with popular music and complete voice cloning in a relatively short time.
Lovo is an AI voice generation and text-to-speech tool that supports converting text into natural speech, suitable for audio content production, voiceover, and various creative scenarios, helping reduce manual recording costs and time investment.
latitude.io focuses on AI game development and design, providing AI-generated text, images, audio, and related capabilities for game production to help integrate artificial intelligence technology into the game creation workflow.
YouWhisper is a machine-learning-based video production and editing tool for users who need to quickly process video footage, offering multiple editing options to help create higher-quality video content.
Mubert is an AI music generation tool that provides royalty-free tracks for content creators and app developers, and can generate music by style, mood, use case, and duration.
Rezi.ai is an AI resume generation tool that provides resume templates, content writing assistance, and keyword optimization features to help job seekers complete resumes faster and improve screening match rates.
Adobe Speech Enhancer is an AI audio enhancement tool for improving the quality of voice recordings. It can reduce background noise and highlight voices, making ordinary spoken recordings sound clearer and closer to a studio effect.
Playlistable is an AI playlist generation tool that creates personalized playlists based on mood, occasion, and music preferences, and supports listening through Spotify integration.
Nuro.video is an AI video editing tool that can automatically transcribe, analyze, and organize long raw video footage into finished videos with titles, transitions, and animations.
Hit'n'Mix is an audio processing tool that can be used to remove vocals, create tracks, mix, and repair audio. It offers a 21-day free trial.
Novels AI is a tool for generating personalized audio adventure stories, allowing users to customize characters and plot choices and experience AI-driven immersive story content in audiobook form.
Podsqueeze is an AI content repurposing tool for podcast creators that can generate supporting content such as show descriptions, timestamps, and newsletters around podcast audio, helping improve post-production organization efficiency.
Krisp is an AI-based noise cancellation app mainly used to improve the quality of online meetings and voice communication. It supports Mac and Windows, and offers voice productivity-related features and a free version.
Moises App is an AI music tool that supports adjusting song key and speed, separating vocals and instruments, and provides mastering and audio extraction features.
Voiceful provides game character voice generation and speech synthesis demos, and supports integration into Unity via SDK, making it suitable for development and testing scenarios that require character voice capabilities.
FineShare FineVoice is an AI real-time voice changer that supports instant voice adjustment and personalized processing during meetings, live streams, chats, and gaming.
Steve AI is an AI video creation tool for social media operations and content marketing scenarios, capable of quickly converting scripts, blogs, or text into short videos and animated content.
Harmonai is a community-driven open-source generative audio project dedicated to providing AI tools for music creation, enabling more people to participate in sound and music generation practice.
FolkTalk is an AI video dubbing platform that supports multilingual dubbing, helping creators and organizations distribute video content to audiences in different languages while preserving the original expression style as much as possible.
Splashmusic is a project that aims to make music creation accessible to everyone. It provides easy-to-use music production tools for creating, recording, and sharing music, and is aimed at both music enthusiasts and professional musicians.
Sonify provides tools and solutions that combine audio and data, focusing on turning data into sound to help users understand, analyze, and experience information more intuitively through hearing.
SpeechEasy provides high-quality text-to-speech services.
This is a website that provides AI-powered rapid short video creation, but currently does not have any content.
Lately.ai is a tool that combines AI content repurposing with social media management. It can break down long videos, audio, or text into short-form content suitable for social media distribution, while assisting with multi-account management and performance analysis.
Tavus is an AI personalized video generation tool for product, marketing, and sales teams. It can mass-produce customized videos for different audiences based on templates, and use voice variables to deliver communication content that better matches the recipient.
Soundraw is an AI music generator built for creators that can quickly generate background music based on user-set parameters such as genre, mood, instruments, and duration. Users can select music styles such as pop, hip-hop, and classical, and adjust tempo, volume, and instrument combinations to generate music clips that meet their needs.
AI models for transcribing and understanding speech
IBM Watson Text to Speech
Xunfei Zhizuo is a one-stop AIGC content creation platform launched by iFLYTEK, providing services such as text-to-speech and virtual digital human video production based on artificial intelligence technology. Users can easily achieve rapid generation of audio and video content and create high-quality media works without professional skills.
AI text-to-speech generation tool
AI real-time voice changing tool
AI digital human talking-head video generation tool
Wondershare Virbo is an AI digital human talking-video marketing tool launched by Wondershare Technology, focused on providing video creators and cross-border e-commerce practitioners with a full-chain AIGC creation experience. The software uses advanced AI technology to allow users to quickly generate HD videos containing digital human characters, dynamic scenes, and precise backgrounds through simple text input or voice files.
Uberduck is an open-source community for AI voice generation and synthesis. The platform offers more than 5,000 voices to help users create AI dubbing and speech, and you can even use your own custom voice clone for synthesis.
Moyin Workshop is a professional AI voiceover tool with more than 800 voices and over 1,000 styles, meeting a wide range of needs from video dubbing to audiobooks. Moyin Workshop offers rich features, including speech rate adjustment, polyphonic character selection, and pause control, ensuring realistic and natural text-to-speech results. Users can easily download lossless audio files and enjoy a convenient voiceover experience.
beatoven.ai is an AI music generation platform designed to provide royalty-free background music for video, podcast, and game creators. Users only need to enter their music ideas to quickly generate music in more than 250 styles. The platform supports personalized customization, including music length, style, mood, and instrument selection, to meet different creative needs.
ElevenLabs is an AI text-to-speech platform that provides realistic voice synthesis solutions for developers, creators, and enterprises. Its core products include text-to-speech (supporting 29+ languages, including Chinese, and 10,000+ voices), AI dubbing, voice cloning, music generation, and more.
Deepgram is a platform that provides advanced AI speech recognition and natural language processing technology. Its core products are powerful Speech-to-Text (STT) and Text-to-Speech (TTS) APIs, enabling developers to quickly integrate voice transcription and understanding capabilities into their own applications and services.
KreadoAI is an AIGC digital marketing video creation platform focused on using artificial intelligence technology to simplify and optimize the video content creation process. Users only need to enter text or keywords, and KreadoAI can create video content with real or virtual people.
Wondercraft is a versatile AI audio content creation platform that uses generative AI voice technology to allow users to quickly convert text content into podcasts, audiobooks, ads, and other audio formats.
Tingnao AI is an AI-powered intelligent voice assistant focused on speech-to-text and real-time recording summaries, offering audio/video transcription, real-time recording-to-text, AI summaries, chapter overviews, and other features. Users can drag through the transcribed text to move to the corresponding point in the audio or video.
LangLang Voiceover is an intelligent text-to-speech tool that provides voice synthesis services. It supports more than 30 languages, including Chinese, English, German, and French, as well as more than 10 emotional styles such as happy, sad, and excited. The platform is feature-rich and easy to use, supporting SSML tags to enable advanced functions such as polyphonic character handling and multi-speaker dubbing.
Xingliu AI is a one-stop AI design and creation tool launched by LiblibAI, providing intelligent creative services such as AI image generation, AI audio generation, and AI video generation. Xingliu AI also offers fast intelligent image editing services, such as HD upscaling, intelligent outpainting, and erasing, helping users process images efficiently.
SoundView is an AI video localization tool that supports video dubbing and video translation. It integrates multilingual translation, speech synthesis, speech recognition, and large-model technology to simplify and accelerate the creation of product marketing videos. SoundView supports dubbing and subtitle editing in 100 languages, and is stated to increase video production efficiency tenfold and reduce video translation costs by 90%.
Vemus is Tencent Music's first one-stop AI music creation tool, offering zero-threshold multimodal music creation so everyone can make music. It compresses "songwriting" into three steps: enter a sentence, an image, or a humming clip, and AI automatically completes lyric writing, composition, arrangement, and singing within seconds, with instant switching across styles such as pop, Chinese style, and electronic.
