NaviAI - AI Tools Directory | Discover the Best AI Tools

Found 60 results for “Speech”

OpenGPT is a tool platform for building ChatGPT applications based on APIs, supporting capabilities such as multilingual support, instant messaging, speech recognition, and natural language processing, while also providing reference application examples and open-source code.

MyVocal.ai | Audio & Video

MyVocal.ai is a tool that provides voice synchronization and voice cloning features. Users can synchronize their own voice with popular music and complete voice cloning in a relatively short time.

Pod Genie | Audio & Video

Pod Genie is an AI podcast tool that can convert RSS feeds into personalized podcast content, and provides customized news broadcasts, newsletters, and summary services, making it convenient for users to access audio information based on their interests.

Lovo | Audio & Video

Lovo is an AI voice generation and text-to-speech tool that supports converting text into natural speech, suitable for audio content production, voiceover, and various creative scenarios, helping reduce manual recording costs and time investment.

PolyAI | Business

PolyAI is a company that provides enterprise-grade voice assistant solutions, focusing on handling customer calls through natural conversational AI to help businesses improve phone service efficiency and automation.

AI Voice Detector | Audio & Video

AI Voice Detector is an audio authenticity detection tool used to identify whether speech is generated by AI. Users can upload audio files for verification, making it suitable for scenarios involving evidence review, media judgment, and authenticity analysis in customer communications.

NetEase Jianwai Workbench | Business

NetEase Jianwai Workbench is an AI tool for office and collaboration scenarios, providing video and livestream transcription, speech-to-text, document translation, and other functions, suitable for teams handling multimedia and text content.

Inworld AI | Image & Design

Inworld AI is an AI character development platform that can create virtual characters with personality, memory, and contextual awareness, and supports configurations such as safety, narrative control, and multimodality, making it suitable for real-time interactive applications.

Article.Audio | Audio & Video

Article.Audio is an online service that converts text articles into spoken audio, so users can listen when reading is inconvenient.

AI Registers | Writing & Text

AI Registers is an AI tool directory that includes more than 2,000 AI products across different categories, supporting search, browsing featured tools, and viewing the latest listings.

GPUX.AI | Development

GPUX.AI provides resource services for GPU computing tasks, supports running various GPU applications in Docker containers, and offers automatically scalable inference capabilities.

AITWO.CO: AI-Powered All-in-One Design Platform | Image & Design

AITWO.CO is an AI architecture and spatial design tool that supports generating concepts for multiple building types and allows custom visual parameters such as style, color, lighting, composition, and details.

MotionIt.ai | Image & Design

MotionIt.ai is a tool that uses AI to generate slides and presentation videos, quickly turning text into structured presentation content and supporting export to common formats such as Google Slides, PowerPoint, and PDF.

AI Majic | Writing & Text

AI Majic is an AI writing tool for content creation that can generate video descriptions, tags, social media copy, article summaries, speech points, and more, helping users complete various text content more quickly.

RunPod | Business

RunPod is a GPU cloud service for AI and high-performance computing scenarios, providing on-demand rental, serverless GPU computing, managed AI endpoints, and Jupyter Notebook capabilities.

Transkribieren | Audio & Video

Transkribieren is an audio transcription tool that accepts uploads in multiple audio formats and offers speech-to-text on mobile, in the browser, and for meetings.

HeyGen | Audio & Video

HeyGen is an online AI video generation tool that supports creating talking avatar videos and provides customizable avatars and voiceover features, suitable for content production scenarios such as training, teaching, explanations, and marketing.

Adobe Speech Enhancer | Audio & Video

Adobe Speech Enhancer is an AI audio enhancement tool for improving the quality of voice recordings. It can reduce background noise and highlight voices, making ordinary spoken recordings sound clearer and closer to a studio effect.

GitHub Next | Development

GitHub Next is a collection of AI experimental projects for development scenarios, among which Copilot Voice supports coding and operations through voice commands. Users can use "Copilot" as the wake word and replace keyboard input with voice in parts of the programming workflow.

MixPeek | Development

MixPeek is an intelligent search layer built on top of object storage. Through APIs, it can extract, index, and perform natural language search on non-text files, helping applications quickly gain search-engine-like file retrieval capabilities.

AiProlific | Writing & Text

AiProlific is an AI writing assistant tool that provides a variety of text templates to help users quickly generate written content such as blog titles, SEO content, product descriptions, and news-style copy.

AIDev.Codes | Development

AIDev.Codes is a tool for generating interactive web pages by conversing with AI, supporting text generation, image generation, an optional voice interface, as well as free hosting and custom subdomains.

Neural Canvas | Image & Design

Neural Canvas is an AI digital illustration generation service that can create images for comics, blogs, e-books, and story collections, and supports expanding stories into e-books, comics, or graphic novel formats.

Handywriter | Writing & Text

Handywriter is an AI writing assistant for WordPress that supports content generation, grammar and spell checking, and works with the native block editor and classic editor.

Murf AI | Audio & Video

Murf AI is an AI voice generation tool that converts text into natural, lifelike human speech, suitable for creating podcasts, video voiceovers, presentation narration, and other audio content.

Sumly.AI | Education

Sumly.AI is an AI summary tool for podcast content that uses speech-to-text technology to distill key points from episodes and help users quickly understand podcast content through short summaries.

NarrationBox | Audio & Video

NarrationBox is an AI voice generation tool that offers more than 700 AI narrator voices for creating audio content such as podcasts, audiobooks, and dubbing.

Gradio | Development

Gradio is a lightweight Python library for quickly creating interactive web interfaces for machine learning models. Developers can use it to showcase, test, and share models, and it can also be embedded in Notebooks.

AI Depot | Business

AI Depot is a platform that aggregates various types of artificial intelligence tools, covering areas such as text analysis, speech recognition, image recognition, and predictive analytics, helping users find suitable machine learning capabilities for different types of applications.

NeuroSpell | Writing & Text

NeuroSpell is a deep learning-based spelling and grammar auto-correction tool that supports more than 30 languages and provides capabilities such as speech-to-text, OCR error correction, and customizable terminology training.

OpenL | Image & Design

OpenL is a translation tool.

Revocalize AI | Audio & Video

Revocalize AI is an AI voice synthesis tool that supports voice cloning, voice protection, and voice creation, offers multilingual voice options, and is suitable for audio content production and personalized voice applications.

Otter AI | Business

Otter AI is a meeting recording and note-taking tool that supports real-time speech transcription, audio recording, slide capture, and automatic meeting summary generation for easier organization and review.

GistReader | Education

GistReader is an AI-powered RSS reader that offers automatic article summaries and text-to-speech features, helping users obtain information more efficiently and save reading time.

Voiceful | Audio & Video

Voiceful provides game character voice generation and speech synthesis demos, and supports integration into Unity via SDK, making it suitable for development and testing scenarios that require character voice capabilities.

Miniapps.ai | Image & Design

Miniapps.ai is a website that aggregates a variety of free AI mini apps and tools, covering areas such as health, social media, and SEO, and also supports exploring and creating simple AI applications for quick and easy use.

FineShare FineVoice | Audio & Video

FineShare FineVoice is an AI real-time voice changer that supports instant voice adjustment and personalized processing during meetings, live streams, chats, and gaming.

OthersideAI | Writing & Text

OthersideAI is a personal writing assistant that provides real-time writing suggestions and sentence completion, helping users express their ideas more smoothly in scenarios such as emails, articles, or assignments.

Supertranslate | Image & Design

Supertranslate is a video subtitle tool that can automatically transcribe videos in more than 100 languages and generate English subtitles, suitable for creators and teams that need to distribute content across languages.

Wisecut | Image & Design

Wisecut is an online AI video editing tool that uses speech recognition to automatically process video content, remove pauses, generate subtitles, and add background music. It is suitable for quickly organizing talking-head, interview, and podcast videos.

Good Tape | Writing & Text

Good Tape is an automatic speech-to-text tool that can quickly convert audio recordings into text, supports more than 90 languages, and is suitable for organizing interviews, meetings, and dictated content.

Replica Studios: AI Voice Actors for Your Creative Projects | Audio & Video

Replica Studios is an AI voice generation tool for creative projects, offering virtual voice actors with emotional expression that can be used to create more natural voice performance content.

Free Text to Speech Generator | Audio & Video

Free Text to Speech Generator is an online TTS tool that supports multiple languages, multiple dialects, and mixed Chinese-English reading, and can convert text into speech and export MP3 files.

Omniverse Audio2Face | Audio & Video

Omniverse Audio2Face is an AI facial animation tool launched by NVIDIA that can automatically generate matching character facial expressions and lip-sync animation from audio, suitable for real-time and traditional character production workflows.

Salient | Business

Salient is an AI tool for sales teams that can be used for personalized outbound emails, automatic customer replies, and reactivating leads, while also providing employee analytics to help businesses observe workforce trends.

LALAL.AI | Audio & Video

LALAL.AI is an audio separation tool that can extract vocals or multiple instrument tracks from songs, supports high-quality audio processing, and is suitable for music editing, practice, and asset creation.

Powtoon | Image & Design

Powtoon is a website for creating videos and animations online. It provides professionally designed templates, along with tips, training courses, and guides that shorten the learning curve. Users can draw on royalty-free video footage, images, animations, characters, voiceovers, and music before exporting the final video. Powtoon can also import PowerPoint presentations and convert them into videos.

NaturalReader: Free Online Text to Speech | Audio & Video

NaturalReader is a tool that provides AI text-to-speech services, supporting online use, mobile apps, commercial licensing, and educational scenarios, suitable for converting text content into audio for listening.

SpeechEasy | Audio & Video

SpeechEasy provides high-quality text-to-speech services.

Thekeys | Writing & Text

Thekeys is an AI writing assistant tool that helps users optimize their wording without changing the original meaning. It focuses on making text more vivid, concise, and persuasive, making it suitable for writing, speeches, and everyday communication scenarios.

Nonoisy | Audio & Video

Nonoisy is an audio post-processing tool mainly used to remove background noise, optimize audio quality, and adjust volume. It can also be used to improve audio performance in videos.

Voicera | Audio & Video

Voicera is an article-to-speech tool that can automatically detect content and generate a playable audio version. It supports multiple languages and voice options, making it convenient for users to access information by listening.

Poised | Education

Poised is an AI communication coaching tool that provides real-time feedback, helping users improve spoken expression and overall communication skills during voice interactions.

Krater.AI | Writing & Text

Krater.AI is an AI-driven content creation suite for marketers and content creators. It covers ad copy generation, image generation, chat, code, speech-to-text, and realistic voiceovers, and is designed to be user-friendly and intuitive. The website claims its technology is comparable to Jasper, Midjourney, and Writer.com, and the team shares product updates through a Twitter account.

Quickie | Writing & Text

Quickie is an AI productivity tool that provides text-to-speech, content summarization, text expansion, and other functions, and is used as a browser extension to help users quickly process information during daily browsing and writing.

Soofy | Education

Soofy is an AI language learning app focused on real-world practice, helping users improve pronunciation, writing, and conversation skills, while strengthening practical language use through role-playing, debates, and other methods.

Caffe | Development

Caffe is a deep learning framework developed by researchers at UC Berkeley.

AssemblyAI | Audio & Video

AssemblyAI provides AI models for transcribing and understanding speech.

IBM Watson Text to Speech | Audio & Video

IBM Watson Text to Speech is a service from IBM that converts written text into spoken audio.

Replicate | Development

Replicate lets users run open-source machine learning models online.