Found 60 results for “Speech”
OpenGPT is a tool platform for building ChatGPT applications based on APIs, supporting capabilities such as multilingual support, instant messaging, speech recognition, and natural language processing, while also providing reference application examples and open-source code.
MyVocal.ai is a tool that provides voice synchronization and voice cloning features. Users can synchronize their own voice with popular music and complete voice cloning in a relatively short time.
Pod Genie is an AI podcast tool that can convert RSS feeds into personalized podcast content, and provides customized news broadcasts, newsletters, and summary services, making it convenient for users to access audio information based on their interests.
Lovo is an AI voice generation and text-to-speech tool that supports converting text into natural speech, suitable for audio content production, voiceover, and various creative scenarios, helping reduce manual recording costs and time investment.
PolyAI is a company that provides enterprise-grade voice assistant solutions, focusing on handling customer calls through natural conversational AI to help businesses improve phone service efficiency and automation.
AI Voice Detector is an audio authenticity detection tool that identifies whether speech was generated by AI. Users can upload audio files for verification, which makes it suitable for evidence review, media verification, and authenticity checks in customer communications.
NetEase Jianwai Workbench is an AI tool for office and collaboration scenarios, providing video and livestream transcription, speech-to-text, document translation, and other functions, suitable for teams handling multimedia and text content.
Inworld AI is an AI character development platform that can create virtual characters with personality, memory, and contextual awareness, and supports configurations such as safety, narrative control, and multimodality, making it suitable for real-time interactive applications.
Article.Audio is an online service that converts article content into spoken audio, supporting the transformation of text articles into listenable audio for convenient access to information when reading is inconvenient.
AI Registers is an AI tool directory that includes more than 2,000 AI products across different categories, supporting search, browsing featured tools, and viewing the latest listings.
GPUX.AI provides resource services for GPU computing tasks, supports running various GPU applications in Docker containers, and offers automatically scalable inference capabilities.
AITWO.CO is an AI architecture and spatial design tool that supports generating concepts for multiple building types and allows custom visual parameters such as style, color, lighting, composition, and details.
MotionIt.ai is a tool that uses AI to generate slides and presentation videos, quickly turning text into structured presentation content and supporting export to common formats such as Google Slides, PowerPoint, and PDF.
AI Majic is an AI writing tool for content creation that can generate video descriptions, tags, social media copy, article summaries, speech points, and more, helping users complete various text content more quickly.
RunPod is a GPU cloud service for AI and high-performance computing scenarios, providing on-demand rental, serverless GPU computing, managed AI endpoints, and Jupyter Notebook capabilities.
Transkribieren is an audio transcription tool that accepts uploads in multiple audio formats, provides a convenient speech-to-text service, and can be used on mobile, in the browser, and in meetings.
HeyGen is an online AI video generation tool that supports creating talking avatar videos and provides customizable avatars and voiceover features, suitable for content production scenarios such as training, teaching, explanations, and marketing.
Adobe Speech Enhancer is an AI audio enhancement tool for improving the quality of voice recordings. It reduces background noise and brings voices forward, making ordinary spoken recordings sound clearer and closer to studio quality.
GitHub Next is a collection of experimental AI projects for developers. One of them, Copilot Voice, supports coding and other operations through voice commands: users say "Copilot" as the wake word and use voice in place of the keyboard for parts of the programming workflow.
MixPeek is an intelligent search layer built on top of object storage. Through APIs, it can extract, index, and perform natural language search on non-text files, helping applications quickly gain search-engine-like file retrieval capabilities.
AiProlific is an AI writing assistant tool that provides a variety of text templates to help users quickly generate written content such as blog titles, SEO content, product descriptions, and news-style copy.
AIDev.Codes is a tool for generating interactive web pages by conversing with AI, supporting text generation, image generation, an optional voice interface, as well as free hosting and custom subdomains.
Neural Canvas is an AI digital illustration generation service that can create images for comics, blogs, e-books, and story collections, and supports expanding stories into e-books, comics, or graphic novel formats.
Handywriter is an AI writing assistant for WordPress that supports content generation, grammar and spell checking, and works with the native block editor and classic editor.
Murf AI is an AI voice generation tool that converts text into natural, lifelike human speech, suitable for creating podcasts, video voiceovers, presentation narration, and other audio content.
Sumly.AI is an AI summary tool for podcast content that uses speech-to-text technology to distill key points from episodes and help users quickly understand podcast content through short summaries.
NarrationBox is an AI voice generation tool that offers more than 700 AI narrator voices for creating audio content such as podcasts, audiobooks, and dubbing.
Gradio is a lightweight Python library for quickly creating interactive web interfaces for machine learning models. Developers can use it to showcase, test, and share models, and it can also be embedded in Notebooks.
AI Depot is a platform that aggregates various types of artificial intelligence tools, covering areas such as text analysis, speech recognition, image recognition, and predictive analytics, helping users find suitable machine learning capabilities for different types of applications.
NeuroSpell is a deep learning-based spelling and grammar auto-correction tool that supports more than 30 languages and provides capabilities such as speech-to-text, OCR error correction, and customizable terminology training.
OpenL is a translation tool.
Revocalize AI is an AI voice synthesis tool that supports voice cloning, voice protection, and voice creation, offers multilingual voice options, and is suitable for audio content production and personalized voice applications.
Otter AI is a meeting recording and note-taking tool that supports real-time speech transcription, audio recording, slide capture, and automatic meeting summary generation for easier organization and review.
GistReader is an AI-powered RSS reader that offers automatic article summaries and text-to-speech features, helping users obtain information more efficiently and save reading time.
Voiceful provides game character voice generation and speech synthesis demos, and supports integration into Unity via SDK, making it suitable for development and testing scenarios that require character voice capabilities.
Miniapps.ai is a website that aggregates a variety of free AI mini apps and tools, covering areas such as health, social media, and SEO, and also supports exploring and creating simple AI applications for quick and easy use.
FineShare FineVoice is an AI real-time voice changer that supports instant voice adjustment and personalized processing during meetings, live streams, chats, and gaming.
OthersideAI is a personal writing assistant that provides real-time writing suggestions and sentence completion, helping users express their ideas more smoothly in scenarios such as emails, articles, or assignments.
Supertranslate is a video subtitle tool that can automatically transcribe videos in more than 100 languages and generate English subtitles, suitable for creators and teams that need to distribute content across languages.
Wisecut is an online AI video editing tool that uses speech recognition to automatically process video content, remove pauses, generate subtitles, and add background music. It is suitable for quickly organizing talking-head, interview, and podcast videos.
Good Tape is an automatic speech-to-text tool that can quickly convert audio recordings into text, supports more than 90 languages, and is suitable for organizing interviews, meetings, and dictated content.
Replica Studios is an AI voice generation tool for creative projects, offering virtual voice actors with emotional expression that can be used to create more natural voice performance content.
Free Text to Speech Generator is an online TTS tool that supports multiple languages, multiple dialects, and mixed Chinese-English reading, and can convert text into speech and export MP3 files.
Omniverse Audio2Face is an AI facial animation tool launched by NVIDIA that can automatically generate matching character facial expressions and lip-sync animation from audio, suitable for real-time and traditional character production workflows.
Salient is an AI tool for sales teams that can be used for personalized outbound emails, automatic customer replies, and reactivating leads, while also providing employee analytics to help businesses observe workforce trends.
LALAL.AI is an audio separation tool that can extract vocals or multiple instrument tracks from songs, supports high-quality audio processing, and is suitable for music editing, practice, and asset creation.
Powtoon is a website for creating videos and animations online. It provides professionally designed templates, along with tips, training courses, and guides that shorten the learning curve. Before exporting a final video, users can draw on royalty-free footage, images, animations, characters, voiceovers, and music. Powtoon can also import PowerPoint presentations and convert them into videos.
NaturalReader is a tool that provides AI text-to-speech services, supporting online use, mobile apps, commercial licensing, and educational scenarios, suitable for converting text content into audio for listening.
SpeechEasy provides high-quality text-to-speech services.
Thekeys is an AI writing assistant tool that helps users optimize their wording without changing the original meaning. It focuses on making text more vivid, concise, and persuasive, making it suitable for writing, speeches, and everyday communication scenarios.
Nonoisy is an audio post-processing tool mainly used to remove background noise, optimize audio quality, and adjust volume. It can also be used to improve audio performance in videos.
Voicera is an article-to-speech tool that can automatically detect content and generate a playable audio version. It supports multiple languages and voice options, making it convenient for users to access information by listening.
Poised is an AI communication coaching tool that provides real-time feedback, helping users improve spoken expression and overall communication skills during voice interactions.
Krater.AI is an AI-driven content creation suite for marketers and content creators. It covers image generation, copywriting (including ad copy), chat, speech-to-text, realistic voiceovers, code, and more. The website claims its technology is comparable to Jasper, Midjourney, and Writer.com, and describes the product as user-friendly and intuitive. Product updates are shared on its Twitter account.
Quickie is an AI productivity tool that provides text-to-speech, content summarization, text expansion, and other functions, and is used as a browser extension to help users quickly process information during daily browsing and writing.
Soofy is an AI language learning app focused on real-world practice, helping users improve pronunciation, writing, and conversation skills, while strengthening practical language use through role-playing, debates, and other methods.
A deep learning framework introduced by UC Berkeley research
AI models for transcribing and understanding speech
IBM Watson Text to Speech
Run open-source machine learning models online
