NaviAI logoNaviAI

Categories

Chat Assistants131Writing & Text225Image & Design326Audio & Video114Development131Education82Business246Gaming & Fun22Health20Travel11Finance2
NaviAI logoNaviAI
HomeAI NewsTutorialsAbout
中文
HomeSearch

Found 14 results for “Speech-to-Text”

网易见外工作台
网易见外工作台Business

NetEase Jianwai Workbench is an AI tool for office and collaboration scenarios, providing video and livestream transcription, speech-to-text, document translation, and other functions, suitable for teams handling multimedia and text content.

Inworld AI
Inworld AIImage & Design

Inworld AI is an AI character development platform that can create virtual characters with personality, memory, and contextual awareness, and supports configurations such as safety, narrative control, and multimodality, making it suitable for real-time interactive applications.

Transkribieren
TranskribierenAudio & Video

Transkribieren is an audio transcription tool that supports uploading multiple audio formats, provides a relatively convenient speech-to-text service, and extends to use on mobile, in the browser, and in meeting scenarios.

Sumly.AI
Sumly.AIEducation

Sumly.AI is an AI summary tool for podcast content that uses speech-to-text technology to distill key points from episodes and help users quickly understand podcast content through short summaries.

NeuroSpell
NeuroSpellWriting & Text

NeuroSpell is a deep learning-based spelling and grammar auto-correction tool that supports more than 30 languages and provides capabilities such as speech-to-text, OCR error correction, and customizable terminology training.

Supertranslate
SupertranslateImage & Design

Supertranslate is a video subtitle tool that can automatically transcribe videos in more than 100 languages and generate English subtitles, suitable for creators and teams that need to distribute content across languages.

Good Tape
Good TapeWriting & Text

Good Tape is an automatic speech-to-text tool that can quickly convert audio recordings into text, supports more than 90 languages, and is suitable for organizing interviews, meetings, and dictated content.

Krater.AI
Krater.AIWriting & Text

Krater.AI is an AI-driven content creation tool that provides a suite of tools for marketers and content creators. It offers solutions such as ad copy generation, creating stunning images, and converting audio content into written content or realistic voiceovers. The website claims to have advanced technology comparable to Jasper, Midjourney, and Writer.com. Krater.AI is designed to be user-friendly and intuitive, offering features such as image generation, copywriting, chat, speech-to-text, code, and more. The website also has a Twitter account where they share updates about their product.

AssemblyAI
AssemblyAIAudio & Video

AI models for transcribing and understanding speech

飞书妙记
飞书妙记other

Feishu Minutes offers intelligent meeting notes and fast AI speech-to-text transcription.

Cohere
CohereDevelopment

Cohere is a platform that provides large language models, helping developers and enterprises build high-performance AI products. The platform mainly offers AI-powered search text services (multilingual embeddings, neural search, search ranking), text classification, and text generation, helping enterprises quickly deploy conversational AI chatbots, generative search engines, text summarization, and enhanced vector retrieval.

ElevenLabs
ElevenLabsAudio & Video

ElevenLabs is an AI text-to-speech platform that provides realistic voice synthesis solutions for developers, creators, and enterprises. Its core products include text-to-speech (supporting 29+ languages including Chinese and 10,000+ voices), AI dubbing, voice cloning, music generation, and more.

Deepgram
DeepgramAudio & Video

Deepgram is a platform that provides advanced AI speech recognition and natural language processing technology. Its core products are powerful Speech-to-Text (STT) and Text-to-Speech (TTS) APIs, enabling developers to quickly integrate voice transcription and understanding capabilities into their own applications and services.

听脑AI
听脑AIother

Tingnao AI is an AI-powered intelligent voice assistant focused on speech-to-text and real-time recording summaries, offering audio/video transcription, real-time recording-to-text, AI summaries, chapter overview, and other features. Users can freely drag text to view audio/video progress and enjoy a convenient intelligent recording experience.