Speech-to-text
Speech-to-text models transcribe audio input into written text. Transcribed text is searchable, which makes it useful for content moderation, call center auditing, and podcast SEO. Speech-to-text models can also automatically generate video subtitles and other accessibility text. For automated transcription to be useful, though, it needs to be accurate. OpenAI's open-source Whisper model delivers a generational leap in transcript quality: it accurately transcribes dozens of languages and properly handles technical terms. Try it for yourself with our Whisper demo app, then deploy the model for your own projects.
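As a rough illustration of the subtitle use case, here is a minimal sketch that turns timestamped transcript segments into SRT subtitle text. It assumes the open-source `openai-whisper` Python package, whose `transcribe` result includes a `segments` list with `start`, `end`, and `text` fields. The helper names (`srt_timestamp`, `segments_to_srt`, `transcribe_to_srt`) and the `"base"` model size are illustrative choices, not part of this page's demo app.

```python
from typing import Iterable, Mapping


def srt_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Mapping]) -> str:
    """Build SRT text from segments that carry 'start', 'end', and 'text' keys."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    # A blank line separates consecutive subtitle entries.
    return "\n".join(blocks)


def transcribe_to_srt(audio_path: str, model_name: str = "base") -> str:
    """Hypothetical wrapper: transcribe a file with Whisper and return SRT text.

    Assumes the openai-whisper package is installed; it is imported lazily
    so the formatting helpers above can be used without it.
    """
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path)
    return segments_to_srt(result["segments"])
```

Because the formatting helpers only depend on plain dictionaries, the same code could be reused with any transcription backend that reports start and end times per segment.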
Use cases
Content Moderation
Model Deployment
Content Creation
Resources
Whisper

Explore more

Image generation
Use Stable Diffusion to generate original images from a text prompt.

Deploy XGBoost model
Serve an XGBoost model behind a REST API endpoint.

Deploy Keras model
Serve a Keras model behind a REST API endpoint.

Deploy scikit-learn model
Serve a scikit-learn model behind a REST API endpoint.