
Speech to text

Speech-to-text models transcribe audio input into written text. Transcribed text is searchable, which makes it useful for content moderation, call center auditing, and podcast SEO. Speech-to-text models can also automatically generate video subtitles and other accessibility text. But automated transcription is only useful if it is accurate. OpenAI's open-source Whisper model delivers a generational leap in transcript quality, accurately transcribes dozens of languages, and properly handles technical terms. Try it for yourself with our Whisper demo app, then deploy the model for your own projects.
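As a rough illustration of the subtitle use case, the sketch below turns timed transcript segments into SubRip (SRT) subtitle text. It assumes each segment is a dict with `start` and `end` times in seconds and a `text` string, which is the shape the open-source `whisper` Python package returns under `result["segments"]`. The helper names `format_timestamp` and `segments_to_srt` are hypothetical and are not part of Whisper or Baseten.

```python
from typing import Dict, Iterable


def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Dict]) -> str:
    """Build SRT subtitle text from timed transcript segments.

    Assumption: each segment has "start", "end" (seconds) and "text" keys.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        text = seg["text"].strip()
        # One SRT cue: running number, time range, then the caption text.
        blocks.append(f"{index}\n{start} --> {end}\n{text}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)


# With the open-source whisper package installed, segments could come from:
#   import whisper
#   result = whisper.load_model("base").transcribe("talk.mp3")  # hypothetical file
#   srt_text = segments_to_srt(result["segments"])
```

Keeping the formatting step separate from transcription means the same function works on segments from a locally run model or from a deployed model's response, provided the response carries the same three fields.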

View live demo
Use cases

Content Moderation

Model Deployment

Content Creation

Resources

Whisper


Explore more

Image generation

Use Stable Diffusion to generate original images from a text prompt.


Deploy XGBoost model

Serve an XGBoost model behind a REST API endpoint.


Deploy Keras model

Serve a Keras model behind a REST API endpoint.


Deploy scikit-learn model

Serve a scikit-learn model behind a REST API endpoint.


© Baseten 2022