All-new model management, a text embedding model that matches OpenAI, and misgif, the most fun you’ll have with AI all week.
Mistral 7B LLM, GPU comparisons, model observability features, and an open source AI event series
The latest version of Truss brings new solutions for the most common pain points in packaging and serving ML models. Plus, learn how to optimize Stable Diffusion XL inference to run in as little as 3 seconds and build your own open-source version of ChatGPT with Llama 2 and Chainlit.
Llama 2 and SDXL shake up foundation model leaderboards (plus: LangChain, autoscaling, and more)
Autoscaling lets your machine learning model deployment automatically add or remove replicas in response to incoming traffic. Baseten offers a set of autoscaling features, including scale to zero and cold start handling.
An in-depth look at open-source foundation models, primarily LLMs: Falcon-7B and Falcon-40B from TII, WizardLM from Microsoft and Peking University, MusicGen from Meta, and MPT-7B from Mosaic.
LangChain adds Baseten integration, Falcon soars to the top of the LLM leaderboard
An explanation of how Baseten's model library works for deploying and serving popular open-source models.
Discover new models for text generation and text-to-speech, learn more about the GPUs they run on, and plug in to the community forming around open-source models.
LLMs go OSS, AI community thrives, Baseten offers free credits to start deploying models