🚀 Announcing the launch of Chains!

Baseten / Blog / Hacks & projects

Hacks & projects

Topics

Latest Model performance Hacks & projects GPU guides ML models Glossary Community Product News

Deploying custom ComfyUI workflows as APIs

Easily package your ComfyUI workflow to use any custom node or model checkpoint.

1 other

CI/CD for AI model deployments

In this article, we outline a continuous integration and continuous deployment (CI/CD) pipeline for using AI models in production.

Philip Kiely

Vlad Shulman

3 others

Prompt: A movie still of an aqueduct

Streaming real-time text to speech with XTTS V2

In this tutorial, we'll build a streaming endpoint for the XTTS V2 text to speech model with real-time narration and 200 ms time to first chunk.

Philip Kiely

1 other

Prompt: A wooden boat full of books floating down a rapid river in a Japanese garden

How to serve your ComfyUI model behind an API endpoint

This guide details deploying ComfyUI image generation pipelines via API for app integration, using Truss for packaging & production deployment.

Philip Kiely

1 other

Model: SDXL + ControlNet, Prompt: A top down view of a river through the woods

GPT vs Mistral: Migrate to open source LLMs seamlessly

Use ChatCompletions API to test open-source LLMs like Mistral 7B in your AI app with just three minor code modifications.

Philip Kiely

1 other

Prompt: A sturdy stone bridge under a full moon, warm colors

Build your own open-source ChatGPT with Llama 2 and Chainlit

Llama 2 rivals GPT-3.5 in quality and powers ChatGPT. Chainlit helps build ChatGPT-like interfaces. This guide shows creating such interfaces with Llama 2.

Philip Kiely

Prompt: A llama wearing multiple gold chains in the park

Build a chatbot with Llama 2 and LangChain

Build a ChatGPT-style chatbot with open-source Llama 2 and LangChain in a Python notebook.

Philip Kiely

Prompt: A llama dressed as a pirate with a parrot on a ship

Three techniques to adapt LLMs for any use case

Prompt engineering, embeddings, vector databases, and fine-tuning are ways to adapt Large Language Models (LLMs) to run on your data for your use case

Philip Kiely

Prompt: Three glowing paper lanterns

Serving four million Riffusion requests in two days

Riffusion is a fine-tuned version of Stable Diffusion. Baseten served Riffusion over four million times in a couple of days, serving top-of-hacker-news traffic.

Prompt: A solarpunk piano