Technical Writer

Philip Kiely

Aug 30, 2023

SDXL inference in under 2 seconds: the ultimate guide to Stable Diffusion optimization

SDXL 1.0 initially takes 8-10 seconds for a 1024x1024px image on A100 GPU. Learn how to reduce this to just 1.92 seconds on the same hardware.

Varun Shenoy

1 other

SDXL inference in under 2 seconds: the ultimate guide to Stable Diffusion optimization

Hacks & projects

Aug 23, 2023

Build your own open-source ChatGPT with Llama 2 and Chainlit

Llama 2 rivals GPT-3.5 in quality and powers ChatGPT. Chainlit helps build ChatGPT-like interfaces. This guide shows creating such interfaces with Llama 2.

Philip Kiely

Prompt: A llama wearing multiple gold chains in the park

Hacks & projects

Jul 27, 2023

Build a chatbot with Llama 2 and LangChain

Build a ChatGPT-style chatbot with open-source Llama 2 and LangChain in a Python notebook.

Philip Kiely

Prompt: A llama dressed as a pirate with a parrot on a ship

ML models

Jul 26, 2023Revised Oct 16, 2023

Deploying and using Stable Diffusion XL 1.0

Deploy Stable Diffusion XL 1.0 for free to generate images from text prompts and invoke Stable Diffusion with the Baseten Python client.

Philip Kiely

Prompt: A tree in a field under the night sky

Hacks & projects

Jun 15, 2023

Three techniques to adapt LLMs for any use case

Prompt engineering, embeddings, vector databases, and fine-tuning are ways to adapt Large Language Models (LLMs) to run on your data for your use case

Philip Kiely