Baseten Blog | Page 7

Glossary

AI infrastructure: build vs. buy

AI infrastructure, ML infrastructure, build vs. buy, model deployment

Hacks & projects

Build a chatbot with Llama 2 and LangChain

Build a ChatGPT-style chatbot with open-source Llama 2 and LangChain in a Python notebook.

ML models

Deploying and using Stable Diffusion XL 1.0

Deploy Stable Diffusion XL 1.0 for free to generate images from text prompts and invoke Stable Diffusion with the Baseten Python client.

ML models

Models We Love: July 2023

Explore open-source foundation models: Llama 2 (Meta/Microsoft), FreeWilly1/2, SDXL 1.0 (Stability AI), LayoutLM (Impira), NSQL 350M (Numbers Station).

Product

Model autoscaling features on Baseten

Scale replica count up and down in response to traffic, with scale to zero and fast cold starts.

Product

Models We Love: June 2023

Dive into open-source foundation models, focusing on LLMs: Falcon-7B/40B, WizardLM, MusicGen (Meta), MPT-7B (Mosaic).

Product

New in June 2023

LangChain adds a Baseten integration, and Falcon soars to the top of the LLM leaderboard.

Hacks & projects

Three techniques to adapt LLMs for any use case

Prompt engineering, embeddings with vector databases, and fine-tuning are three ways to adapt large language models (LLMs) to your data and your use case.

Community

What I learned from my AI startup’s internal hackathon

See Baseten hackathon projects spanning ML infrastructure, inference, user experience, and streaming.
