A few Baseten engineers teamed up and released ChatLLaMA, an open-source ChatGPT alternative, as a weekend project. ChatLLaMA uses Alpaca-7B, a fine-tuned variant of LLaMA designed for a chat interface.
Want to fine-tune your own LLM? You can on Blueprint, which now supports fine-tuning FLAN-T5, an instruction-tuned version of Google’s T5 (Text-to-Text Transfer Transformer) model.
Why fine-tune an LLM?
Customize a top-notch LLM with your data: train it on your customer support tickets, user-generated content, or documentation
Build a special-purpose generative app like a recipe generator or a chatbot that’s an expert on healthcare acronyms
Save money versus using generic LLM APIs and own your model
Since we released Blueprint last month, Baseten’s engineers have been building side projects with LLMs and generative AI. Here are a few projects from the team:
Suren built GPTheus, which gives a voice and visual interface to ChatGPT using Whisper and Stable Diffusion
Sid’s hackathon group created an AI-powered dev infrastructure tool that writes Dockerfiles and uses them to build codespaces
Philip built a quick poetry generator
Here’s a demo and walkthrough of Suren’s project:
In the last year, ML has gone from an experiment to a core need for many companies that require elevated levels of security and compliance. Baseten is now SOC 2 Type II certified and HIPAA compliant. We are honored to be entrusted with sensitive data and take our platform’s security and stability seriously.
Before we go, here is a photo from our AI meetup at our San Francisco office. We’re hosting another in April. Follow us on Twitter to find out more!
See you next month!
— The team at Baseten