Join us for a hands-on technical workshop and Brazilian churrasco experience at Fogo de Chão.
Discover how the world's largest AI inference workloads run at lightning speed on NVIDIA Dynamo, an open-source framework for distributed model serving.
In this 1-hour workshop, Harry Kim (NVIDIA) and Philip Kiely (Baseten) will dive deep into system-level optimizations that turbocharge LLM inference at scale, including:
- KV-aware routing
- KV cache offloading
- Prefill/decode (PD) disaggregation
After the session and Q&A, stay for a churrasco lunch. Enjoy eight different meats, a fresh salad bar, and traditional sides.
If you’re an AI engineer in SF, don’t miss this technical workshop and the chance to network with peers. Lunch is on NVIDIA and Baseten!
✅ Follow Baseten on Twitter & LinkedIn
✅ Follow NVIDIA on Twitter & LinkedIn