Model performance engineer
Yineng Zhang
About
Yineng Zhang is a Software Engineer at Baseten Model Performance team. He is also a core developer of the SGLang project.
Model performance engineer
About
Yineng Zhang is a Software Engineer at Baseten Model Performance team. He is also a core developer of the SGLang project.
Qwen 3 235B: open-source MoE LLM brings frontier reasoning to 4 H100 GPUs. See benchmarks, SGLang setup, and FP8 tips for cost-efficient inferencing.