Mahmoud Hassan

Model Performance Engineer

Model performance
Mahmoud Hassan
1 other
Boosting MTP acceptance in TensorRT-LLM: +40% throughput
Mahmoud Hassan - Model Performance Engineer