alibaba

alibaba / rtp-llm

#9
1,145197+7 todayCuda

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

📊 Project Info

Language
Cuda
Stars
1,145
Forks
197
Today
+7
Ranking
#9
Collection
Language
Trending Date
May 30, 2026
Last Push
5/30/2026

🏷️ Topics

gptinferencellamallmllm-servingllmopsmodel-serving

📸 Screenshots

rtp-llm screenshot 1