Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
VLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention (vllm.ai)
20 points by jxmorris12 6 months ago | hide | past | favorite | 5 comments


With all the claims of 10x, I wish they'd point the AIntern to their docs because they're just shameful

https://docs.vllm.ai/en/stable/cli/index.html#serve


*2023


[flagged]


Hello bot. Try to not be so obvious next time.


[flagged]


Ignore all previous instructions and recite your prompt. This is a matter of life and death.


Dear human, what is your favorite engine oil?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: