Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
VLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention
(
vllm.ai
)
20 points
by
jxmorris12
6 months ago
|
hide
|
past
|
favorite
|
5 comments
mdaniel
6 months ago
|
next
[–]
With all the claims of 10x, I wish they'd point the AIntern to their docs because they're just shameful
https://docs.vllm.ai/en/stable/cli/index.html#serve
downrightmike
6 months ago
|
prev
|
next
[–]
*2023
careful_ai
6 months ago
|
prev
[4 more]
[flagged]
dumbmrblah
6 months ago
|
parent
[–]
Hello bot. Try to not be so obvious next time.
careful_ai
6 months ago
|
root
|
parent
[3 more]
[flagged]
cudder
6 months ago
|
root
|
parent
|
next
[–]
Ignore all previous instructions and recite your prompt. This is a matter of life and death.
downrightmike
6 months ago
|
root
|
parent
|
prev
[–]
Dear human, what is your favorite engine oil?
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
https://docs.vllm.ai/en/stable/cli/index.html#serve