
With LLMs, the inputs are highly variable, so exact-match caching is generally less useful. Semantic caching groups similar inputs and returns relevant results accordingly. So {"dish":"spaghetti bolognese"} and {"dish":"spaghetti with meat sauce"} could return the same cached result.
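
A minimal sketch of how such a cache could work, assuming a sentence-embedding model via sentence-transformers (the model name, 0.85 threshold, and brute-force scan are illustrative choices, not from the comment):

    import numpy as np
    from sentence_transformers import SentenceTransformer  # assumed dependency

    model = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative model choice

    class SemanticCache:
        """Cache keyed by embedding similarity instead of exact string match."""
        def __init__(self, threshold: float = 0.85):  # illustrative hit threshold
            self.threshold = threshold
            self.keys: list[np.ndarray] = []
            self.values: list[str] = []

        def put(self, prompt: str, response: str) -> None:
            self.keys.append(model.encode(prompt, normalize_embeddings=True))
            self.values.append(response)

        def get(self, prompt: str) -> str | None:
            if not self.keys:
                return None
            q = model.encode(prompt, normalize_embeddings=True)
            sims = np.stack(self.keys) @ q  # cosine similarity: vectors are unit-norm
            best = int(np.argmax(sims))
            return self.values[best] if sims[best] >= self.threshold else None

    cache = SemanticCache()
    cache.put('{"dish":"spaghetti bolognese"}', "<cached LLM response>")
    # A near-duplicate phrasing should hit the same entry:
    print(cache.get('{"dish":"spaghetti with meat sauce"}'))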


Or store the prompt as a sentence embedding and match by vector distance, but that creates many edge cases.
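
One such edge case, sketched with the same assumed embedding model: negation barely moves the vector, so a pure distance threshold can return a confidently wrong hit.

    from sentence_transformers import SentenceTransformer  # assumed dependency

    model = SentenceTransformer("all-MiniLM-L6-v2")
    a = "spaghetti with meat sauce"
    b = "spaghetti without meat sauce"
    va, vb = model.encode([a, b], normalize_embeddings=True)
    # Cosine similarity is typically high here despite the opposite meaning,
    # so most thresholds would treat this as a cache hit.
    print(float(va @ vb))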



