This is nice. Interestingly, caching LLM responses makes a lot of sense because the response won't ever go out of date (due to LLMs having a cut-off date themselves).
The only exception is when you cache responses from agents that use LLMs. In that case, we need to be smart enough not to cache things that are dynamic (very much like non-LLM apps). A rough sketch of that gating is below.
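To make that concrete, here is a minimal sketch of what that gating could look like, assuming redisvl's SemanticCache API and a local Redis instance; the `is_dynamic` heuristic and `call_llm` are illustrative placeholders, not part of Redis:

```python
# Sketch: only consult/populate the semantic cache for prompts that aren't
# tied to live or user-specific data. Assumes redisvl's SemanticCache API.
from redisvl.extensions.llmcache import SemanticCache

cache = SemanticCache(
    name="llm_cache",
    redis_url="redis://localhost:6379",
    distance_threshold=0.1,  # how semantically close a prompt must be to count as a hit
)

def is_dynamic(prompt: str) -> bool:
    # Placeholder rule: skip caching for prompts that depend on live data
    # (user-specific state, tool results, time-sensitive questions, ...).
    return any(word in prompt.lower() for word in ("today", "my order", "current price"))

def call_llm(prompt: str) -> str:
    # Placeholder for the actual model call.
    return "..."

def answer(prompt: str) -> str:
    if not is_dynamic(prompt):
        hits = cache.check(prompt=prompt)
        if hits:
            return hits[0]["response"]  # semantic cache hit
    response = call_llm(prompt)
    if not is_dynamic(prompt):
        cache.store(prompt=prompt, response=response)
    return response
```

The point is simply that the cache lookup and the cache write both sit behind the same "is this static?" check, so dynamic agent turns always go to the model.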
Interesting, thanks for sharing. How does it handle multi-turn conversational context when a cached output is returned? Even in a chatbot use case, the chatbot will be sending context like user info or the product being discussed. Also, are there any case studies or validation data available?
This would be a great improvement for every Agentic / LLM application. I'll have to give Redis 8 a try.
Thanks for this breakdown, Raul!
For sure, once you see the hit rates in action, it’s hard to imagine running without it.
Good one
Wow... pretty interesting stuff.
Didn't know about these Redis features. Thanks for sharing, Raul!
Great post. Going to see where I can potentially use this.