https://blog.lmcache.ai/2025-05-08-mooncake/ #27
Replies: 2 comments
-
For the model deployment, did you deploy a single model copy across all 8 GPUs, or does each GPU get its own model copy? Also, how many nodes did you use to test this feature? If the GPUs span multiple nodes, e.g. 80 H100s (8 P5 instances), how does Mooncake transfer the KV cache between instances?
0 replies
-
What test code did you use to produce these numbers?
0 replies
-
https://blog.lmcache.ai/2025-05-08-mooncake/
Overview of the Collaboration LMCache and Mooncake have announced a strategic collaboration aimed at pioneering a KVCache-centric Large Language Model (LLM) serving system. This partnership seeks to significantly enhance the efficiency, scalability, and responsiveness of LLM applications. By combining LMCache’s advanced KVCache management techniques with Mooncake’s powerful and optimized backend...
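For context on what "KVCache-centric" serving means in practice, here is a minimal, purely illustrative Python sketch of prefix-keyed KV reuse. The class, method names, and storage scheme are assumptions for illustration only, not the actual LMCache or Mooncake implementation:

```python
import hashlib

# Illustrative sketch (NOT the LMCache/Mooncake API): a KVCache-centric
# server keys stored KV tensors by a hash of the token prefix, so a new
# request can reuse the longest cached prefix instead of recomputing it.

class PrefixKVStore:
    def __init__(self):
        self._store = {}  # prefix hash -> opaque KV blob

    @staticmethod
    def _key(tokens):
        # Hash the token prefix to get a stable lookup key.
        return hashlib.sha256(str(tokens).encode("utf-8")).hexdigest()

    def put(self, tokens, kv_blob):
        # Store the KV cache computed for this exact token prefix.
        self._store[self._key(tokens)] = kv_blob

    def longest_prefix_hit(self, tokens):
        # Walk from the full sequence down; return the longest cached prefix.
        for end in range(len(tokens), 0, -1):
            kv = self._store.get(self._key(tokens[:end]))
            if kv is not None:
                return end, kv
        return 0, None

store = PrefixKVStore()
store.put([1, 2, 3], "kv-for-123")
hit_len, kv = store.longest_prefix_hit([1, 2, 3, 4, 5])
# hit_len == 3: only the tokens after the cached prefix need prefill compute
```

In a real multi-node deployment, the blob would be a GPU KV tensor and the store would be backed by a distributed transfer engine rather than an in-process dict; this sketch only shows the prefix-lookup idea.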