ChunkLLM: A Lightweight Pluggable Framework for Accelerating LLMs Inference

85 points | by PaulHoule 17 hours ago

6 comments