Hypura – A storage-tier-aware LLM inference scheduler for Apple Silicon

186 points | by tatef 8 hours ago

82 comments