Show HN: Nvidia's CUDA libraries are generic and not optimized for LLM inference

1 points | by venkat_2811 11 hours ago

1 comments