Towards unlimited contexts: faster-than-GPU sparse logarithmic attention on CPU [video]

3 points | by mfiguiere 13 hours ago

No comments yet.