Attention at Constant Cost per Token via Symmetry-Aware Taylor Approximation 164 points by fheinsen 4 months ago 96 comments story