Accelerating Gemma 4: faster inference with multi-token prediction drafters 591 points by amrrs 19 hours ago 277 comments story