Is One Layer Enough? A Single Transformer Layer Matches Full-Parameter RL Train 110 points by tcp_handshaker 7 hours ago 27 comments story