MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU 326 points by chrsw 1 week ago 57 comments