trained on a gh200 for 5.4b tokens. released under no license