In this video, Chris shows how AI transformer models such as GPT is more effective at learning when given given focused training on small digestable chunks, built up through more example. He uses his small language model that he built from scratch to show learning