Software to support a course on how GPT works.
corpus.py - convert JSON from the Cornell Movie Corpus into GPT input
tokenizer.py - encode and decode text using lowercase alphabet a-z
one_hot_encoding.py - explain one-hot encoding and how it is used
transformer.py - use tokenizer to generate output
bigram.py - holds a two-character probabilistic model