This tutorial walks through implementing a multi-layer LSTM model from scratch in pure NumPy and training it on the Shakespeare dataset. It also covers implementing the AdamW optimizer and the data modules the training needs.
Author: Nima Sarang

Published: June 15, 2024

Modified: October 10, 2024