* Rework the var-builder to handle initializations. * Add some helper functions for layer creation. * Improve the layer initializations. * Get initialized variables. * Precompute the rot embeddings when training lamas.
* Add a flag to change the number of epochs for the mnist training. * Increase the learning rate for the MLP.