* Remove the embedding ops in favor of index-select.
* Also remove the CUDA kernels.
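
Why index-select can stand in for a dedicated embedding op: an embedding lookup only gathers rows of the weight table by token id, which is index-select along dimension 0. The sketch below illustrates that equivalence in plain Rust, assuming a row-major `[vocab, hidden]` table stored as a flat slice. The names `index_select_rows` and `embedding` are hypothetical and are not this framework's API.

```rust
/// Gathers rows of a row-major `[vocab, hidden]` table: output row `i` is
/// table row `ids[i]`. Hypothetical helper for illustration only.
/// Returns `None` if `hidden` is zero, the table length is not a multiple
/// of `hidden`, or any id is out of range.
fn index_select_rows(table: &[f32], hidden: usize, ids: &[u32]) -> Option<Vec<f32>> {
    if hidden == 0 || table.len() % hidden != 0 {
        return None;
    }
    let vocab = table.len() / hidden;
    let mut out = Vec::with_capacity(ids.len() * hidden);
    for &id in ids {
        let row = id as usize;
        if row >= vocab {
            return None;
        }
        // Copy the selected row into the output buffer.
        out.extend_from_slice(&table[row * hidden..(row + 1) * hidden]);
    }
    Some(out)
}

/// An embedding lookup is the same gather with token ids as the indices,
/// so it needs no kernel of its own.
fn embedding(table: &[f32], hidden: usize, token_ids: &[u32]) -> Option<Vec<f32>> {
    index_select_rows(table, hidden, token_ids)
}
```

With a 3x2 table, `embedding(&table, 2, &[2, 0, 2])` returns rows 2, 0, and 2 concatenated. Because `embedding` is only a thin wrapper here, any backend that implements the gather also covers the lookup.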
Minimalist ML framework for Rust