* Add a KV cache to T5.
* Suggest using release mode.
* Use the KV cache in decoding.
* Add a comment.