* correct optional SE layer dimensions.
* treat 32 as head_dim rather than num_heads.
* update test example output.