hi Why the input of the decoder is an one-hot vector? We usually use dense vectors(through embedding lookup function) as the input in seq2seq models.