fix pack_padded_sequence()

Hi! It seems like for pytorch 1.7.0, pack_padded_sequence's src_length must be in cpu, even if we're using cuda: pytorch/pytorch#43227 I also tried this command in pytorch 1.6.0 to check for backward compatibility and there it works fine, both with and without adding .cpu()
joeynmt · bastings · Nov 13, 2020 · Nov 11, 2020 · Nov 11, 2020 · ef1b0f081fedfcf83ce39cd2f27dcc33d3cbb2d1
commit ef1b0f081fedfcf83ce39cd2f27dcc33d3cbb2d1
diff --git a/joeynmt/encoders.py b/joeynmt/encoders.py
@@ -113,7 +113,7 @@ def forward(self, embed_src: Tensor, src_length: Tensor, mask: Tensor) \
         # apply dropout to the rnn input
         embed_src = self.emb_dropout(embed_src)
 
-        packed = pack_padded_sequence(embed_src, src_length, batch_first=True)
+        packed = pack_padded_sequence(embed_src, src_length.cpu(), batch_first=True)
         output, hidden = self.rnn(packed)
 
         #pylint: disable=unused-variable