From 842cd17c1075de0ffca4244e43d8428d7f341420 Mon Sep 17 00:00:00 2001
From: Ruizhi <yang.rayzhi@gmail.com>
Date: Wed, 1 Aug 2018 16:55:33 +0800
Subject: [PATCH 1/4] Fix shapes in comments of nmt_with_attention.ipynb

It is a bit misleading and confusing that the output shape of decoder is currently commented as `(batch_size * max_length, vocab)`. However the correct shape should be `(batch_size * 1, vocab)`, since the input x of GRU layer has shape == `(batch_size, 1, embedding_dim + hidden_size)`.
---
 .../examples/nmt_with_attention/nmt_with_attention.ipynb      | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/tensorflow/contrib/eager/python/examples/nmt_with_attention/nmt_with_attention.ipynb b/tensorflow/contrib/eager/python/examples/nmt_with_attention/nmt_with_attention.ipynb
index 1ab1b71bd05..0408ef01caa 100644
--- a/tensorflow/contrib/eager/python/examples/nmt_with_attention/nmt_with_attention.ipynb
+++ b/tensorflow/contrib/eager/python/examples/nmt_with_attention/nmt_with_attention.ipynb
@@ -552,10 +552,10 @@
         "        # passing the concatenated vector to the GRU\n",
         "        output, state = self.gru(x)\n",
         "        \n",
-        "        # output shape == (batch_size * max_length, hidden_size)\n",
+        "        # output shape == (batch_size * 1, hidden_size)\n",
         "        output = tf.reshape(output, (-1, output.shape[2]))\n",
         "        \n",
-        "        # output shape == (batch_size * max_length, vocab)\n",
+        "        # output shape == (batch_size * 1, vocab)\n",
         "        x = self.fc(output)\n",
         "        \n",
         "        return x, state, attention_weights\n",

From f0b3a02cb76be13364d0247b0162c23482778f9c Mon Sep 17 00:00:00 2001
From: Ruizhi <yang.rayzhi@gmail.com>
Date: Thu, 2 Aug 2018 14:16:04 +0800
Subject: [PATCH 2/4] Removed redundant tf.exp on predictions in evaluate
 method

In the `evaluate` method, the usage of `tf.exp` seems redundant here, since `predictions` itself is the logits output of a dense layer in decoder. Therefore, I removed `tf.exp` so that `tf.multinomial` is applied on the logits directly, which agrees better with the proper usage of `tf.multinomial`.
---
 .../python/examples/nmt_with_attention/nmt_with_attention.ipynb | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tensorflow/contrib/eager/python/examples/nmt_with_attention/nmt_with_attention.ipynb b/tensorflow/contrib/eager/python/examples/nmt_with_attention/nmt_with_attention.ipynb
index 0408ef01caa..0bc1c405ced 100644
--- a/tensorflow/contrib/eager/python/examples/nmt_with_attention/nmt_with_attention.ipynb
+++ b/tensorflow/contrib/eager/python/examples/nmt_with_attention/nmt_with_attention.ipynb
@@ -753,7 +753,7 @@
         "        attention_weights = tf.reshape(attention_weights, (-1, ))\n",
         "        attention_plot[t] = attention_weights.numpy()\n",
         "\n",
-        "        predicted_id = tf.multinomial(tf.exp(predictions), num_samples=1)[0][0].numpy()\n",
+        "        predicted_id = tf.multinomial(predictions, num_samples=1)[0][0].numpy()\n",
         "\n",
         "        result += targ_lang.idx2word[predicted_id] + ' '\n",
         "\n",

From 33c10145490d113e125847142ce0d9f05d9775d3 Mon Sep 17 00:00:00 2001
From: Ruizhi <yang.rayzhi@gmail.com>
Date: Tue, 14 Aug 2018 10:07:53 +0800
Subject: [PATCH 3/4] Remove unnecessary tf.exp

---
 .../python/examples/generative_examples/text_generation.ipynb   | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tensorflow/contrib/eager/python/examples/generative_examples/text_generation.ipynb b/tensorflow/contrib/eager/python/examples/generative_examples/text_generation.ipynb
index b173f856c64..38358b444a3 100644
--- a/tensorflow/contrib/eager/python/examples/generative_examples/text_generation.ipynb
+++ b/tensorflow/contrib/eager/python/examples/generative_examples/text_generation.ipynb
@@ -621,7 +621,7 @@
         "\n",
         "    # using a multinomial distribution to predict the word returned by the model\n",
         "    predictions = predictions / temperature\n",
-        "    predicted_id = tf.multinomial(tf.exp(predictions), num_samples=1)[0][0].numpy()\n",
+        "    predicted_id = tf.multinomial(predictions, num_samples=1)[0][0].numpy()\n",
         "    \n",
         "    # We pass the predicted word as the next input to the model\n",
         "    # along with the previous hidden state\n",

From fee2c48d3ff8e0c307070804275318565cac788a Mon Sep 17 00:00:00 2001
From: Ruizhi <yang.rayzhi@gmail.com>
Date: Tue, 14 Aug 2018 10:28:06 +0800
Subject: [PATCH 4/4] Remove unnecessary tf.exp

---
 .../generative_examples/image_captioning_with_attention.ipynb   | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/tensorflow/contrib/eager/python/examples/generative_examples/image_captioning_with_attention.ipynb b/tensorflow/contrib/eager/python/examples/generative_examples/image_captioning_with_attention.ipynb
index 1a5a186e7a3..315d7a48931 100644
--- a/tensorflow/contrib/eager/python/examples/generative_examples/image_captioning_with_attention.ipynb
+++ b/tensorflow/contrib/eager/python/examples/generative_examples/image_captioning_with_attention.ipynb
@@ -1056,7 +1056,7 @@
         "\n",
         "        attention_plot[i] = tf.reshape(attention_weights, (-1, )).numpy()\n",
         "\n",
-        "        predicted_id = tf.multinomial(tf.exp(predictions), num_samples=1)[0][0].numpy()\n",
+        "        predicted_id = tf.multinomial(predictions, num_samples=1)[0][0].numpy()\n",
         "        result.append(index_word[predicted_id])\n",
         "\n",
         "        if index_word[predicted_id] == '<end>':\n",