Even though OpKernel::ComputeAsync completes asynchronously, the kernel launch or memory copy can still be issued on the calling thread. This is especially true for _Send/_Recv and _HostSend/_HostRecv.

Adding the annotation lets us see which op issued the memory copy.
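
To illustrate why an annotation scope around the launching call is enough, here is a minimal sketch that does not use the TensorFlow profiler at all. Every name in it (`ScopedAnnotation`, `g_annotations`, `LaunchMemcpy`, `ComputeAsync`, `RunDemo`, and the label `"_Recv:my_tensor"`) is hypothetical and only stands in for the real machinery. It assumes the profiler keeps a per-thread annotation stack and that the copy is issued on the calling thread, as the message above describes.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical stand-in for a profiler's per-thread annotation stack.
thread_local std::vector<std::string> g_annotations;

// RAII scope: pushes an op label on construction, pops it on destruction.
class ScopedAnnotation {
 public:
  explicit ScopedAnnotation(std::string name) {
    g_annotations.push_back(std::move(name));
  }
  ~ScopedAnnotation() {
    assert(!g_annotations.empty());
    g_annotations.pop_back();
  }
  ScopedAnnotation(const ScopedAnnotation&) = delete;
  ScopedAnnotation& operator=(const ScopedAnnotation&) = delete;
};

// Log of which annotation was active when each copy was issued.
// Only touched from the launching thread in this sketch.
std::vector<std::string> g_copy_log;

// Hypothetical copy hook: attributes the copy to the innermost annotation
// on the thread that issues it.
void LaunchMemcpy() {
  g_copy_log.push_back(g_annotations.empty() ? "<unknown>"
                                             : g_annotations.back());
}

// Mimics an async kernel: the copy is issued synchronously in the caller,
// and only the completion callback runs later on another thread.
std::thread ComputeAsync(std::function<void()> done) {
  LaunchMemcpy();
  return std::thread(std::move(done));
}

// Issues one copy without an annotation scope and one inside a scope.
std::vector<std::string> RunDemo() {
  g_copy_log.clear();
  std::thread unannotated = ComputeAsync([] {});
  unannotated.join();
  {
    ScopedAnnotation scope("_Recv:my_tensor");
    std::thread annotated = ComputeAsync([] {});
    annotated.join();
  }
  return g_copy_log;
}
```

In this sketch the first copy cannot be attributed to any op, while the second is attributed to the label that was in scope when the copy was issued, even though its completion runs on a different thread.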

PiperOrigin-RevId: 294723865
Change-Id: I93089ccdeef4e6beca0ae571b295c47ee07225c4
A. Unique TensorFlower 2020-02-12 12:11:19 -08:00 committed by TensorFlower Gardener
parent 33e0fc8aa7
commit 05b4fe88ab

@@ -1877,7 +1877,7 @@ void ExecutorState::Process(TaggedNode tagged_node, int64 scheduled_nsec) {
};
nodestats::SetOpStart(stats);
{
-profiler::TraceMe activity(
+profiler::AnnotatedTraceMe activity(
[&] {
return op_kernel->TraceString(
&state->ctx, /*verbose=*/profiler::TfOpDetailsEnabled());