a8950d70bf
- A data dump file set generated by tfdbg2 can contain multiple subsets when there are multiple hosts involved in the instrumented TensorFlow job (e.g., TPUs and Parameter Servers). Currently, there is no bit in those subsets of files that indicates they belong to the same instrumented TF job.
- This CL addresses this problem by adding a field (`tfdbg_run_id`) to the metadata proto used by those files.
- The DebugEventsWriter code is revised so that this new field is written to the metadata file of the file set on the writer's construction.
- Also in this CL: remove the previous 1-arg `GetDebugEventsWriter(dump_root)`, which creates the writer object if it doesn't exist at the specified dump_root. Replace it with `LookUpDebugEventsWriter(dump_root)`, which only looks up the writer object and returns a non-OK status if such an object hasn't been created at `dump_root`. This makes the code less error-prone by keeping only the fully explicit, 3-arg `GetDebugEventsWriter()`.

PiperOrigin-RevId: 316537044
Change-Id: Id5be0b771fbf37c0fc796f1514ed858a0e6d38f0
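
The split between an explicit "get or create" call and a lookup-only call described above can be illustrated with a toy registry. This is a minimal sketch and not the TensorFlow implementation: `ToyWriter`, `get_writer`, and `look_up_writer` are hypothetical names, the argument list of `get_writer` is an assumption, and a raised `LookupError` stands in for the non-OK status.

```python
import threading
from dataclasses import dataclass, field


@dataclass
class ToyWriter:
    """Hypothetical stand-in for a debug-events writer (not the TensorFlow class)."""
    dump_root: str
    tfdbg_run_id: str
    circular_buffer_size: int
    metadata: dict = field(init=False)

    def __post_init__(self):
        # Record the run ID at construction time, so that every file subset
        # written for the same job carries the same identifier.
        self.metadata = {"tfdbg_run_id": self.tfdbg_run_id}


_writers = {}              # dump_root -> ToyWriter
_lock = threading.Lock()   # guards the registry


def get_writer(dump_root, tfdbg_run_id, circular_buffer_size):
    """Fully explicit call: creates the writer if none exists at dump_root."""
    with _lock:
        if dump_root not in _writers:
            _writers[dump_root] = ToyWriter(
                dump_root, tfdbg_run_id, circular_buffer_size)
        return _writers[dump_root]


def look_up_writer(dump_root):
    """Lookup only: never creates a writer; fails if none exists at dump_root."""
    with _lock:
        try:
            return _writers[dump_root]
        except KeyError:
            raise LookupError(
                f"no writer has been created at {dump_root!r}") from None
```

Because the lookup path never constructs a writer, a caller that does not know the run ID cannot accidentally create a file set without one.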
||
---|---|---|
__init__.py | ||
client_lib.py | ||
debug_events_writer_wrapper.cc | ||
device_lib.py | ||
device_lib_test.py | ||
device_lib_wrapper.cc | ||
events_writer_test.py | ||
events_writer_wrapper.cc | ||
notebook.py | ||
pywrap_tf_session.py | ||
session.py | ||
session_benchmark.py | ||
session_clusterspec_prop_test.py | ||
session_list_devices_test.py | ||
session_partial_run_test.py | ||
session_ref.cc | ||
session_ref.h | ||
session_test.py | ||
test_construction_fails_op.cc | ||
tf_session_helper.cc | ||
tf_session_helper.h | ||
tf_session_wrapper.cc | ||
timeline.py | ||
timeline_test.py | ||
virtual_gpu_test.py |