Create only CPU device in EncapsulateSubgraphsPass

EncapsulateSubgraphsPass needs to run a portion of the graph as
part of its constant folding. It used to create all available devices
to do this, which can be slow. Also, the SessionOptions used in
EncapsulateSubgraphsPass can collide with the "global" SessionOptions.
This leads to problems when the "global" options created only one GPU
device, but EncapsulateSubgraphsPass wants to create all of them.
More specifically, xla::Backend is currently shared and created only
the first time around. When EncapsulateSubgraphsPass later tries to access
the backend for the second GPU, it gets an error that the backend does not exist.

PiperOrigin-RevId: 238253724
Igor Ganichev 2019-03-13 10:31:35 -07:00 committed by TensorFlower Gardener
parent 1b64697fd6
commit 959cb35b46


@@ -2538,12 +2538,25 @@ Status EncapsulateSubgraphsPass::Run(
   // Constant folding below might need to run part of the function to compute
   // constants. Create an FunctionLibraryRuntime with a single CPU device
   // that can run the part of the function.
+  // NOTE: If this turns out to be slow, we can cache the FLRs keyed by
+  // `options`.
   SessionOptions session_options;
+  auto* device_count = session_options.config.mutable_device_count();
+  device_count->insert({"CPU", 1});
   std::vector<std::unique_ptr<Device>> devices;
-  TF_CHECK_OK(DeviceFactory::AddDevices(
-      session_options, "/job:localhost/replica:0/task:0", &devices));
+  DeviceFactory* cpu_factory = DeviceFactory::GetFactory("CPU");
+  if (!cpu_factory) {
+    return errors::NotFound(
+        "CPU Factory not registered. Can't run EncapsulateSubgraphsPass");
+  }
+  TF_RETURN_IF_ERROR(cpu_factory->CreateDevices(
+      session_options, "/job:localhost/replica:0/task:0", &devices));
+  if (devices.empty()) {
+    return errors::NotFound(
+        "Failed to create a CPU device for EncapsulateSubgraphsPass");
+  }
   std::unique_ptr<DeviceMgr> device_mgr =
       absl::make_unique<DeviceMgr>(std::move(devices));
   OptimizerOptions opts;