STT-tensorflow/tensorflow/compiler/xla
Peter Hawkins 077b553fda [XLA:Python] Specify a 2MiB stack size for host stream threads.
[StreamExecutor] Allow HostExecutor users to control the stack sizes of threads used for HostStream via.

Also include non_portable_tags in the keys used when creating an Executor. There seems to be no good reason that it is omitted.

Will fix https://github.com/google/jax/issues/432 when included in a jaxlib release.

PiperOrigin-RevId: 309472318
Change-Id: Ia2535616047390d6bf6f2da82a666a321dcc9f5d
2020-05-01 14:20:58 -07:00
client Add optional layout constraint to AllToAll 2020-04-27 18:37:21 -07:00
experimental/xla_sharding Allow user to pass input_shape to split. 2020-02-19 18:04:17 -08:00
g3doc [XLA] [Docs] Clarify AllReduce semantics when the input is a tuple 2020-04-03 14:02:48 -07:00
pjrt [XLA:Python] Specify a 2MiB stack size for host stream threads. 2020-05-01 14:20:58 -07:00
python Split non-Python PJRT classes into their own directory. 2020-05-01 10:01:09 -07:00
python_api minor spelling tweaks 2019-12-09 18:21:12 +09:00
rpc Use tf_grpc_dependency in tensorflow/compiler/xla 2020-04-02 22:42:59 -07:00
service [XLA:Python] Specify a 2MiB stack size for host stream threads. 2020-05-01 14:20:58 -07:00
tests [XLA:coverage] Disable tests timing out on coverage. 2020-05-01 11:00:19 -07:00
tools Also include the header _isatty is declared for windows. 2020-02-07 10:38:22 -08:00
.clang-format
array2d_test.cc
array2d.h
array3d_test.cc
array3d.h
array4d_test.cc
array4d.h Add a default constructor to Array4D 2020-02-07 15:23:01 -08:00
array_test.cc
array.cc Add specialization of method Array::FillRandom() for complex64 type. 2019-12-13 16:56:06 -08:00
array.h Add specialization of method Array::FillRandom() for complex64 type. 2019-12-13 16:56:06 -08:00
bit_cast_test.cc
bit_cast.h
BUILD [XLA] Turn constant dynamic slices into slices. 2020-04-25 15:59:33 -07:00
comparison_util.cc
comparison_util.h
cpu_function_runtime.cc
cpu_function_runtime.h [XLA:CPU] Reconsider alignment heuristics 2020-04-24 02:51:55 -07:00
debug_options_flags.cc Have XLA CPU emit TraceMe calls by default in JIT mode (roll forward) 2020-04-22 20:21:01 -07:00
debug_options_flags.h minor spelling tweaks 2019-12-09 18:21:12 +09:00
debug_options_parsers_test.cc [ROCm] Fix for compile error in //tensorflow/compiler/xla:debug_options_parsers_test 2020-01-19 02:38:56 +00:00
debug_options_parsers.h Remove ReducePrecisionInsertion pass. 2019-11-04 02:09:31 -08:00
device_util.h
error_spec.h
executable_run_options.cc Add run_id to TraceMes. 2020-03-13 10:42:34 -07:00
executable_run_options.h Add a launch id field in run options and hlo module config. 2020-04-22 16:09:18 -07:00
execution_options_util.cc
execution_options_util.h minor spelling tweaks 2019-12-09 18:21:12 +09:00
index_util_test.cc
index_util.cc
index_util.h
iterator_util_test.cc
iterator_util.h
layout_test.cc [XLA] Remove unsupported sparse layout 2020-01-02 18:04:52 -08:00
layout_util_test.cc [XLA] Remove unsupported sparse layout 2020-01-02 18:04:52 -08:00
layout_util.cc This CL optimizes C++11 range-based for loops where the variable is copied in each iteration but it would suffice to obtain it by const reference. This is only applied to loop variables of types that are expensive to copy which means they are not trivially copyable or have a non-trivial copy constructor or destructor. 2020-04-06 17:55:23 -07:00
layout_util.h [XLA] Remove unsupported sparse layout 2020-01-02 18:04:52 -08:00
layout.cc This CL optimizes C++11 range-based for loops where the variable is copied in each iteration but it would suffice to obtain it by const reference. This is only applied to loop variables of types that are expensive to copy which means they are not trivially copyable or have a non-trivial copy constructor or destructor. 2020-04-06 17:55:23 -07:00
layout.h [XLA] Remove unsupported sparse layout 2020-01-02 18:04:52 -08:00
literal_comparison.cc minor spelling tweaks 2019-12-09 18:21:12 +09:00
literal_comparison.h
literal_test.cc Avoid dividing by zero in xla/literal_test. 2020-02-10 11:29:21 -08:00
literal_util.cc Introduce a few new literal conversion functions for various floating point types 2020-01-27 07:47:30 -08:00
literal_util.h Introduce a few new literal conversion functions for various floating point types 2020-01-27 07:47:30 -08:00
literal.cc [XLA] Turn constant dynamic slices into slices. 2020-04-25 15:59:33 -07:00
literal.h [XLA] Turn constant dynamic slices into slices. 2020-04-25 15:59:33 -07:00
map_util.h
metric_table_report.cc
metric_table_report.h
overflow_util.h
packed_literal_reader.cc
packed_literal_reader.h
parse_flags_from_env_test.cc Handle the cases where stdout may have CRLF suffix on windows in xla parse_flags_from_env_test 2020-03-12 14:33:13 -07:00
parse_flags_from_env.cc [XLA] Use LOG(QFATAL) rather than LOG(FATAL) in XLA_FLAGS argument parsing. 2019-11-27 10:27:55 -08:00
parse_flags_from_env.h minor spelling tweaks 2019-12-09 18:21:12 +09:00
primitive_util_test.cc
primitive_util.cc
primitive_util.h
protobuf_util.cc
protobuf_util.h
README.md
refcounting_hash_map_test.cc [XLA:GPU] Add experimental, lightly tested support for multi-host and multi-process NCCL AllReduce. 2020-02-21 14:00:29 -08:00
refcounting_hash_map.h [XLA:GPU] Add experimental, lightly tested support for multi-host and multi-process NCCL AllReduce. 2020-02-21 14:00:29 -08:00
reference_util_test.cc
reference_util.cc [XLA] Do extensive testing of int32 matmuls 2019-09-16 08:52:24 -07:00
reference_util.h [XLA] Do extensive testing of int32 matmuls 2019-09-16 08:52:24 -07:00
service_interface.h
shape_layout.cc [XLA] Implement dynamic input and output in DynamicPadder. 2019-12-12 10:31:20 -08:00
shape_layout.h
shape_test.cc minor spelling tweaks 2019-12-09 18:21:12 +09:00
shape_tree_test.cc
shape_tree.h
shape_util_test.cc [XLA] Add a few helper functions around dynamic dimensions. 2019-11-25 12:52:48 -08:00
shape_util.cc Fix 64-bit integer portability problems in TensorFlow compiler. 2020-01-16 13:16:05 -08:00
shape_util.h [XLA] Remove unsupported sparse layout 2020-01-02 18:04:52 -08:00
shape.cc [xla::Shape::Shape] Emplace tuple shapes into tuple_shapes_. 2020-03-03 15:06:38 -08:00
shape.h [XLA] Add HLO infrastructure to support dynamic op lowering. 2020-04-30 17:00:49 -07:00
status_macros_test.cc minor spelling tweaks 2019-12-09 18:21:12 +09:00
status_macros.cc
status_macros.h
status.h
statusor.h
test_helpers.h
test.h
text_literal_reader_test.cc
text_literal_reader.cc
text_literal_reader.h
text_literal_writer_test.cc Use Env::LocalTempFilename for a temp filename. 2020-02-19 17:30:20 -08:00
text_literal_writer.cc
text_literal_writer.h
types.h
util_test.cc [XLA] Make liberal use of inlined vectors to avoid memory allocation inside 2019-10-31 14:24:04 -07:00
util.cc [XLA] Add a utility to transform dimensions from one shape's dimensions to another. 2020-03-31 02:34:20 -07:00
util.h [XLA] Add a utility to transform dimensions from one shape's dimensions to another. 2020-03-31 02:34:20 -07:00
window_util_test.cc Add utility fct window_util::MakeWindow(sizes, strides) 2019-10-08 12:00:28 -07:00
window_util.cc Support empty window and 0D convolution. For non-GPUs it's already accidentally supported; for GPUs it's not hard to support anyway. 2020-01-13 14:51:47 -08:00
window_util.h Merge pull request from nouiz:small2 2019-10-14 04:08:20 -07:00
xla_data.proto Introduce new RngBitGenerator HLO 2020-02-06 05:06:44 -08:00
xla.bzl Migrate from xla_proto_library to tf_proto_library_cc. 2019-11-15 01:00:17 -08:00
xla.proto Have XLA CPU emit TraceMe calls by default in JIT mode (roll forward) 2020-04-22 20:21:01 -07:00

XLA (Accelerated Linear Algebra) is a domain-specific compiler for linear algebra that optimizes TensorFlow computations. See the documentation.