STT-tensorflow

History

Peter Hawkins 077b553fda [XLA:Python] Specify a 2MiB stack size for host stream threads. [StreamExecutor] Allow HostExecutor users to control the stack sizes of threads used for HostStream via. Also include non_portable_tags in the keys used when creating an Executor. There seems to be no good reason that it is omitted. Will fix https://github.com/google/jax/issues/432 when included in a jaxlib release. PiperOrigin-RevId: 309472318 Change-Id: Ia2535616047390d6bf6f2da82a666a321dcc9f5d		2020-05-01 14:20:58 -07:00
..
client	Add optional layout constraint to AllToAll	2020-04-27 18:37:21 -07:00
experimental/xla_sharding	Allow user to pass input_shape to split.	2020-02-19 18:04:17 -08:00
g3doc	[XLA] [Docs] Clarify AllReduce semantics when the input is a tuple	2020-04-03 14:02:48 -07:00
pjrt	[XLA:Python] Specify a 2MiB stack size for host stream threads.	2020-05-01 14:20:58 -07:00
python	Split non-Python PJRT classes into their own directory.	2020-05-01 10:01:09 -07:00
python_api	minor spelling tweaks	2019-12-09 18:21:12 +09:00
rpc	Use tf_grpc_dependency in tensorflow/compiler/xla	2020-04-02 22:42:59 -07:00
service	[XLA:Python] Specify a 2MiB stack size for host stream threads.	2020-05-01 14:20:58 -07:00
tests	[XLA:coverage] Disable tests timing out on coverage.	2020-05-01 11:00:19 -07:00
tools	Also include the header _isatty is declared for windows.	2020-02-07 10:38:22 -08:00
.clang-format
array2d_test.cc
array2d.h
array3d_test.cc
array3d.h
array4d_test.cc
array4d.h	Add a default constructor to Array4D	2020-02-07 15:23:01 -08:00
array_test.cc
array.cc	Add specialization of method Array::FillRandom() for complex64 type.	2019-12-13 16:56:06 -08:00
array.h	Add specialization of method Array::FillRandom() for complex64 type.	2019-12-13 16:56:06 -08:00
bit_cast_test.cc
bit_cast.h
BUILD	[XLA] Turn constant dynamic slices into slices.	2020-04-25 15:59:33 -07:00
comparison_util.cc
comparison_util.h
cpu_function_runtime.cc
cpu_function_runtime.h	[XLA:CPU] Reconsider alignment heuristics	2020-04-24 02:51:55 -07:00
debug_options_flags.cc	Have XLA CPU emit TraceMe calls by default in JIT mode (roll forward)	2020-04-22 20:21:01 -07:00
debug_options_flags.h	minor spelling tweaks	2019-12-09 18:21:12 +09:00
debug_options_parsers_test.cc	[ROCm] Fix for compile error in //tensorflow/compiler/xla:debug_options_parsers_test	2020-01-19 02:38:56 +00:00
debug_options_parsers.h	Remove ReducePrecisionInsertion pass.	2019-11-04 02:09:31 -08:00
device_util.h
error_spec.h
executable_run_options.cc	Add run_id to TraceMes.	2020-03-13 10:42:34 -07:00
executable_run_options.h	Add a launch id field in run options and hlo module config.	2020-04-22 16:09:18 -07:00
execution_options_util.cc
execution_options_util.h	minor spelling tweaks	2019-12-09 18:21:12 +09:00
index_util_test.cc
index_util.cc
index_util.h
iterator_util_test.cc
iterator_util.h
layout_test.cc	[XLA] Remove unsupported sparse layout	2020-01-02 18:04:52 -08:00
layout_util_test.cc	[XLA] Remove unsupported sparse layout	2020-01-02 18:04:52 -08:00
layout_util.cc	This CL optimizes C++11 range-based for loops where the variable is copied in each iteration but it would suffice to obtain it by const reference. This is only applied to loop variables of types that are expensive to copy which means they are not trivially copyable or have a non-trivial copy constructor or destructor.	2020-04-06 17:55:23 -07:00
layout_util.h	[XLA] Remove unsupported sparse layout	2020-01-02 18:04:52 -08:00
layout.cc	This CL optimizes C++11 range-based for loops where the variable is copied in each iteration but it would suffice to obtain it by const reference. This is only applied to loop variables of types that are expensive to copy which means they are not trivially copyable or have a non-trivial copy constructor or destructor.	2020-04-06 17:55:23 -07:00
layout.h	[XLA] Remove unsupported sparse layout	2020-01-02 18:04:52 -08:00
literal_comparison.cc	minor spelling tweaks	2019-12-09 18:21:12 +09:00
literal_comparison.h
literal_test.cc	Avoid dividing by zero in xla/literal_test.	2020-02-10 11:29:21 -08:00
literal_util.cc	Introduce a few new literal conversion functions for various floating point types	2020-01-27 07:47:30 -08:00
literal_util.h	Introduce a few new literal conversion functions for various floating point types	2020-01-27 07:47:30 -08:00
literal.cc	[XLA] Turn constant dynamic slices into slices.	2020-04-25 15:59:33 -07:00
literal.h	[XLA] Turn constant dynamic slices into slices.	2020-04-25 15:59:33 -07:00
map_util.h
metric_table_report.cc
metric_table_report.h
overflow_util.h
packed_literal_reader.cc
packed_literal_reader.h
parse_flags_from_env_test.cc	Handle the cases where stdout may have CRLF suffix on windows in xla parse_flags_from_env_test	2020-03-12 14:33:13 -07:00
parse_flags_from_env.cc	[XLA] Use LOG(QFATAL) rather than LOG(FATAL) in XLA_FLAGS argument parsing.	2019-11-27 10:27:55 -08:00
parse_flags_from_env.h	minor spelling tweaks	2019-12-09 18:21:12 +09:00
primitive_util_test.cc
primitive_util.cc
primitive_util.h
protobuf_util.cc
protobuf_util.h
README.md
refcounting_hash_map_test.cc	[XLA:GPU] Add experimental, lightly tested support for multi-host and multi-process NCCL AllReduce.	2020-02-21 14:00:29 -08:00
refcounting_hash_map.h	[XLA:GPU] Add experimental, lightly tested support for multi-host and multi-process NCCL AllReduce.	2020-02-21 14:00:29 -08:00
reference_util_test.cc
reference_util.cc	[XLA] Do extensive testing of int32 matmuls	2019-09-16 08:52:24 -07:00
reference_util.h	[XLA] Do extensive testing of int32 matmuls	2019-09-16 08:52:24 -07:00
service_interface.h
shape_layout.cc	[XLA] Implement dynamic input and output in DynamicPadder.	2019-12-12 10:31:20 -08:00
shape_layout.h
shape_test.cc	minor spelling tweaks	2019-12-09 18:21:12 +09:00
shape_tree_test.cc
shape_tree.h
shape_util_test.cc	[XLA] Add a few helper functions around dynamic dimensions.	2019-11-25 12:52:48 -08:00
shape_util.cc	Fix 64-bit integer portability problems in TensorFlow compiler.	2020-01-16 13:16:05 -08:00
shape_util.h	[XLA] Remove unsupported sparse layout	2020-01-02 18:04:52 -08:00
shape.cc	[xla::Shape::Shape] Emplace tuple shapes into `tuple_shapes_`.	2020-03-03 15:06:38 -08:00
shape.h	[XLA]Add HLO infrastructure to support dynamic op lowering.	2020-04-30 17:00:49 -07:00
status_macros_test.cc	minor spelling tweaks	2019-12-09 18:21:12 +09:00
status_macros.cc
status_macros.h
status.h
statusor.h
test_helpers.h
test.h
text_literal_reader_test.cc
text_literal_reader.cc
text_literal_reader.h
text_literal_writer_test.cc	Use Env::LocalTempFilename for a temp filename.	2020-02-19 17:30:20 -08:00
text_literal_writer.cc
text_literal_writer.h
types.h
util_test.cc	[XLA] Make liberal use of inlined vectors to avoid memory allocation inside	2019-10-31 14:24:04 -07:00
util.cc	[XLA] Add a utility to transform dimensions from one shape's dimensions to another.	2020-03-31 02:34:20 -07:00
util.h	[XLA] Add a utility to transform dimensions from one shape's dimensions to another.	2020-03-31 02:34:20 -07:00
window_util_test.cc	Add utiliry fct window_util::MakeWindow(sizes, strides)	2019-10-08 12:00:28 -07:00
window_util.cc	Support empty window and 0D convolution. For non-GPUs it's already accidentally supported; for GPUs it's not hard to support anyway.	2020-01-13 14:51:47 -08:00
window_util.h	Merge pull request #33257 from nouiz:small2	2019-10-14 04:08:20 -07:00
xla_data.proto	Introduce new RngBitGenerator HLO	2020-02-06 05:06:44 -08:00
xla.bzl	Migrate from xla_proto_library to tf_proto_library_cc.	2019-11-15 01:00:17 -08:00
xla.proto	Have XLA CPU emit TraceMe calls by default in JIT mode (roll forward)	2020-04-22 20:21:01 -07:00

README.md

XLA (Accelerated Linear Algebra) is a domain-specific compiler for linear algebra that optimizes TensorFlow computations. See the documentation.