STT-tensorflow/tensorflow/stream_executor/cuda
TensorFlower Gardener 0092ebe4c7 Merge pull request #46300 from nluehr:tf32-fixes
PiperOrigin-RevId: 356348412
Change-Id: I7df8f2d191fede578d54db4ec38b7a6a6ae11865
2021-02-08 14:08:55 -08:00
BUILD [XLA/GPU] Add parallel compilation for LLVM and ptxas. 2020-12-08 09:22:49 -08:00
cublas_9_0.inc
cublas_10_0.inc
cublas_10_1.inc
cublas_10_2.inc
cublas_11_0.inc
cublas_stub.cc Relax stub include version checking. 2020-06-17 07:48:07 -07:00
cublasLt_11_0.inc Merge pull request #43237 from benbarsdell:cublaslt 2020-10-15 14:39:38 -07:00
cublasLt_stub.cc Add cublasLt wrappers to stream_executor 2020-09-17 15:01:02 +10:00
cuda_9_0.inc
cuda_10_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_10_1.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_10_2.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_11_0.inc
cuda_11_2.inc Add cuda 11.2 driver and runtime inc files 2021-01-15 13:26:30 -06:00
cuda_activation.h
cuda_asm_compiler.cc [XLA/GPU] Add parallel compilation for LLVM and ptxas. 2020-12-08 09:22:49 -08:00
cuda_blas.cc Fix compile bug in cuda_blas.cc in dbg mode. 2020-11-05 22:21:19 -08:00
cuda_blas.h Merge pull request #44175 from benbarsdell:cublaslt-large-batch-workaround-proper 2020-10-20 10:42:41 -07:00
cuda_diagnostics.cc Optimize calls to std::string::find() and friends for a single char. 2020-12-17 18:11:13 -08:00
cuda_diagnostics.h Qualify uses of std::string 2020-03-20 13:19:21 -07:00
cuda_dnn.cc Merge pull request #46300 from nluehr:tf32-fixes 2021-02-08 14:08:55 -08:00
cuda_dnn.h Roll forward the original CL. 2020-09-09 20:55:27 -07:00
cuda_driver_test.cc [SE] Don't assume that the CUDA context has not changed in the outermost ScopedActivationContext. 2020-08-10 15:24:22 -07:00
cuda_driver.cc Use device handle instead of gpu ordinal in GpuVirtualMemAllocator for 2021-01-21 16:11:33 -08:00
cuda_driver.h
cuda_event.cc
cuda_event.h
cuda_fft.cc [StreamExecutor] Workaround the cuFFT bug in CUDA 10.1/10.2/11. 2020-06-30 19:14:59 -07:00
cuda_fft.h [StreamExecutor] Workaround the cuFFT bug in CUDA 10.1/10.2/11. 2020-06-30 19:14:59 -07:00
cuda_gpu_executor.cc Lower priority of missing NUMA /sys file log message. 2021-01-12 11:51:14 -08:00
cuda_gpu_executor.h
cuda_helpers.h PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_kernel.cc PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_kernel.h
cuda_platform_id.cc
cuda_platform_id.h [StreamExecutor] Rename ::perftools::gputools -> ::stream_executor, part 1. 2018-04-17 14:28:51 -07:00
cuda_platform.cc Qualify uses of std::string 2020-03-20 13:19:21 -07:00
cuda_platform.h Qualify uses of std::string 2020-03-20 13:19:21 -07:00
cuda_rng.cc
cuda_rng.h PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_runtime_9_0.inc Add cuda runtime 9.0 API to dlopen wrapper. 2019-04-25 15:20:10 -07:00
cuda_runtime_10_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_runtime_10_1.inc Add cuda runtime stub for 10.1. 2019-05-03 12:10:46 -07:00
cuda_runtime_10_2.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_runtime_11_0.inc Add CUDA 11 stub files. 2020-05-26 23:19:19 -07:00
cuda_runtime_11_2.inc Add cuda 11.2 driver and runtime inc files 2021-01-15 13:26:30 -06:00
cuda_stream.h PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_stub.cc Add cuda 11.2 driver and runtime inc files 2021-01-15 13:26:30 -06:00
cuda_timer.h
cudart_stub.cc Add cuda 11.2 driver and runtime inc files 2021-01-15 13:26:30 -06:00
cudnn_6_0.inc
cudnn_7_0.inc
cudnn_7_1.inc
cudnn_7_3.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cudnn_7_4.inc
cudnn_7_6.inc
cudnn_8_0.inc Eager loading of CUDNN sub-libraries. 2020-10-01 19:51:30 -05:00
cudnn_stub.cc Roll PR #39577 (CUDNN v8 support) forward with fix: 2020-06-02 12:54:05 -07:00
cudnn_version_test.cc Clean up cuDNN compatible version check. 2020-07-01 11:26:32 -07:00
cudnn_version.cc Clean up cuDNN compatible version check. 2020-07-01 11:26:32 -07:00
cudnn_version.h [SE] Remove some uses of TF string utils in StreamExecutor. 2019-04-26 22:00:34 -07:00
cufft_9_0.inc
cufft_10_0.inc
cufft_stub.cc
cupti_10_0.inc
cupti_stub.cc fix build due to cupti static link. 2020-05-29 17:43:42 -07:00
curand_10_0.inc
curand_stub.cc
cusolver_dense_9_0.inc
cusolver_dense_10_0.inc
cusolver_dense_10_1.inc
cusolver_dense_10_2.inc
cusolver_dense_11_0.inc
cusolver_stub.cc Relax stub include version checking. 2020-06-17 07:48:07 -07:00
cusparse_9_0.inc
cusparse_10_0.inc Remove kernels' explicit dependency on cusolver and cusparse. 2019-04-16 15:53:13 -07:00
cusparse_10_1.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cusparse_10_2.inc
cusparse_11_0.inc
cusparse_stub.cc Relax stub include version checking. 2020-06-17 07:48:07 -07:00
memcpy_test.cc
redzone_allocator_test.cc