STT-tensorflow/tensorflow/stream_executor/cuda
TensorFlower Gardener 7c38468051 Merge pull request from nluehr:TF32-v2
PiperOrigin-RevId: 317757557
Change-Id: I0a0f0cc9025db7d1fbc7975b07d7d934c6fa8c2f
2020-06-22 17:23:41 -07:00
..
BUILD Plumb TF32 for cublas gemm 2020-06-19 15:12:30 -07:00
cublas_9_0.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cublas_10_0.inc Run clang-format on cuda 10.0 inc files, so we can see better diffs for future 2019-08-21 17:51:09 -07:00
cublas_10_1.inc Add 10.1 inc files for cuda libraries. 2019-08-22 10:25:49 -07:00
cublas_10_2.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cublas_11_0.inc Add CUDA 11 stub files. 2020-05-26 23:19:19 -07:00
cublas_stub.cc Relax stub include version checking. 2020-06-17 07:48:07 -07:00
cuda_9_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_10_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_10_1.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_10_2.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_11_0.inc Add CUDA 11 stub files. 2020-05-26 23:19:19 -07:00
cuda_activation.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_blas.cc Merge pull request from nluehr:TF32-v2 2020-06-22 17:23:41 -07:00
cuda_blas.h Merge pull request from nluehr:TF32-v2 2020-06-22 17:23:41 -07:00
cuda_diagnostics.cc Qualify uses of std::string 2020-03-20 13:19:21 -07:00
cuda_diagnostics.h Qualify uses of std::string 2020-03-20 13:19:21 -07:00
cuda_dnn.cc Merge pull request from nluehr:TF32-v2 2020-06-22 17:23:41 -07:00
cuda_dnn.h PR : Support two CUDNN CTC Loss algorithms 2020-04-10 14:15:07 -07:00
cuda_driver.cc Merge pull request from bhack:patch-2 2020-06-04 23:22:55 -07:00
cuda_driver.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_event.cc PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_event.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_fft.cc [SE] Remove Stream* argument from ScratchAllocator methods 2019-08-05 11:45:28 -07:00
cuda_fft.h Copying cuBLAS and cuDNN headers into separate directories. 2019-05-11 07:25:42 -07:00
cuda_gpu_executor.cc Qualify uses of std::string 2020-03-20 13:19:21 -07:00
cuda_gpu_executor.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_helpers.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_kernel.cc PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_kernel.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_platform_id.cc [StreamExecutor] Rename ::perftools::gputools -> ::stream_executor, part 1. 2018-04-17 14:28:51 -07:00
cuda_platform_id.h [StreamExecutor] Rename ::perftools::gputools -> ::stream_executor, part 1. 2018-04-17 14:28:51 -07:00
cuda_platform.cc Qualify uses of std::string 2020-03-20 13:19:21 -07:00
cuda_platform.h Qualify uses of std::string 2020-03-20 13:19:21 -07:00
cuda_rng.cc Copying cuBLAS and cuDNN headers into separate directories. 2019-05-11 07:25:42 -07:00
cuda_rng.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_runtime_9_0.inc Add cuda runtime 9.0 API to dlopen wrapper. 2019-04-25 15:20:10 -07:00
cuda_runtime_10_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_runtime_10_1.inc Add cuda runtime stub for 10.1. 2019-05-03 12:10:46 -07:00
cuda_runtime_10_2.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cuda_runtime_11_0.inc Add CUDA 11 stub files. 2020-05-26 23:19:19 -07:00
cuda_stream.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cuda_stub.cc Relax stub include version checking. 2020-06-17 07:48:07 -07:00
cuda_timer.h PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cudart_stub.cc Relax stub include version checking. 2020-06-17 07:48:07 -07:00
cudnn_6_0.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cudnn_7_0.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cudnn_7_1.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cudnn_7_3.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cudnn_7_4.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cudnn_7_6.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cudnn_8_0.inc Remove cudnn RNN algo APIs from cudnn 8 inc file 2020-06-09 11:41:59 -05:00
cudnn_stub.cc Roll PR (CUDNN v8 support) forward with fix: 2020-06-02 12:54:05 -07:00
cudnn_version_test.cc PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cudnn_version.cc PR : [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued) 2019-01-28 11:30:18 -08:00
cudnn_version.h [SE] Remove some uses of TF string utils in StreamExecutor. 2019-04-26 22:00:34 -07:00
cufft_9_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cufft_10_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cufft_stub.cc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cupti_10_0.inc Run clang-format on cuda 10.0 inc files, so we can see better diffs for future 2019-08-21 17:51:09 -07:00
cupti_stub.cc fix build due to cupti static link. 2020-05-29 17:43:42 -07:00
curand_10_0.inc Run clang-format on cuda 10.0 inc files, so we can see better diffs for future 2019-08-21 17:51:09 -07:00
curand_stub.cc Copying cuBLAS and cuDNN headers into separate directories. 2019-05-11 07:25:42 -07:00
cusolver_dense_9_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cusolver_dense_10_0.inc Remove kernels' explicit dependency on cusolver and cusparse. 2019-04-16 15:53:13 -07:00
cusolver_dense_10_1.inc Add 10.1 inc files for cuda libraries. 2019-08-22 10:25:49 -07:00
cusolver_dense_10_2.inc Remove redundant cusolverDnIRSInfosGetNiters 2020-03-23 16:04:05 -07:00
cusolver_dense_11_0.inc Add CUDA 11 stub files. 2020-05-26 23:19:19 -07:00
cusolver_stub.cc Relax stub include version checking. 2020-06-17 07:48:07 -07:00
cusparse_9_0.inc Regenerated wrapper includes for all CUDA versions & libraries. 2020-03-19 13:30:00 -07:00
cusparse_10_0.inc Remove kernels' explicit dependency on cusolver and cusparse. 2019-04-16 15:53:13 -07:00
cusparse_10_1.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cusparse_10_2.inc Format generated CUDA stub files. 2020-05-26 23:10:31 -07:00
cusparse_11_0.inc Add CUDA 11 stub files. 2020-05-26 23:19:19 -07:00
cusparse_stub.cc Relax stub include version checking. 2020-06-17 07:48:07 -07:00
memcpy_test.cc Remove device memory check, since it's incorrect when the pointer is pointing to pinned host memory. Also, memcpy would fail if the pointer is invalid, so we don't need an additional check. 2019-11-22 14:06:59 -08:00
redzone_allocator_test.cc Extract GpuAsmOpts struct into its own header file. 2020-02-07 02:18:09 -08:00