.. |
BUILD
|
[XLA/GPU] Add parallel compilation for LLVM and ptxas.
|
2020-12-08 09:22:49 -08:00 |
cublas_9_0.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cublas_10_0.inc
|
|
|
cublas_10_1.inc
|
|
|
cublas_10_2.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cublas_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cublas_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
cublasLt_11_0.inc
|
Merge pull request #43237 from benbarsdell:cublaslt
|
2020-10-15 14:39:38 -07:00 |
cublasLt_stub.cc
|
Add cublasLt wrappers to stream_executor
|
2020-09-17 15:01:02 +10:00 |
cuda_9_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_10_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_10_1.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_10_2.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cuda_activation.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_asm_compiler.cc
|
[XLA/GPU] Add parallel compilation for LLVM and ptxas.
|
2020-12-08 09:22:49 -08:00 |
cuda_blas.cc
|
Fix compile bug in cuda_blas.cc in dbg mode.
|
2020-11-05 22:21:19 -08:00 |
cuda_blas.h
|
Merge pull request #44175 from benbarsdell:cublaslt-large-batch-workaround-proper
|
2020-10-20 10:42:41 -07:00 |
cuda_diagnostics.cc
|
Optimize calls to std::string::find() and friends for a single char.
|
2020-12-17 18:11:13 -08:00 |
cuda_diagnostics.h
|
Qualify uses of std::string
|
2020-03-20 13:19:21 -07:00 |
cuda_dnn.cc
|
Prevent CHECK-fail in LSTM/GRU with zero-length input.
|
2020-12-07 20:37:00 -08:00 |
cuda_dnn.h
|
Roll forward the original CL.
|
2020-09-09 20:55:27 -07:00 |
cuda_driver_test.cc
|
[SE] Don't assume that the CUDA context has not changed in the outermost ScopedActivationContext.
|
2020-08-10 15:24:22 -07:00 |
cuda_driver.cc
|
Add virtual memory management function wrappers to GpuDriver.
|
2020-12-09 23:08:33 -08:00 |
cuda_driver.h
|
|
|
cuda_event.cc
|
|
|
cuda_event.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_fft.cc
|
[StreamExecutor] Workaround the cuFFT bug in CUDA 10.1/10.2/11.
|
2020-06-30 19:14:59 -07:00 |
cuda_fft.h
|
[StreamExecutor] Workaround the cuFFT bug in CUDA 10.1/10.2/11.
|
2020-06-30 19:14:59 -07:00 |
cuda_gpu_executor.cc
|
Remove SharedMemoryConfig since it is not used anywhere.
|
2020-08-11 20:01:04 -07:00 |
cuda_gpu_executor.h
|
|
|
cuda_helpers.h
|
Refactor code that enables deterministic operation of cuDNN
|
2019-12-27 13:08:14 -08:00 |
cuda_kernel.cc
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_kernel.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_platform_id.cc
|
|
|
cuda_platform_id.h
|
|
|
cuda_platform.cc
|
Qualify uses of std::string
|
2020-03-20 13:19:21 -07:00 |
cuda_platform.h
|
Qualify uses of std::string
|
2020-03-20 13:19:21 -07:00 |
cuda_rng.cc
|
|
|
cuda_rng.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_runtime_9_0.inc
|
Add cuda runtime 9.0 API to dlopen wrapper.
|
2019-04-25 15:20:10 -07:00 |
cuda_runtime_10_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_runtime_10_1.inc
|
Add cuda runtime stub for 10.1.
|
2019-05-03 12:10:46 -07:00 |
cuda_runtime_10_2.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_runtime_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cuda_stream.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
cuda_timer.h
|
|
|
cudart_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
cudnn_6_0.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_0.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_1.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_3.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_4.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_6.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_8_0.inc
|
Eager loading of CUDNN sub-libraries.
|
2020-10-01 19:51:30 -05:00 |
cudnn_stub.cc
|
Roll PR #39577 (CUDNN v8 support) forward with fix:
|
2020-06-02 12:54:05 -07:00 |
cudnn_version_test.cc
|
Clean up cuDNN compatible version check.
|
2020-07-01 11:26:32 -07:00 |
cudnn_version.cc
|
Clean up cuDNN compatible version check.
|
2020-07-01 11:26:32 -07:00 |
cudnn_version.h
|
[SE] Remove some uses of TF string utils in StreamExecutor.
|
2019-04-26 22:00:34 -07:00 |
cufft_9_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cufft_10_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cufft_stub.cc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cupti_10_0.inc
|
|
|
cupti_stub.cc
|
fix build due to cupti static link.
|
2020-05-29 17:43:42 -07:00 |
curand_10_0.inc
|
|
|
curand_stub.cc
|
|
|
cusolver_dense_9_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cusolver_dense_10_0.inc
|
Remove kernels' explicit dependency on cusolver and cusparse.
|
2019-04-16 15:53:13 -07:00 |
cusolver_dense_10_1.inc
|
Add 10.1 inc files for cuda libraries.
|
2019-08-22 10:25:49 -07:00 |
cusolver_dense_10_2.inc
|
Remove redundant cusolverDnIRSInfosGetNiters
|
2020-03-23 16:04:05 -07:00 |
cusolver_dense_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cusolver_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
cusparse_9_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cusparse_10_0.inc
|
Remove kernels' explicit dependency on cusolver and cusparse.
|
2019-04-16 15:53:13 -07:00 |
cusparse_10_1.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cusparse_10_2.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cusparse_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cusparse_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
memcpy_test.cc
|
Remove device memory check, since it's incorrect when the pointer is pointing to pinned host memory. Also, memcpy would fail if the pointer is invalid, so we don't need an additional check.
|
2019-11-22 14:06:59 -08:00 |
redzone_allocator_test.cc
|
Extract GpuAsmOpts struct into its own header file.
|
2020-02-07 02:18:09 -08:00 |