.. |
BUILD
|
Plumb TF32 for cublas gemm
|
2020-06-19 15:12:30 -07:00 |
cublas_9_0.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cublas_10_0.inc
|
Run clang-format on cuda 10.0 inc files, so we can see better diffs for future
|
2019-08-21 17:51:09 -07:00 |
cublas_10_1.inc
|
Add 10.1 inc files for cuda libraries.
|
2019-08-22 10:25:49 -07:00 |
cublas_10_2.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cublas_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cublas_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
cuda_9_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_10_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_10_1.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_10_2.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cuda_activation.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_blas.cc
|
Merge pull request #40624 from nluehr:TF32-v2
|
2020-06-22 17:23:41 -07:00 |
cuda_blas.h
|
Merge pull request #40624 from nluehr:TF32-v2
|
2020-06-22 17:23:41 -07:00 |
cuda_diagnostics.cc
|
Qualify uses of std::string
|
2020-03-20 13:19:21 -07:00 |
cuda_diagnostics.h
|
Qualify uses of std::string
|
2020-03-20 13:19:21 -07:00 |
cuda_dnn.cc
|
Merge pull request #40624 from nluehr:TF32-v2
|
2020-06-22 17:23:41 -07:00 |
cuda_dnn.h
|
PR #37679: Support two CUDNN CTC Loss algorithms
|
2020-04-10 14:15:07 -07:00 |
cuda_driver.cc
|
Merge pull request #39956 from bhack:patch-2
|
2020-06-04 23:22:55 -07:00 |
cuda_driver.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_event.cc
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_event.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_fft.cc
|
[SE] Remove Stream* argument from ScratchAllocator methods
|
2019-08-05 11:45:28 -07:00 |
cuda_fft.h
|
Copying cuBLAS and cuDNN headers into separate directories.
|
2019-05-11 07:25:42 -07:00 |
cuda_gpu_executor.cc
|
Qualify uses of std::string
|
2020-03-20 13:19:21 -07:00 |
cuda_gpu_executor.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_helpers.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_kernel.cc
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_kernel.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_platform_id.cc
|
[StreamExecutor] Rename ::perftools::gputools -> ::stream_executor, part 1.
|
2018-04-17 14:28:51 -07:00 |
cuda_platform_id.h
|
[StreamExecutor] Rename ::perftools::gputools -> ::stream_executor, part 1.
|
2018-04-17 14:28:51 -07:00 |
cuda_platform.cc
|
Qualify uses of std::string
|
2020-03-20 13:19:21 -07:00 |
cuda_platform.h
|
Qualify uses of std::string
|
2020-03-20 13:19:21 -07:00 |
cuda_rng.cc
|
Copying cuBLAS and cuDNN headers into separate directories.
|
2019-05-11 07:25:42 -07:00 |
cuda_rng.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_runtime_9_0.inc
|
Add cuda runtime 9.0 API to dlopen wrapper.
|
2019-04-25 15:20:10 -07:00 |
cuda_runtime_10_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_runtime_10_1.inc
|
Add cuda runtime stub for 10.1.
|
2019-05-03 12:10:46 -07:00 |
cuda_runtime_10_2.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cuda_runtime_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cuda_stream.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cuda_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
cuda_timer.h
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cudart_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
cudnn_6_0.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_0.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_1.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_3.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_4.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_7_6.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cudnn_8_0.inc
|
Remove cudnn RNN algo APIs from cudnn 8 inc file
|
2020-06-09 11:41:59 -05:00 |
cudnn_stub.cc
|
Roll PR #39577 (CUDNN v8 support) forward with fix:
|
2020-06-02 12:54:05 -07:00 |
cudnn_version_test.cc
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cudnn_version.cc
|
PR #25011: [GPU][ROCm][CUDA] StreamExecutor logic for ROCm / CUDA platform (PR 20709 / 22669 / 24156 continued)
|
2019-01-28 11:30:18 -08:00 |
cudnn_version.h
|
[SE] Remove some uses of TF string utils in StreamExecutor.
|
2019-04-26 22:00:34 -07:00 |
cufft_9_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cufft_10_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cufft_stub.cc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cupti_10_0.inc
|
Run clang-format on cuda 10.0 inc files, so we can see better diffs for future
|
2019-08-21 17:51:09 -07:00 |
cupti_stub.cc
|
fix build due to cupti static link.
|
2020-05-29 17:43:42 -07:00 |
curand_10_0.inc
|
Run clang-format on cuda 10.0 inc files, so we can see better diffs for future
|
2019-08-21 17:51:09 -07:00 |
curand_stub.cc
|
Copying cuBLAS and cuDNN headers into separate directories.
|
2019-05-11 07:25:42 -07:00 |
cusolver_dense_9_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cusolver_dense_10_0.inc
|
Remove kernels' explicit dependency on cusolver and cusparse.
|
2019-04-16 15:53:13 -07:00 |
cusolver_dense_10_1.inc
|
Add 10.1 inc files for cuda libraries.
|
2019-08-22 10:25:49 -07:00 |
cusolver_dense_10_2.inc
|
Remove redundant cusolverDnIRSInfosGetNiters
|
2020-03-23 16:04:05 -07:00 |
cusolver_dense_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cusolver_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
cusparse_9_0.inc
|
Regenerated wrapper includes for all CUDA versions & libraries.
|
2020-03-19 13:30:00 -07:00 |
cusparse_10_0.inc
|
Remove kernels' explicit dependency on cusolver and cusparse.
|
2019-04-16 15:53:13 -07:00 |
cusparse_10_1.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cusparse_10_2.inc
|
Format generated CUDA stub files.
|
2020-05-26 23:10:31 -07:00 |
cusparse_11_0.inc
|
Add CUDA 11 stub files.
|
2020-05-26 23:19:19 -07:00 |
cusparse_stub.cc
|
Relax stub include version checking.
|
2020-06-17 07:48:07 -07:00 |
memcpy_test.cc
|
Remove device memory check, since it's incorrect when the pointer is pointing to pinned host memory. Also, memcpy would fail if the pointer is invalid, so we don't need an additional check.
|
2019-11-22 14:06:59 -08:00 |
redzone_allocator_test.cc
|
Extract GpuAsmOpts struct into its own header file.
|
2020-02-07 02:18:09 -08:00 |