Frank Chen
23e387975e
Rename tpu_load_library to tpu_api_dlsym_initializer.
...
This better reflects what the load library module does.
PiperOrigin-RevId: 317763973
Change-Id: I66169394c2f1d77b260323cb81d76f0f4e167579
2020-06-22 17:45:58 -07:00
TensorFlower Gardener
7c38468051
Merge pull request #40624 from nluehr:TF32-v2
...
PiperOrigin-RevId: 317757557
Change-Id: I0a0f0cc9025db7d1fbc7975b07d7d934c6fa8c2f
2020-06-22 17:23:41 -07:00
Nathan Luehr
3f2c98610e
Move ScopedCublasMathMode inside DoBlasInternalImpl
2020-06-22 10:53:03 -07:00
Nathan Luehr
9afaf559d9
Locate CUDNN_FMA_MATH inside CUDNN 8 macros
2020-06-22 12:28:02 -05:00
Wenhao Jia
715b02167d
Restore TpuPlatform auto registration code.
...
PiperOrigin-RevId: 317409587
Change-Id: If44d7a39a45c4c7026f70a4d79d965a54c4db295
2020-06-19 17:31:38 -07:00
Nathan Luehr
54a7bd1d73
Remove unnecessary tf32_execution_allowed check
2020-06-19 18:02:40 -05:00
Nathan Luehr
434f49d54f
Rework gemm TF32
2020-06-19 15:12:30 -07:00
Nathan Luehr
6ff37474dd
Use CUDNN_TENSOR_OP_MATH to enable tensor cores.
2020-06-19 15:12:30 -07:00
Nathan Luehr
03d836e261
Plumb TF32 for cublas gemm
2020-06-19 15:12:30 -07:00
Nathan Luehr
107e634823
Plumb TF32 for RNN
2020-06-19 15:12:30 -07:00
Nathan Luehr
376efd71b1
Convolution TF32 Plumbing
2020-06-19 15:12:30 -07:00
Nathan Luehr
32d63d0a3e
Removed TENSOR_OP disable env vars.
...
* TF_DISABLE_CUBLAS_TENSOR_OP_MATH
* TF_DISABLE_CUDNN_TENSOR_OP_MATH
* TF_DISABLE_CUDNN_RNN_TENSOR_OP_MATH
2020-06-19 15:12:30 -07:00
A. Unique TensorFlower
9c4b749b09
Internal change
...
PiperOrigin-RevId: 317230321
Change-Id: I043dae37768f6e9cf946d4db2a8c36123ed2d6d9
2020-06-18 19:24:46 -07:00
TensorFlower Gardener
18ca7c95d4
Merge pull request #39764 from nluehr:TF32
...
PiperOrigin-RevId: 317221015
Change-Id: I228bfa51b797fe52112012932a1dba9a356151ba
2020-06-18 18:09:37 -07:00
A. Unique TensorFlower
aab151356d
[pjrt] Refresh stream error status in strategic places to flush out silent failures.
...
PiperOrigin-RevId: 317204018
Change-Id: If75a3ad9ec846ce1621cdba92a2dc738b65b7001
2020-06-18 16:28:27 -07:00
Frank Chen
cf00e559d7
Add TPU configuration ops to default TensorFlow build
...
PiperOrigin-RevId: 317133514
Change-Id: I33bc6d7fdbba5915bd0d1291d4e086139c07eb14
2020-06-18 10:52:49 -07:00
Nathan Luehr
3ebc53c394
Relax stub include version checking.
...
Remove upper bound on version check for latest inc files.
2020-06-17 07:48:07 -07:00
A. Unique TensorFlower
267f956246
load libcupti.so shared object if libcupti.so.10.1 is not installed. This makes it possible for tensorflow to load CUPTI 10.2 via libcupti.so symlink in the event that libcupti.so.10.1 is missing.
...
PiperOrigin-RevId: 316782424
Change-Id: I930912f1d3a8fa80e8b91c41214c374650f08847
2020-06-16 16:49:55 -07:00
Christian Sigg
09cf51da59
Fix switch statement for CUDNN_CONVOLUTION_BWD_FILTER_ALGO_FFT_TILING.
...
PiperOrigin-RevId: 316522188
Change-Id: I6557506f90d4bf6a91e97b65abd494d58769e3a3
2020-06-15 12:43:25 -07:00
TensorFlower Gardener
4f07e60caf
Merge pull request #40335 from nluehr:cudnn8_algo_cleanup
...
PiperOrigin-RevId: 316162945
Change-Id: I72bc02421cbb545f68049a5f36c0a6aea7dbba1a
2020-06-12 13:21:32 -07:00
Wenhao Jia
d7d97a0d72
Internal change only.
...
PiperOrigin-RevId: 316058358
Change-Id: I2317fe006b40b9f930e6610ea7c059d15662dcb1
2020-06-12 00:14:31 -07:00
Nathan Luehr
013ddd96d0
Use CUDNN_TENSOR_OP_MATH to enable tensor cores.
2020-06-11 16:41:28 -05:00
TensorFlower Gardener
f60069b7bb
Merge pull request #39734 from nouiz:vlog_ptxas
...
PiperOrigin-RevId: 315938190
Change-Id: Ib5ac12f94e3eaac65e97bdfd650af95bedf3f326
2020-06-11 11:26:47 -07:00
Henry Tan
f418a84299
Reorganizing TPU OPS C_API related files.
...
PiperOrigin-RevId: 315834750
Change-Id: I501ac67f75130e6a1114b75db5ea262e8ef3890c
2020-06-10 22:22:48 -07:00
Henry Tan
c055650aba
TPU internal change.
...
PiperOrigin-RevId: 315644771
Change-Id: I43705573dd797add30f776ec5903c1f98c614061
2020-06-10 00:48:07 -07:00
Nathan Luehr
9040800a7b
Remove cudnn RNN algo APIs from cudnn 8 inc file
2020-06-09 11:41:59 -05:00
Frank Chen
13c09da422
Open sourcing some TPU-related work
...
PiperOrigin-RevId: 315431095
Change-Id: I734632c0e5723dfca37acf53bbbd2b378b04c95d
2020-06-09 00:03:31 -07:00
Frederic Bastien
3946ad2bfd
Print the ptxas command line as higher VLOG level.
2020-06-08 13:57:12 -07:00
Frederic Bastien
ce806400a4
Print the verbose output of PTXAS.
2020-06-08 13:42:43 -07:00
TensorFlower Gardener
2b623518ef
Merge pull request #40173 from ROCmSoftwarePlatform:google_upstream_rocm_csb_fix_200604_2
...
PiperOrigin-RevId: 315326861
Change-Id: Idd32f299d5ad4b6cb89141b8888bdabb038cbea9
2020-06-08 12:46:16 -07:00
TensorFlower Gardener
e63137a53c
Merge pull request #39956 from bhack:patch-2
...
PiperOrigin-RevId: 314873638
Change-Id: I9fa2b55613c78eb64dec1a40a1f157374498a929
2020-06-04 23:22:55 -07:00
Deven Desai
856f899e4e
[ROCm] Fix for the ROCm CSB breakage - 200604
...
This PR addresses 2 unit test failures introduced by the following commit
bd20260350
failing unit tests
```
//tensorflow/core/common_runtime/gpu:gpu_device_test
//tensorflow/core/common_runtime/gpu:gpu_device_unified_memory_test
```
It is different from PR #40164 , which fixes the build error on the ROCm platform.
The commit above adds unit tests that check the assignment of priority values to GPU streams. Because ROCm support for assigning priority values to GPU streams is missing, those unit tests fail.
This PR/commit adds the missing ROCm support, and updates the unit test to work with AMD GPUs too. The valid priority value range seems to be different for AMD GPUs (0,2) as compared to NVidia GPUs (-1, 0), and hence the changes requrired in the testcases too.
2020-06-05 00:36:29 +00:00
A. Unique TensorFlower
e9ad27a920
Add member typedef value_type
to StatusOr
to be conformant with the
...
standard, e.g. `std::optional`, `std::expected`.
PiperOrigin-RevId: 314649934
Change-Id: Id9e9c70a486254db4221af550f0b3c77440b0fb6
2020-06-03 19:25:02 -07:00
Nathan Luehr
b67608e66c
Plumb TF32 for cublas gemm
2020-06-03 15:47:48 -05:00
Nathan Luehr
0f58bb6309
Plumb TF32 for RNN
2020-06-03 15:47:47 -05:00
Nathan Luehr
dedb51aec2
Convolution TF32 Plumbing
2020-06-03 15:47:47 -05:00
Nathan Luehr
ab6b2ffde3
Removed TENSOR_OP disable env vars.
...
* TF_DISABLE_CUBLAS_TENSOR_OP_MATH
* TF_DISABLE_CUDNN_TENSOR_OP_MATH
* TF_DISABLE_CUDNN_RNN_TENSOR_OP_MATH
2020-06-03 15:47:10 -05:00
Sreeni Kesavarapu
bd20260350
Implement GPU device priority for virtual devices.
...
- This adds a field in virtual device options to specify priority for virtual devices.
- When the priority is specified, it will be used to create the cuda streams for the virtual device with the given priority.
- This is backwards compatible with no priorities specified. When no priorities specified, the current implementation of creating a stream without any priority will continue while any non-zero priorities specified will be used to create streams with that priority.
PiperOrigin-RevId: 314470276
Change-Id: I4943f71e901245fb21b6f7e833adbdcd8126f1fa
2020-06-02 22:31:35 -07:00
Christian Sigg
255f590ab6
Roll PR #39577 (CUDNN v8 support) forward with fix:
...
Handle '64_' prefix in the cuDNN version for Windows.
PiperOrigin-RevId: 314385732
Change-Id: I9b36497dbe460e2e7cad3d755cf9ce8d41647ce3
2020-06-02 12:54:05 -07:00
Zhenyu Tan
5cf536f78d
Rollback PR #39577 : CUDNN v8 support
...
PiperOrigin-RevId: 314219496
Change-Id: I3e36453044e5b73274e05d84f8d7f2cdb5b144ae
2020-06-01 15:57:18 -07:00
TensorFlower Gardener
8b4dd17f2b
Merge pull request #39577 from nluehr:CUDNN8
...
PiperOrigin-RevId: 313914508
Change-Id: Ib186d57303d220a49a0b27d8be591476e69dcfd2
2020-05-30 01:18:08 -07:00
A. Unique TensorFlower
396105ddd5
fix build due to cupti static link.
...
PiperOrigin-RevId: 313880914
Change-Id: I4cf691e60fd28e773f124e440ebb30f054d5f2e9
2020-05-29 17:43:42 -07:00
A. Unique TensorFlower
8c63be3940
Internal change
...
PiperOrigin-RevId: 313850352
Change-Id: I89584b0bcb4409eb74d21e31fb0eb68844186707
2020-05-29 14:31:09 -07:00
A. Unique TensorFlower
13645c7fd2
use static linked cupti internally.
...
PiperOrigin-RevId: 313822331
Change-Id: Ie2c1e4a2d92ed9f92e2dcd8d3bc354546ab4663f
2020-05-29 13:47:38 -07:00
Sreeni Kesavarapu
27ac0a900e
Implement GPU device priority for virtual devices.
...
- This adds a field in virtual device options to specify priority for virtual devices.
- When the priority is specified, it will be used to create the cuda streams for the virtual device with the given priority.
- This is backwards compatible with no priorities specified. When no priorities specified, the current implementation of creating a stream without any priority will continue while any non-zero priorities specified will be used to create streams with that priority.
PiperOrigin-RevId: 313814564
Change-Id: Ie580ea3f8c7fc7248e40e1ae4c898f0a029fba35
2020-05-29 13:15:24 -07:00
TensorFlower Gardener
4a9c6447f8
Merge pull request #39576 from nluehr:CUDA11
...
PiperOrigin-RevId: 313758239
Change-Id: I962e4d81b91f2a1249a3749adb4579440059d6dc
2020-05-29 11:57:26 -07:00
bhack
d38caa0466
add new line
2020-05-28 19:48:52 +02:00
bhack
0d687877e4
Change error level to warning
2020-05-28 19:00:57 +02:00
Christian Sigg
7b495df2da
Add CUDA 11 stub files.
...
PiperOrigin-RevId: 313335318
Change-Id: I6e7717b580540fa2b039057702115fd25c3c54ad
2020-05-26 23:19:19 -07:00
Christian Sigg
65264bfc98
Format generated CUDA stub files.
...
PiperOrigin-RevId: 313334624
Change-Id: Ie7fc91e0c59754b9d9a7db686f3876577c8164e7
2020-05-26 23:10:31 -07:00