[MLIR][KernelGen] Fix performance discrepancy for logical_not kernel
Loads and stores are not successfully vectorized. Disable loop unrolling to increase parallelism.

PiperOrigin-RevId: 346293461
Change-Id: Ibfc360360eacc66c570ea990580fccfedac8c1de
parent b0e5853cf0
commit 40cd0a047d
@@ -289,7 +289,7 @@ gen_kernel_library(
     generate_unranked = True,
     tile_size = "256",
     types = ["i1"],
-    unroll_factors = "4",
+    unroll_factors = "1",
 )
 
 gen_kernel_library(
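
The effect of this change on parallelism can be sketched with simple arithmetic. The snippet below is an illustration only: it assumes that each tile of `tile_size` elements is handled by one GPU thread block and that each thread processes `unroll_factors` elements of that tile. That mapping is an assumption about the kernel generator, not something stated in this commit, and `threads_per_block` is a hypothetical helper name.

```python
def threads_per_block(tile_size: int, unroll_factor: int) -> int:
    """Threads needed per tile if each thread handles `unroll_factor` elements.

    Assumption (not from the commit): one tile maps to one thread block and
    the unroll factor is the per-thread element count.
    """
    if unroll_factor <= 0 or tile_size % unroll_factor != 0:
        raise ValueError("unroll_factor must be positive and divide tile_size")
    return tile_size // unroll_factor


# Values taken from the diff: tile_size = "256", unroll_factors "4" -> "1".
before = threads_per_block(256, 4)
after = threads_per_block(256, 1)
print(f"threads per tile before: {before}, after: {after}")
```

Under these assumptions, an unroll factor of 1 gives every element of the tile its own thread. Since the unrolled loads and stores were not being vectorized, the per-thread unrolling brought no memory-access benefit to offset the reduced thread count.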