Add documentation for using select TensorFlow ops in TensorFlow Lite

This experimental feature allows the use of certain TensorFlow ops from
within the TensorFlow Lite runtime. Using these ops requires opting in to
TF op usage during model conversion, as well as adding an additional
dependency to the client's target.

See `lite/g3doc/using_select_tf_ops.md` for more details. Note that this
feature is under active development and is still in the experimental
stage.

BUG=113614898
PiperOrigin-RevId: 221836702
# [Experimental] Using TensorFlow Lite with select TensorFlow ops
The TensorFlow Lite builtin op library has grown rapidly, and will continue to
grow, but there remains a long tail of TensorFlow ops that are not yet natively
supported by TensorFlow Lite. These unsupported ops can be a point of friction
in the TensorFlow Lite model conversion process. To that end, the team has
recently been working on an experimental mechanism for reducing this friction.
This document outlines how to use TensorFlow Lite with select TensorFlow ops.

*Note that this feature is experimental and is under active development.* As you
use this feature, keep in mind the [known limitations](#known-limitations), and
please send feedback about models that work and issues you are facing to
tflite@tensorflow.org.

TensorFlow Lite will continue to have
[TensorFlow Lite builtin ops](tf_ops_compatibility.md) optimized for mobile and
embedded devices. However, TensorFlow Lite models can now use a subset of
TensorFlow ops when TFLite builtin ops are not sufficient.

Models converted with TensorFlow ops will require a TensorFlow Lite interpreter
that has a larger binary size than the interpreter with only TFLite builtin ops.
Additionally, performance optimizations will not be available for any TensorFlow
ops in the TensorFlow Lite model.

This document outlines how to [convert](#converting-the-model) and
[run](#running-the-model) a TFLite model with TensorFlow ops on your platform of
choice. It also discusses some [known limitations](#known-limitations), the
[future plans](#future-plans) for this feature, and basic
[performance and size metrics](#metrics).
## Converting the model
To convert a TensorFlow model to a TensorFlow Lite model with TensorFlow ops,
use the `target_ops` argument in the
[TensorFlow Lite converter](https://www.tensorflow.org/lite/convert/). The
following values are valid options for `target_ops`:
* `TFLITE_BUILTINS` - Converts models using TensorFlow Lite builtin ops.
* `SELECT_TF_OPS` - Converts models using TensorFlow ops. The exact subset of
supported ops can be found in the whitelist at
  `lite/toco/tflite/whitelisted_flex_ops.cc`.

The recommended approach is to convert the model with `TFLITE_BUILTINS`, then
with both `TFLITE_BUILTINS,SELECT_TF_OPS`, and finally with only
`SELECT_TF_OPS`. Using both options (i.e. `TFLITE_BUILTINS,SELECT_TF_OPS`)
creates models with TensorFlow Lite ops where possible. Using only
`SELECT_TF_OPS` is useful when the model contains TensorFlow ops that are only
partially supported by TensorFlow Lite, and one would like to avoid those
limitations.

The following example shows how to use `target_ops` in the
[`TFLiteConverter`](https://www.tensorflow.org/lite/convert/python_api) Python
API.
```python
import tensorflow as tf
converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_dir)
converter.target_ops = [tf.lite.OpsSet.TFLITE_BUILTINS,
                        tf.lite.OpsSet.SELECT_TF_OPS]
tflite_model = converter.convert()
open("converted_model.tflite", "wb").write(tflite_model)
```
The following example shows how to use `target_ops` in the
[`tflite_convert`](https://www.tensorflow.org/lite/convert/cmdline_examples)
command line tool.
```sh
tflite_convert \
  --output_file=/tmp/foo.tflite \
  --graph_def_file=/tmp/foo.pb \
  --input_arrays=input \
  --output_arrays=MobilenetV1/Predictions/Reshape_1 \
  --target_ops=TFLITE_BUILTINS,SELECT_TF_OPS
```
When building and running `tflite_convert` directly with `bazel`, please pass
`--define=with_select_tf_ops=true` as an additional argument.
```sh
bazel run --define=with_select_tf_ops=true tflite_convert -- \
  --output_file=/tmp/foo.tflite \
  --graph_def_file=/tmp/foo.pb \
  --input_arrays=input \
  --output_arrays=MobilenetV1/Predictions/Reshape_1 \
  --target_ops=TFLITE_BUILTINS,SELECT_TF_OPS
```
## Running the model
When using a TensorFlow Lite model that has been converted with support for
select TensorFlow ops, the client must also use a TensorFlow Lite runtime that
includes the necessary library of TensorFlow ops.
### Android AAR
A new Android AAR target with select TensorFlow ops has been added for
convenience. Assuming a <a href="./demo_android.md">working TensorFlow Lite
build environment</a>, build the Android AAR with select TensorFlow ops as
follows:
```sh
bazel build --cxxopt='--std=c++11' -c opt \
--config=android_arm --config=monolithic \
//tensorflow/lite/java:tensorflow-lite-with-select-tf-ops
```
This will generate an AAR file in `bazel-genfiles/tensorflow/lite/java/`. From
there, you can either import the AAR directly into your project, or publish the
custom AAR to your local Maven repository:
```sh
mvn install:install-file \
-Dfile=bazel-genfiles/tensorflow/lite/java/tensorflow-lite-with-select-tf-ops.aar \
-DgroupId=org.tensorflow \
-DartifactId=tensorflow-lite-with-select-tf-ops -Dversion=0.1.100 -Dpackaging=aar
```
Finally, in your app's `build.gradle`, ensure you have the `mavenLocal()`
dependency and replace the standard TensorFlow Lite dependency with the one that
has support for select TensorFlow ops:
```
allprojects {
    repositories {
        jcenter()
        mavenLocal()
    }
}

dependencies {
    compile 'org.tensorflow:tensorflow-lite-with-select-tf-ops:0.1.100'
}
```
### iOS
With the Xcode Command Line Tools installed, TensorFlow Lite with select TensorFlow
ops support can be built with the following command:
```sh
tensorflow/contrib/makefile/build_all_ios_with_tflite.sh
```
This will generate the required static linking libraries in the
`tensorflow/contrib/makefile/gen/lib/` directory.

The TensorFlow Lite camera example app can be used to test this. A new
TensorFlow Lite XCode project with support for select TensorFlow ops has been
added to
`tensorflow/lite/examples/ios/camera/tflite_camera_example_with_select_tf_ops.xcodeproj`.

To use this feature in your own project, either clone the example project or
set the project settings for a new or existing project to the following:
* In Build Phases -> Link Binary With Libraries, add the static libraries
  under the `tensorflow/contrib/makefile/gen/lib/` directory:
* `libtensorflow-lite.a`
* `libprotobuf.a`
* `nsync.a`
* In Build Settings -> Header Search Paths, add the following directories:
* `tensorflow/lite/`
* `tensorflow/contrib/makefile/downloads/flatbuffer/include`
* `tensorflow/contrib/makefile/downloads/eigen`
* In Build Settings -> Other Linker Flags, add `-force_load
  tensorflow/contrib/makefile/gen/lib/libtensorflow-lite.a`.

A CocoaPod with support for select TensorFlow ops will also be released in the
future.
### C++
When building TensorFlow Lite libraries using the bazel pipeline, the additional
TensorFlow ops library can be included and enabled as follows:
* Enable monolithic builds if necessary by adding the `--config=monolithic`
build flag.
* Do one of the following:
* Include the `--define=with_select_tf_ops=true` build flag in the `bazel
build` invocation when building TensorFlow Lite.
* Add the TensorFlow ops delegate library dependency to the build
    dependencies: `tensorflow/lite/delegates/flex:delegate`.

Note that the necessary `TfLiteDelegate` will be installed automatically when
creating the interpreter at runtime as long as the delegate is linked into the
client library. It is not necessary to explicitly install the delegate instance
as is typically required with other delegate types.
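
For reference, the sketch below shows the C++ side under these assumptions; the
model filename and the bare-bones error handling are illustrative placeholders,
not part of the TensorFlow Lite docs. The point is that only the standard
interpreter APIs are used, and no flex-specific calls are needed as long as
`tensorflow/lite/delegates/flex:delegate` is linked into the binary as described
above.

```c++
#include <memory>

#include "tensorflow/lite/interpreter.h"
#include "tensorflow/lite/kernels/register.h"
#include "tensorflow/lite/model.h"

int main() {
  // Load a model converted with SELECT_TF_OPS. The filename is a placeholder.
  auto model =
      tflite::FlatBufferModel::BuildFromFile("model_with_tf_ops.tflite");
  if (!model) return 1;

  // Standard interpreter construction. When the flex delegate library is
  // linked into this binary, the TensorFlow ops delegate is applied
  // automatically here; no delegate-specific code is required.
  tflite::ops::builtin::BuiltinOpResolver resolver;
  std::unique_ptr<tflite::Interpreter> interpreter;
  if (tflite::InterpreterBuilder(*model, resolver)(&interpreter) != kTfLiteOk) {
    return 1;
  }

  if (interpreter->AllocateTensors() != kTfLiteOk) return 1;
  // ... populate input tensors ...
  if (interpreter->Invoke() != kTfLiteOk) return 1;
  // ... read output tensors ...
  return 0;
}
```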
### Python pip Package
Python support is actively under development.
## Metrics
### Performance
When using a mixture of both builtin and select TensorFlow ops, all of the same
TensorFlow Lite optimizations and optimized builtin kernels will be available
and usable with the converted model. For the TensorFlow ops, performance should
generally be comparable to that of
[TensorFlow Mobile](https://www.tensorflow.org/lite/tfmobile/).

The following table describes the average time taken to run inference on
MobileNet on a Pixel 2. The listed times are an average of 100 runs. These
targets were built for Android using the flags: `--config=android_arm64 -c opt`.

Build | Time (milliseconds)
------------------------------------ | -------------------
Only built-in ops (`TFLITE_BUILTINS`) | 260.7
Using only TF ops (`SELECT_TF_OPS`) | 264.5
### Binary Size
The following table describes the binary size of TensorFlow Lite for each build.
These targets were built for Android using `--config=android_arm -c opt`.

Build | C++ Binary Size | Android APK Size
--------------------- | --------------- | ----------------
Only built-in ops | 796 KB | 561 KB
Built-in ops + TF ops | 23.0 MB | 8.0 MB
## Known Limitations
The following is a list of some of the known limitations:
* Control flow ops are not yet supported.
* The
[`post_training_quantization`](https://www.tensorflow.org/performance/post_training_quantization)
  flag is currently not supported for TensorFlow ops, so it will not quantize
weights for any TensorFlow ops. In models with both TensorFlow Lite builtin
ops and TensorFlow ops, the weights for the builtin ops will be quantized.
* Ops that require explicit initialization from resources, like `HashTableV2`,
are not yet supported.
* Certain TensorFlow ops may not support the full set of input/output types
that are typically available on stock TensorFlow.
## Future Plans
The following is a list of improvements to this pipeline that are in progress:
* *Selective registration* - There is work being done to make it simple to
generate TFLite interpreter binaries that only contain the TensorFlow ops
required for a particular set of models.
* *Improved usability* - The conversion process will be simplified to only
require a single pass through the converter. Additionally, pre-built Android
AAR and iOS CocoaPod binaries will be provided.
* *Improved performance* - There is work being done to ensure TensorFlow Lite
  with TensorFlow ops has performance parity with TensorFlow Mobile.