This is an example for realtime (~30fps @ RTX 2070 Super) neural style transfer in openFrameworks.
The Python code has been taken from this repo. Neural style transfer can also be done using GANs. However, this method is much faster. Check this post for more information on this topic. Nevertheless, we highly recommend using a GPU.
Make sure to download and extract the folder containing multiple SavedModels from the assets. The easiest way is to use the script provided in ../scripts/download_example_model.sh. The application will by default look for a folder models in bin/data/.
You can download the checkpoints for each model using the bash script ../scripts/download_training_examples.sh.
This example inherits a small problem. The computational graph uses a function which relies on the size of the tensor. This leads to an error when saving the model. We have to specify the input dimensions using the following wrapper:
@tf.function(input_signature=[tf.TensorSpec([1, 480, 640, 3], dtype=tf.float32)])
def model_predict(input_1):
return {'outputs': network(input_1, training=False)}
Note: Run the python script python/checkpoint2SavedModel.py on the downloadable checkpoints to change the input signatures!
For GPU users: by default TensorFlow will try to reserve almost all GPU memory, independent of the model size. We define certain presets to change this behaviour. You can choose between 10 to 90 % reservation and with or without memory growth.
// restrict TensorFlow to reserve only a maximum of 70% of the GPUs memory
// and set memory growth to true
ofxTF2::setGPUMaxMemory(ofxTF2::GPU_PERCENT_70, true);
In this example we will use the ThreadedModel
class and augment the runModel function. This way we can modify the in and outputs inside the thread.
Here, the model expects a 3D tensor (which has no dimension for batches yet) and outputs the values to be displayable without clamping (the neural network applies a weird shift).
class ImageToImageModel : public ofxTF2::ThreadedModel {
public:
// override the runModel function of ofxTF2::ThreadedModel
// this way the thread will take this augmented function
// otherwise it would call runModel with no way of pre/post-processing
cppflow::tensor runModel(const cppflow::tensor & input) const override {
// cast data type and expand to batch size of 1
auto inputCast = cppflow::cast(input, TF_UINT8, TF_FLOAT);
inputCast = cppflow::expand_dims(inputCast, 0);
// call to super
auto output = ofxTF2Model::runModel(inputCast);
// postprocess: last layer = (tf.nn.tanh(x) * 150 + 255. / 2)
return ofxTF2::mapTensorValues(output, -22.5f, 277.5f, 0.0f, 255.0f);
}
};
Note: The default layout for images in TensorFlow is NHWC (batch size, height, width, channel), which is different to openFrameworks' layout (width, height, channel). So an image which is 640 pixels wide and 480 pixels high is given as an tensor of [1, 480, 640, 3]
.
Check this link for more details.