Unknown y_true tensor #9

deploy-soon · 2020-10-18T06:58:38Z

Hi, I'm trying to use categorical_focal_loss in my image segmentation task. The dataset is a tf.data.Dataset object and the model is a Keras Model. The model is compiled like this:

loss_gamma = [0.5, 1., ...]
model.compile(
    optimizer=tf.keras.optimizers.Adam(lr=lr),
    loss=SparseCategoricalFocalLoss(gamma=loss_gamma),
...)
model.fit(...)

While training the segmentation model, an exception is raised because the shape of the y_true tensor is unknown:
https://github.com/artemmavrin/focal-loss/blob/master/src/focal_loss/_categorical_focal_loss.py#L136-L141
How should I define the y_true tensor? In my task, it has shape (BATCH, HEIGHT, WIDTH). My virtual environment runs Ubuntu 18.04 with TensorFlow 2.2.0.

artemmavrin · 2020-10-18T07:07:16Z

Hi @deploy-soon can you please share a minimal example that replicates the error?

deploy-soon · 2020-10-21T09:44:48Z

Here is a short code sample for the segmentation task. When generating labels, I add some noise and apply random crops to the images and labels, so I use a map_label wrapper for the dataset.

import numpy as np
import tensorflow as tf
from focal_loss import SparseCategoricalFocalLoss

def map_label(x):
    def wrapper(param):
        # add some noise with np
        field = np.zeros((640, 480))
        return field
    return tf.py_function(func=wrapper, inp=[x], Tout=tf.float32)

ipt = tf.zeros([100, 640, 480, 3], dtype=tf.dtypes.float32)
images = tf.data.Dataset.from_tensor_slices(ipt)
labels = tf.data.Dataset.range(100).map(map_label)
dataset = tf.data.Dataset.zip((images, labels)).batch(2)

images = tf.keras.Input(shape=(640, 480, 3), name="ipt")
xs = tf.keras.layers.Conv2D(20, (3, 3), padding="same")(images)
labels = tf.keras.layers.Activation("softmax", name="opt")(xs)

model = tf.keras.Model(inputs=images, outputs=labels)
model.compile(optimizer=tf.keras.optimizers.Adam(lr=0.01),
              loss=SparseCategoricalFocalLoss(gamma=1.0),
              metrics=[tf.keras.metrics.SparseCategoricalAccuracy()])
model.fit(dataset, epochs=1)

When you run this code with TensorFlow 2.2.0, you should see a NotImplementedError.
@artemmavrin

artemmavrin · 2020-11-01T06:30:35Z

Sorry for the delay. I'm able to replicate your error.

It looks like TensorFlow can't infer the rank of the values in the labels dataset. SparseCategoricalFocalLoss needs the ground truth tensor rank to be statically known for its reshaping logic:

focal-loss/src/focal_loss/_categorical_focal_loss.py

Lines 137 to 147 in 9e023de

    y_true_rank = y_true.shape.rank
    if y_true_rank is None:
        raise NotImplementedError('Sparse categorical focal loss not supported '
                                  'for target/label tensors of unknown rank')
    reshape_needed = (y_true_rank is not None and y_pred_rank is not None and
                      y_pred_rank != y_true_rank + 1)
    if reshape_needed:
        y_true = tf.reshape(y_true, [-1])
        y_pred = tf.reshape(y_pred, [-1, y_pred_shape[-1]])
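
To see the unknown rank directly, the label pipeline can be inspected without training anything. This is only a minimal sketch, assuming the same map_label wrapper as in the example above; label_rank and batched_rank are names introduced here for illustration.

```python
import numpy as np
import tensorflow as tf

def map_label(x):
    def wrapper(param):
        # Stand-in for the real label generation done with NumPy
        return np.zeros((640, 480), dtype=np.float32)
    # tf.py_function declares only the output dtype (Tout), not the shape
    return tf.py_function(func=wrapper, inp=[x], Tout=tf.float32)

labels = tf.data.Dataset.range(100).map(map_label)

# The static shape of each element is fully unknown, so its rank is None
label_rank = labels.element_spec.shape.rank

# Batching does not recover the rank either
batched_rank = labels.batch(2).element_spec.shape.rank

print(label_rank, batched_rank)
```

A rank of None here is what reaches the `y_true_rank is None` check quoted above.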

A workaround that seems to work is to manually force the label shape to be known:

import numpy as np
import tensorflow as tf
from focal_loss import SparseCategoricalFocalLoss

def map_label(x):
    def wrapper(param):
        # add some noise with np
        field = np.zeros((640, 480))
        return field
    return tf.py_function(func=wrapper, inp=[x], Tout=tf.float32)

ipt = tf.zeros([100, 640, 480, 3], dtype=tf.dtypes.float32)
images = tf.data.Dataset.from_tensor_slices(ipt)
labels = tf.data.Dataset.range(100).map(map_label)
labels = labels.map(lambda label: tf.reshape(label, [640, 480]))  # New line
dataset = tf.data.Dataset.zip((images, labels)).batch(2)

images = tf.keras.Input(shape=(640, 480, 3), name="ipt")
xs = tf.keras.layers.Conv2D(20, (3, 3), padding="same")(images)
labels = tf.keras.layers.Activation("softmax", name="opt")(xs)

model = tf.keras.Model(inputs=images, outputs=labels)
model.compile(optimizer=tf.keras.optimizers.Adam(lr=0.01),
              loss=SparseCategoricalFocalLoss(gamma=1.0),
              metrics=[tf.keras.metrics.SparseCategoricalAccuracy()])
model.fit(dataset, epochs=1)

deploy-soon · 2020-12-21T12:12:49Z

Finally, I found a hint for solving this error. Since the output shape of tf.py_function is not fixed, it has to be set explicitly. In the sparse categorical scheme, the static shape of y_true can be set from y_pred as below, before the rank of the true tensor is checked.

y_true.set_shape(y_pred.get_shape()[:3])
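
The same idea can also be applied on the dataset side, so the loss itself is left untouched. The following is a rough sketch under the assumption that every label has the fixed size (640, 480) used in the example above; HEIGHT, WIDTH, and batched_shape are illustrative names, not part of focal_loss.

```python
import numpy as np
import tensorflow as tf

HEIGHT, WIDTH = 640, 480  # assumed fixed label size, taken from the example

def map_label(x):
    def wrapper(param):
        # Stand-in for the real label generation done with NumPy
        return np.zeros((HEIGHT, WIDTH), dtype=np.float32)
    label = tf.py_function(func=wrapper, inp=[x], Tout=tf.float32)
    # Attach the static shape that tf.py_function cannot infer
    label.set_shape([HEIGHT, WIDTH])
    return label

labels = tf.data.Dataset.range(4).map(map_label).batch(2)

# The batched labels now have a known rank of 3: (batch, HEIGHT, WIDTH)
batched_shape = labels.element_spec.shape
print(batched_shape.rank)
```

Unlike tf.reshape, set_shape adds no op to the graph; it only refines the static shape information, so it is valid only if the labels really have that shape.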

@artemmavrin
