Updated Automatic Speech Recognition using CTC example for Keras v3 #1768

lpizzinidev · 2024-02-18T16:24:34Z

Updates the "Automatic Speech Recognition using CTC" example to support Keras v3.

fchollet

Thanks for the PR!

fchollet · 2024-02-19T01:39:05Z

examples/audio/ctc_asr.py

@@ -244,16 +249,74 @@ def encode_single_sample(wav_file, label):
 """


+# Reference: https://github.com/keras-team/keras/blob/ec67b760ba25e1ccc392d288f7d8c6e9e153eea2/keras/legacy/backend.py#L674-L711
+def ctc_label_dense_to_sparse(labels, label_lengths):


Rather than rewriting this code, you can just use the built-in Keras 3 loss function keras.losses.CTC. I expect it will also enable the code example to run with all backends.

Thanks for the feedback 👍
After removing the legacy code we still have some references to tf in the example and I'm not sure this can be made backend-agnostic.
Please let me know if I should substitute the remaining tf references.

fchollet

LGTM, thank you! You can add the generated files.

fchollet · 2024-02-24T18:12:29Z

examples/audio/ctc_asr.py

@@ -320,7 +307,7 @@ def build_model(input_dim, output_dim, rnn_layers=5, rnn_units=128):
    # Optimizer
    opt = keras.optimizers.Adam(learning_rate=1e-4)
    # Compile the model and return
-    model.compile(optimizer=opt, loss=CTCLoss)
+    model.compile(optimizer=opt, loss=keras.losses.ctc)


Prefer using CTC() (ends up running the same thing but it's more idiomatic)

fchollet · 2024-02-24T18:13:12Z

examples/audio/ctc_asr.py

+    input_length = tf.cast(input_length, tf.int32)
+
+    if greedy:
+        (decoded, log_prob) = tf.nn.ctc_greedy_decoder(


So, we're going to have to use TF for this and ctc_beam_search_decoder I guess, unless we implement them as new backend ops.

Again, thanks for the feedback 👍
I created an issue to address this.
Please let me know if I should change the description or add/remove details.
Thanks!

github-actions · 2024-08-02T01:50:37Z

This PR is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

github-actions · 2024-08-16T01:51:33Z

This PR was closed because it has been inactive for 28 days. Please reopen if you'd like to work on this further.

Updated Automatic Speech Recognition using CTC example for Keras v3

6d9e3f0

github-actions bot assigned sachinprasadhs Feb 18, 2024

fchollet reviewed Feb 19, 2024

View reviewed changes

use keras.losses.CTC function

b11c690

fchollet reviewed Feb 24, 2024

View reviewed changes

lpizzinidev added 2 commits February 25, 2024 15:51

updated autogenerated files

f207998

Merge branch 'master' into ctc-asr-example-v3

b4108fb

sachinprasadhs added the stat:awaiting response from contributor label Jul 18, 2024

github-actions bot added the stale label Aug 2, 2024

github-actions bot closed this Aug 16, 2024

sachinprasadhs reopened this Aug 16, 2024

sachinprasadhs removed stale stat:awaiting response from contributor labels Aug 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated Automatic Speech Recognition using CTC example for Keras v3 #1768

Updated Automatic Speech Recognition using CTC example for Keras v3 #1768

lpizzinidev commented Feb 18, 2024

fchollet left a comment

fchollet Feb 19, 2024

lpizzinidev Feb 24, 2024

fchollet left a comment

fchollet Feb 24, 2024

fchollet Feb 24, 2024

lpizzinidev Feb 25, 2024

github-actions bot commented Aug 2, 2024

github-actions bot commented Aug 16, 2024

Updated Automatic Speech Recognition using CTC example for Keras v3 #1768

Are you sure you want to change the base?

Updated Automatic Speech Recognition using CTC example for Keras v3 #1768

Conversation

lpizzinidev commented Feb 18, 2024

fchollet left a comment

Choose a reason for hiding this comment

fchollet Feb 19, 2024

Choose a reason for hiding this comment

lpizzinidev Feb 24, 2024

Choose a reason for hiding this comment

fchollet left a comment

Choose a reason for hiding this comment

fchollet Feb 24, 2024

Choose a reason for hiding this comment

fchollet Feb 24, 2024

Choose a reason for hiding this comment

lpizzinidev Feb 25, 2024

Choose a reason for hiding this comment

github-actions bot commented Aug 2, 2024

github-actions bot commented Aug 16, 2024