Streamline sample Colab notebook & avoid (suppress?) errors #66

mdingemanse · 2024-09-04T07:33:14Z

Working through this Colab notebook, I noticed that its output is not yet entirely self-explanatory and that it generates some errors that may throw off beginners:

1. The final CSV also includes gaze; it should have only speech (this cell should have line 5 uncommented).
2. Parsing EAF generates an error (it works, but looks alarming; can this be suppressed or avoided?):
   /usr/local/lib/python3.10/dist-packages/pympi/Elan.py:1471: UserWarning: Parsing unknown version of ELAN spec... This could result in errors...
   warnings.warn('Parsing unknown version of ELAN spec... '
3. Saving the corpus locally (this cell) throws an error:

---------------------------------------------------------------------------

TypeError                                 Traceback (most recent call last)

<ipython-input-12-3af648ec5aa5> in <cell line: 2>()
      1 # Save the corpus as a .csv file locally
----> 2 Dutch_corpus.write_csv(path = "Dutch_corpus.csv")

8 frames

/usr/local/lib/python3.10/dist-packages/sktalk/corpus/write/writer.py in <lambda>(x)
     52         norm = pd.json_normalize(data=metadata, sep="_")
     53         df = pd.DataFrame(norm)
---> 54         df[:] = np.vectorize(lambda x: ', '.join(
     55             x) if isinstance(x, list) else x)(df)
     56         return df

TypeError: sequence item 0: expected str instance, dict found


liesenf · 2024-09-04T09:11:10Z

final CSV also includes gaze, should have only speech

Changed the default; only speech tiers are now selected.

parsing EAF generates an error (it works, but looks alarming — can this be suppressed or avoided?)

The warning message originates from the dependency pympi. I will have to check whether it can be suppressed there. Since it is only a warning, I will address item 3 first.
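Until that is checked, one possible workaround on the notebook side is to filter this specific message with Python's standard `warnings` module. The following is only a sketch: `parse_quietly` and `fake_parse` are hypothetical names used for illustration, and `fake_parse` merely stands in for whatever call ends up triggering the pympi warning.

```python
import warnings


def parse_quietly(parse, *args, **kwargs):
    """Call `parse`, hiding only the 'unknown version of ELAN spec' UserWarning."""
    with warnings.catch_warnings():
        # The filter is local to this block; other warnings still surface.
        warnings.filterwarnings(
            "ignore",
            message="Parsing unknown version of ELAN spec",
            category=UserWarning,
        )
        return parse(*args, **kwargs)


def fake_parse(path):
    # Hypothetical stand-in for the real EAF parser: emits the same warning text.
    warnings.warn("Parsing unknown version of ELAN spec... This could result in errors...")
    return path


result = parse_quietly(fake_parse, "sample.eaf")
```

Because the filter matches on the start of the message text and is scoped to the `with` block, any unrelated warnings raised during parsing would still be shown.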

saving corpus locally throws error

The function write_csv raises a TypeError when a metadata field holds a list whose items are not strings (here, dicts), as the traceback above shows. A proposed solution is ready for review in the linked pull request.
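The failure can be reproduced without the library: `str.join` only accepts strings, so joining a list of dicts raises the error from the traceback. The sketch below shows one way to make the join tolerant by converting each item to a string first. It is an illustration only and not necessarily what the linked pull request does; `join_if_list` and the sample `authors` value are hypothetical.

```python
def join_if_list(value, sep=", "):
    """Join list items into one string, converting non-string items first."""
    if isinstance(value, list):
        return sep.join(str(item) for item in value)
    return value


# Hypothetical metadata value: a list of dicts rather than a list of strings.
authors = [{"name": "A"}, {"name": "B"}]

# The plain join used in the traceback fails on this input.
try:
    ", ".join(authors)
except TypeError as err:
    error_message = str(err)

# The tolerant version returns a single string instead of raising.
flattened = join_if_list(authors)
```

Converting with `str` keeps the CSV writable, at the cost of storing nested metadata as its Python string representation; serializing such fields as JSON would be an alternative if they need to be read back.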

mdingemanse assigned liesenf Sep 4, 2024

mdingemanse changed the title ~~Streamline sample Colba notebook & avoid (suppress?) errors~~ Streamline sample Colab notebook & avoid (suppress?) errors Sep 4, 2024

liesenf linked a pull request Sep 4, 2024 that will close this issue

- updated _metadata_to_df function #70

Merged

mdingemanse closed this as completed in #70 Sep 4, 2024
