[DRAFT] New data classes #233

Closed
wants to merge 108 commits into from


Gautzilla
Contributor

This PR follows issue #228.

It is based on #216, so it is currently many commits ahead of main; this should resolve once #216 is merged.

It aims to create base classes for representing data and datasets, from which specialized classes can be derived for audio, spectrum, auxiliary, or any other data we use in OSEkit.

This should help separate concerns and reuse code, and hopefully greatly reduce the size of the old Dataset and Spectrogram classes.
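
To illustrate the idea, here is a minimal sketch of a base data class with one specialized subclass. The names `BaseData`, `AudioData`, `sample_rate` and `nb_samples` are assumptions made for this example and are not necessarily the classes or attributes introduced by this PR.

```python
from __future__ import annotations

from dataclasses import dataclass, replace
from datetime import datetime, timedelta


@dataclass
class BaseData:
    """Hypothetical base class: any piece of data spanning a time interval."""

    begin: datetime
    end: datetime

    @property
    def duration(self) -> timedelta:
        return self.end - self.begin

    def split(self, nb_parts: int) -> list[BaseData]:
        """Split the data into nb_parts consecutive parts of equal duration."""
        step = self.duration / nb_parts
        # dataclasses.replace keeps the concrete subclass and its extra fields.
        return [
            replace(self, begin=self.begin + i * step, end=self.begin + (i + 1) * step)
            for i in range(nb_parts)
        ]


@dataclass
class AudioData(BaseData):
    """Hypothetical specialization: only audio-specific concerns live here."""

    sample_rate: int = 48_000

    @property
    def nb_samples(self) -> int:
        return round(self.duration.total_seconds() * self.sample_rate)
```

With this layout, the time-interval logic is written once in the base class and inherited by every specialized class, which is the kind of code reuse described above.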

This PR only creates the classes; their integration will be done later on.

@Gautzilla Gautzilla added the data format Work related to spectrogram/audio format and how to process it label Dec 3, 2024
@Gautzilla Gautzilla requested a review from mathieudpnt December 3, 2024 16:32
@Gautzilla Gautzilla self-assigned this Dec 3, 2024
Gautzilla and others added 27 commits January 16, 2025 10:05
This PR fixes the concatenation of multiple NPZ files to form a larger spectrogram, which is how LTAS were computed in the legacy OSEkit.

It introduces the split() method for SpectroData objects. It differs from AudioData.split() in that it forces the chunk boundaries to fall on frames on which a window of the SFT is centered. This is required to reconstruct an SFT from SFT parts, as shown in the "It is possible to calculate the SFT of signal parts:" section.
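
The boundary alignment can be sketched as follows. This is an illustration only, not the actual SpectroData.split() implementation: the function name `aligned_split_frames` is hypothetical, and it assumes that SFT windows are centered on sample indices that are multiples of the hop size.

```python
def aligned_split_frames(nb_samples: int, hop: int, nb_chunks: int) -> list[int]:
    """Return nb_chunks + 1 chunk boundaries (in samples).

    Interior boundaries are snapped to the nearest multiple of hop, i.e. to
    frames on which a window is assumed to be centered. The first and last
    boundaries stay at the signal edges.
    """
    if nb_chunks < 1 or hop < 1:
        raise ValueError("nb_chunks and hop must be positive")
    # Evenly spaced boundaries, as a plain time-based split would produce.
    raw = [i * nb_samples / nb_chunks for i in range(nb_chunks + 1)]
    # Snap each boundary to the closest window center.
    snapped = [round(b / hop) * hop for b in raw]
    snapped[0], snapped[-1] = 0, nb_samples
    return snapped


boundaries = aligned_split_frames(nb_samples=1000, hop=64, nb_chunks=4)
print(boundaries)  # [0, 256, 512, 768, 1000]
```

An even split would cut at 250, 500 and 750 samples, none of which is a window center for a hop of 64. Snapping moves the cuts to 256, 512 and 768, so chunks are no longer of strictly equal length.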

This PR also improves the accuracy of the SpectroFile objects' begin/end timestamps by writing the timestamps in the npz file, rather than computing them in a pretty questionable way as was done before 🥸.
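
As a rough sketch of that approach, the timestamps can be stored next to the spectrum matrix as ISO 8601 strings and parsed back on load. The function names and the `sx`, `begin` and `end` keys are assumptions made for this example; the keys actually used by SpectroFile may differ.

```python
import io
from datetime import datetime

import numpy as np


def save_spectro(file, sx: np.ndarray, begin: datetime, end: datetime) -> None:
    """Write the spectrum matrix and its begin/end timestamps to an npz file."""
    # Timestamps are stored as ISO 8601 strings (hypothetical key names).
    np.savez(file, sx=sx, begin=begin.isoformat(), end=end.isoformat())


def load_spectro(file) -> tuple[np.ndarray, datetime, datetime]:
    """Read back the matrix and the exact timestamps that were written."""
    with np.load(file) as npz:
        sx = npz["sx"]
        begin = datetime.fromisoformat(npz["begin"].item())
        end = datetime.fromisoformat(npz["end"].item())
    return sx, begin, end


# Round trip through an in-memory buffer instead of a file on disk.
buffer = io.BytesIO()
save_spectro(buffer, np.ones((3, 4)), datetime(2024, 12, 3, 16, 32), datetime(2024, 12, 3, 16, 33))
buffer.seek(0)
sx, begin, end = load_spectro(buffer)
```

Reading the timestamps from the file avoids having to infer them from other metadata such as the file name or the number of frames.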
@Gautzilla Gautzilla closed this Feb 12, 2025
@Gautzilla Gautzilla deleted the new-data-classes branch February 12, 2025 08:23
@Gautzilla Gautzilla mentioned this pull request Feb 12, 2025