Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve parameters, logging, examples files and feature detection for mzmml statistics. #47

Merged
merged 33 commits into from
Mar 10, 2025
Merged
Show file tree
Hide file tree
Changes from 19 commits
Commits
Show all changes
33 commits
Select commit Hold shift + click to select a range
d6a4a6b
improve logging.
ypriverol Mar 6, 2025
21f43f0
major changes in ms structure to include precursors.
ypriverol Mar 7, 2025
2cc7f86
minor change
ypriverol Mar 7, 2025
647b9c3
minor change
ypriverol Mar 7, 2025
68ac536
changes from stream mzml reading to sequential
ypriverol Mar 7, 2025
9dd2e1f
changes from stream mzml reading to sequential
ypriverol Mar 7, 2025
23024d2
changes from stream mzml reading to sequential
ypriverol Mar 8, 2025
de2130c
changes from stream mzml reading to sequential
ypriverol Mar 8, 2025
89ae75a
changes from stream mzml reading to sequential
ypriverol Mar 8, 2025
32914fe
changes from stream mzml reading to sequential
ypriverol Mar 8, 2025
4c88bb8
changes from stream mzml reading to sequential
ypriverol Mar 8, 2025
15ed5d7
changes from stream mzml reading to sequential
ypriverol Mar 8, 2025
68d9e89
changes from stream mzml reading to sequential
ypriverol Mar 8, 2025
2b06c6a
changes from stream mzml reading to sequential
ypriverol Mar 6, 2025
c8c49f0
Merge remote-tracking branch 'origin/dev' into dev
ypriverol Mar 8, 2025
e959970
fixed the scan capture
ypriverol Mar 9, 2025
ec92528
remove files parquet tests
ypriverol Mar 9, 2025
2af3899
tests changed
ypriverol Mar 9, 2025
384c787
tests changed
ypriverol Mar 9, 2025
066a17b
recover tests needed
ypriverol Mar 9, 2025
443e15c
recover tests needed
ypriverol Mar 9, 2025
5740d85
move id_only -> ms2_file parameter
ypriverol Mar 10, 2025
85ebdff
fix test.
ypriverol Mar 10, 2025
2069808
fix test.
ypriverol Mar 10, 2025
8b0054e
fix test.
ypriverol Mar 10, 2025
e8227a4
update documentation for mzmlstastistics
ypriverol Mar 10, 2025
a514c16
update documentation for mzmlstastistics
ypriverol Mar 10, 2025
c9f1a42
update documentation for mzmlstastistics
ypriverol Mar 10, 2025
ef5aeaf
update documentation for mzmlstastistics
ypriverol Mar 10, 2025
caa6bce
id_only -> ms2_file parameter; improve documentation
ypriverol Mar 9, 2025
51efff5
Merge remote-tracking branch 'origin/dev' into dev
ypriverol Mar 10, 2025
b6afaf7
Fix small bug with Scan comparisons
ypriverol Mar 10, 2025
b4ef7ee
Fix small bug with Scan comparisons
ypriverol Mar 10, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -166,3 +166,6 @@ cython_debug/
/tests/test_data/diann2mztab/RD139_Narrow_UPS1_0_1fmol_inj2.mzML
/tests/test_data/diann2mztab/RD139_Narrow_UPS1_0_25fmol_inj1.mzML
/tests/test_data/diann2mztab/RD139_Narrow_UPS1_0_25fmol_inj2.mzML

.qodo
/tests/test_data/RD139_Narrow_UPS1_0_1fmol_inj1.mzML
5 changes: 5 additions & 0 deletions .markdownlint.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
{
"MD033": {
"allowed_elements": ["details", "summary"]
}
}
15 changes: 15 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,6 +32,21 @@ The following functionalities are available in the package:
- `psmconvert` - The convert_psm function converts peptide spectrum matches (PSMs) from an idXML file to a CSV file, optionally filtering out decoy matches. It extracts and processes data from both the idXML and an associated spectra file, handling multiple search engines and scoring systems.
- `mzmlstats` - The `mzmlstats` processes mass spectrometry data files in either `.mzML` or `Bruker .d` formats to extract and compile statistics about the spectra. It supports generating detailed or ID-only CSV files based on the spectra data.

#### mzml statistics

quantms-utils have multiple scripts to generate mzML stats. These files are used by multiple tools and packages within quantms ecosystem for quality control, mzTab generation, etc. Here are some details about the formats, the fields they contain and gow they are computed.

<details>
<summary>MS info and details</summary>

`mzmlstats` allows the user to produce a file containing all features for every signal in the MS/MS experiment. The produced file is a parquet file, with the original name of the file plus the following postfix `{file_name}_ms_info.parquet`. Here, the definition of each column and how they are estimated and used:

- `scan`: The scan accession for each MS and MS/MS signal in the mzML, depending on the manufacturer, the scan will have different formats. Example, for thermo (e.g `controllerType=0 controllerNumber=1 scan=43920`). We tried to find the definition of [quantms.io](https://github.com/bigbio/quantms.io/blob/main/docs/README.adoc#scan).
- `ms_level`: The MS level of the signal, 1 for MS and 2 for MS/MS.
- `num_peaks`: The number of peaks in the MS. Compute with pyopenms with `spectrum.get_peaks()`.

</details>

## Contributions and issues

Contributions and issues are welcome. Please, open an issue in the [GitHub repository](https://github.com/bigbio/quantms) or PR in the [GitHub repository](https://github.com/bigbio/quantms-utils).
2 changes: 1 addition & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@ name = "quantms-utils"
description = "Python scripts and helpers for the quantMS workflow"
readme = "README.md"
license = "MIT"
version = "0.0.19"
version = "0.0.20"
authors = [
"Yasset Perez-Riverol <[email protected]>",
"Dai Chengxin <[email protected]>",
Expand Down
2 changes: 1 addition & 1 deletion quantmsutils/__init__.py
Original file line number Diff line number Diff line change
@@ -1 +1 @@
__version__ = "0.0.19"
__version__ = "0.0.20"
Loading
Loading