More robust seedname.wout parser #1

AntimoMarrazzo · 2017-07-17T20:28:15Z

We should use regular expression instead of split&strip to parse x y z and the spread at each iteration.

giovannipizzi · 2017-10-13T13:44:00Z

I think it's a bit more complex than that... the problem is that sometimes there are no spaces between coordinates!! (for long numbers starting with a minus)

What about adding a new output to W90 (if it's not already in the tb_parameters) with the centers spreads etc instead of trying to parse the wout?

greschd · 2017-10-13T16:59:30Z

Is there a good library to produce structured output (csv, xml, json, yaml) from Fortran? If so, I would argue for using that when creating a new output file. Otherwise we'll have to write yet another output parser..

giovannipizzi · 2017-10-13T17:29:21Z

Maybe things like YAML are simple enough to be written even without a library? (I hope...)

greschd · 2017-10-13T21:12:07Z

Or a CSV with just index, x, y, z, spread. Of course since it's per iteration it would also make sense to have a nested list, which is more suited for YAML.

after the comma (for many-digit numbers). Not perfect, but good workaround (also for #1) that does not require to change Wannier90.

giovannipizzi · 2019-11-01T14:10:06Z

I would consider this issue more general, and improve the names and parsing of the output.
We can then move the issue of creating a more machine-readable file in the Wannier90 repository, and when the feature is released implement its parsing from here.

giovannipizzi · 2019-11-01T14:10:54Z

@normarivano could you report here the table that we started discussing, with the suggested changes to the parser key names, for discussion with the others?

normarivano · 2020-02-11T11:21:44Z

I attach hereafter the table for the parsing of the standard output file. We propose some changes for the keys' names to be implemented within this release. Can you give me some feedback as soon as possible? Thank´s a lot.
@greschd @giovannipizzi @AntimoMarrazzo @qiaojunfeng

Keys in raw_wout_parser (i.e. seedname.wout parsing)
Suggested parsed key	Current key	Description	Where in the seedname.wout
warnings	warnings	List of warnings	General
number_wfs	number_wannier_functions	Number of Wannier functions.	MAIN
length_units	length_units	Units used to express the lengths, if not Ang we will have an additional warning in the list warnings¨.	MAIN
output_verbosity	output_verbosity	Level of verbosity of the output from 0 (low) to 3 (high). Default value is 1 and parsing is supported only in this case.	MAIN
convergence_tolerance	wannierize_convergence_tolerance	The convergence tolerance to find the final spread	WANNIERIZE
r2_mn_writeout	r2_nm_writeout	Boolean variable. If true, a file seedname.r2mn is written.	WANNIERIZE
xyz_wf_centres_writeout	xyz_wf_center_writeout	Boolean variable. If true, a file seedname_centres.xyz is written in xyz format for further visualization (jmol, vmd etc.).	WANNIERIZE
Omega_I	Omega_I	Gauge invariant spread.	Final State
Omega_D	Omega_D	Diagonal part of gauge-dependent spread.	Final State
Omega_OD	Omega_OD	Off-diagonal port of gauge-dependent spread.	Final State
Omega_total	Omega_total	Sum of the total gauge-dependent spread and gauge-invariant spread.	Final State
wf_ids	wannier_function	List of ordinal numbers identifying each Wannier function.	Final State
im_re_ratio	im_re_ratio	List of the ratio between the imaginary and real part of the each Wannier function in the same order given by wf_ids above.	Final State
wf_centres	coordinates	List of the coordiantes (x,y,z) of the centres of each Wannier function in the same order given by wf_ids and im_re_ratio above.	Final State
wf_spreads	spread	List of the spreads associated to each Wannier function in the same order given by wf_ids, im_re_ratio and wf_centres.	Final State

normarivano · 2020-02-11T11:27:49Z

I´m also going to open an other issue (not in Milestones) in which we can discuss additional information to parse for the future. Me and @qiaojunfeng already started discussing that.
#70

greschd · 2020-02-11T13:17:52Z

Looks all good to me 👍

giovannipizzi · 2020-02-12T23:06:52Z

Thanks a lot! Looks good to me as well. Very good also to go back to UK spelling considering this is what the W90 code uses.

Just a couple of comments of things I would change:

r2_mn_writeout -> r2mn_writeout (there is no underscore neither in the name of the flag in the w90 input (write_r2mn), neither in the file extension)
xyz_wf_centres_writeout -> xyz_writeout (similar reason as above, for consistency (e.g. the W90 flag is called simply write_xyz).

AntimoMarrazzo · 2020-02-13T13:15:23Z

All good for me - I close the issue.

giovannipizzi · 2020-02-21T08:42:51Z

I reopen it - let's close it when we have the PR that changes the names

AntimoMarrazzo assigned AntimoMarrazzo and giovannipizzi and unassigned AntimoMarrazzo Oct 13, 2017

AntimoMarrazzo added the enhancement label Oct 13, 2017

giovannipizzi added a commit that referenced this issue Jan 31, 2018

Fixed bug on parsing coordinates of WFs when there is no space

8474ece

after the comma (for many-digit numbers). Not perfect, but good workaround (also for #1) that does not require to change Wannier90.

giovannipizzi mentioned this issue Jan 31, 2018

Fixed bug on parsing coordinates of WFs when there is no space #21

Merged

giovannipizzi added this to the Version 2.0 release milestone Nov 1, 2019

normarivano self-assigned this Nov 8, 2019

AntimoMarrazzo closed this as completed Feb 13, 2020

giovannipizzi reopened this Feb 21, 2020

normarivano mentioned this issue Feb 21, 2020

Parser key names improvements #78

Merged

greschd closed this as completed in #78 Feb 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More robust seedname.wout parser #1

More robust seedname.wout parser #1

AntimoMarrazzo commented Jul 17, 2017

giovannipizzi commented Oct 13, 2017

greschd commented Oct 13, 2017

giovannipizzi commented Oct 13, 2017

greschd commented Oct 13, 2017 •

edited

Loading

giovannipizzi commented Nov 1, 2019

giovannipizzi commented Nov 1, 2019

normarivano commented Feb 11, 2020

normarivano commented Feb 11, 2020

greschd commented Feb 11, 2020

giovannipizzi commented Feb 12, 2020

AntimoMarrazzo commented Feb 13, 2020

giovannipizzi commented Feb 21, 2020

More robust seedname.wout parser #1

More robust seedname.wout parser #1

Comments

AntimoMarrazzo commented Jul 17, 2017

giovannipizzi commented Oct 13, 2017

greschd commented Oct 13, 2017

giovannipizzi commented Oct 13, 2017

greschd commented Oct 13, 2017 • edited Loading

giovannipizzi commented Nov 1, 2019

giovannipizzi commented Nov 1, 2019

normarivano commented Feb 11, 2020

normarivano commented Feb 11, 2020

greschd commented Feb 11, 2020

giovannipizzi commented Feb 12, 2020

AntimoMarrazzo commented Feb 13, 2020

giovannipizzi commented Feb 21, 2020

greschd commented Oct 13, 2017 •

edited

Loading