Error While Running the Synthesis Command #1
Comments
Hello! Several members of our team are also running into the same issue when running the Synthesize command.
Can you paste the full backtrace? We need to see the CenSyn code that is ultimately causing the errors. You may need to set the number of cores in the synthesis config file to 1 to see all the errors.
Seems like a pandas issue. What version are you using? In the future you can copy/paste the error text directly in here and format it as a code block.
You might also try forgoing the parquet conversion and using CSV as the input (the latest CenSyn version supports this). To do so, change references to .parquet in the synthesis config to .csv.
Hi Rolando! I'm not sure about the other team but I am using Python 3.8.17. I tried using the CSV as the input but I am still getting the error mentioned above. Here's the full error:
What version of the pandas package are you using?
Ah sorry! The pandas version is 2.0.3.
That's probably the problem. I am on 1.3.3, so we're a major version off. Can you try a conda environment that has python=3.10 and pandas=1.3.3 as I have?
What version of numpy do you have? I seem to be running into a problem with that when I designate the pandas version as 1.3.3. |
Was able to recreate with a new environment with pandas=2.0.3, so it's likely a major version issue. Forcing pandas < 2.0.0 in the conda create step should probably fix things.
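For anyone who wants to confirm which side of the 2.0.0 boundary their environment is on before running the synthesis, a minimal sketch of a version check follows. The function names are hypothetical and are not part of CenSyn; the check only compares the major version number, on the assumption that the breakage discussed here starts at pandas 2.0.0.

```python
from importlib import metadata


def major_version(version: str) -> int:
    """Return the leading integer of a version string such as '2.0.3'."""
    return int(version.split(".")[0])


def pandas_is_pre_2(version: str) -> bool:
    """True when the given pandas version string is below 2.0.0."""
    return major_version(version) < 2


def check_installed_pandas() -> str:
    """Describe whether the installed pandas (if any) is below 2.0.0."""
    try:
        installed = metadata.version("pandas")
    except metadata.PackageNotFoundError:
        return "pandas is not installed"
    status = "ok" if pandas_is_pre_2(installed) else "2.x or newer, may hit this issue"
    return f"pandas {installed}: {status}"


if __name__ == "__main__":
    print(check_installed_pandas())
```

Running this inside the conda environment used for CenSyn reports the installed pandas version, so it can be compared against the versions mentioned in this thread (1.3.3 working, 2.0.3 failing).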
The issue stems from "CalculateModel" models that are used to create new variables that are groupby means of other variables.
Removing variables that use these models from the configs will get you past this particular error. Pandas changed the default argument settings for some of the groupby functions, so that may be the source of the error. After removing the culprit variables, you will still get an error related to pandas no longer accepting sets as column listings. I will be making a pull request that fixes this and allows things to run.
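The two pandas behaviours mentioned above can be illustrated with a small standalone sketch. This is not CenSyn's code: the data, the column names, and the `group_mean` helper are all hypothetical. It assumes the relevant changes are the ones known from pandas 2.0, where groupby `mean` no longer silently drops non-numeric columns and a set is no longer accepted as a column indexer. Selecting the numeric columns explicitly, as a list, avoids both on pandas 1.x and 2.x.

```python
import pandas as pd

# Hypothetical data: one numeric column and one string column.
df = pd.DataFrame(
    {
        "state": ["A", "A", "B"],
        "income": [10.0, 30.0, 50.0],
        "label": ["x", "y", "z"],
    }
)


def group_mean(frame, by, value_cols):
    """Group means over an explicit collection of numeric columns.

    value_cols may be a set; it is converted to a sorted list because
    pandas 2.x rejects a set as a column indexer.
    """
    cols = sorted(value_cols)
    # Selecting columns before .mean() keeps non-numeric columns such as
    # "label" out of the aggregation entirely.
    return frame.groupby(by)[cols].mean()


means = group_mean(df, "state", {"income"})
print(means)
```

Listing the columns explicitly keeps the result independent of the `numeric_only` default, which is the kind of default that differs between major pandas versions.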
I see, thank you Rolando! I was actually also able to run it using your versions of pandas and numpy, but with Python 3.8.
Great! Can you see if the changes I made in the pull request still allow things to run on 3.8? If so then I feel fine with merging them into the main branch.
Yup! I re-ran it with those changes and it looks like it ran fine with Python 3.8.
Okay, leaving this open to give others a chance to test.
Hi! I wanted to flag an error I've been running into while attempting to run the synthesize command ($ censyn --s synthesize.cfg) on my computer. I've attached a few screenshots below that show the issue; they are part of a much larger error message. It seems to be related to non-numeric data being present in the data file. Let me know if you have any insight into why this could be happening or need any more information from me! Thank you!