-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Access to Undefined Reference #122
Comments
This issue could potentially be related to Issue #120. There could potentially be a string column that fails to parse a delimiter, putting a whole whack of data into a single cell, causing the rest of the columns. |
If I iterate through the file with no batch size specified, I get an inexact error (trying to convert a NaN to Int32)
|
I just verified the fix to Issue #120, that fix doesn't fix this problem unfortunately so this issue is still open. |
i also get exactly this |
I can only partially iterate from a file created by parquet-mr. I can iterate through it once, but trying to do this a second time yields
Note that NamedTuple{...} is abridged becasue the actual tuple is a massive long list of the entire file schema. I can't give you the original file for this one, but I wouldn't be surprised if it has something to do with initializing a mutable type with #undef and failing to populate it. There could be sizable gaps in data for some of the columns. Note that it was created by parquet-mr
Parquet file: Input/input_data.parquet
version: 1
nrows: 4887400
created by: parquet-mr version 1.9.0 (build 38262e2c80015d0935dad20f8e18f2d6f9fbd03c)
cached: 157 column chunks
The text was updated successfully, but these errors were encountered: