You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
lines 59 to 64 in schema.py. code checks if file columns are subset of self.get_column_names() but in Exception printing difference between columns and self.columns so it is always shows that all Columns are different
if set(columns).issubset(self.get_column_names()):
columns_to_pair = [column for column in self.columns if column.name in columns]
else:
raise PanSchArgumentError(
'Columns {} passed in are not part of the schema'.format(set(columns).difference(self.columns))
)
The text was updated successfully, but these errors were encountered:
Probably related to the issue above. In my case schema.validate(test_data) always returns Invalid number of columns. The schema specifies 21, but the data frame has 22 even thought test_data actually has 21 columns.
Probably related to the issue above. In my case schema.validate(test_data) always returns Invalid number of columns. The schema specifies 21, but the data frame has 22 even thought test_data actually has 21 columns.
yes that exactly what is the issue. I will write a sample code and will post it later today
lines 59 to 64 in schema.py. code checks if file columns are subset of self.get_column_names() but in Exception printing difference between columns and self.columns so it is always shows that all Columns are different
The text was updated successfully, but these errors were encountered: