Instructions:
1 - Check the contents of Test2.csv to see the innocuous use of "NA" as a category value in the "PD" column.
2 - Open Filebug.ows in Orange3, then open the File widget and load the Test2.csv data.
3 - Open the Data Table to see how the data has been loaded. Scroll down to where you expect to see the "NA" text value and notice that precisely the rows that use "NA" have been modified: "NA" has been replaced with "?".
The behaviour is inconsistent: if you create a smaller table that uses the category value "NA", the import works fine.
What's your environment?
Operating system: macOS Sonoma 14.4.1
Orange version: 3.36
How you installed Orange: from the Orange DMG
"File" and "CSV File Import" both do this. Neither cares whether NA is put in quotation marks. The only difference is that "File" still converts NA to ? even if the "text" type is chosen, whereas "CSV File Import" does not.
If the data is smaller, Orange's auto-detection recognizes PD as a text attribute. We can tweak these rules, but it's guesswork, so it will never be correct in every case.
Categorical and text values use different sets of symbols for missing values. I don't think we can do anything here: we cannot prohibit the value "NA" in text and convert it to missing.
The File widget uses the rules for text variables even if the user manually changes the type to categorical. I suppose this happens because the type is converted after the data is read.
I suppose the latter is why @markotoplak marked this as a bug (and I agree).
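To make the interaction described in the comments above concrete, here is a toy model in Python. It is not Orange's code: the two token sets and the function names `read_column` and `load` are assumptions made up for illustration. The thread only states that categorical and text columns use different sets of missing-value symbols and that the type chosen by the user appears to be applied after the data is read.

```python
# Toy model only. The token sets below are ASSUMPTIONS for illustration,
# not the symbols Orange actually uses.
MISSING_CATEGORICAL = {"", "?", "NA"}  # assumed: "NA" means missing here
MISSING_TEXT = {""}                    # assumed: "NA" is ordinary text here


def read_column(raw, detected_type):
    """Parse raw strings using the missing-value set of the detected type."""
    if detected_type == "categorical":
        missing = MISSING_CATEGORICAL
    else:
        missing = MISSING_TEXT
    return [None if value in missing else value for value in raw]


def load(raw, detected_type, user_type=None):
    """Read with the auto-detected type, then apply the user's type.

    Because the user's type is applied after reading, a value that was
    already turned into missing (None) cannot be restored.
    """
    values = read_column(raw, detected_type)
    final_type = user_type or detected_type
    return final_type, values


raw = ["A", "NA", "B"]
as_text = load(raw, "text")                       # "NA" is kept
forced_text = load(raw, "categorical", "text")    # "NA" is already lost
```

Under these assumptions, `as_text` keeps "NA" while `forced_text` has a missing value in its place, which mirrors the report that changing the type afterwards does not bring the value back.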
What's wrong?
When importing a standard CSV data file with a categorical column, Orange sometimes converts the category text "NA" to unknown, i.e. "?", but not always.
My current workaround is to avoid "NA" and rename the value to "NX", which works fine.
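For anyone who wants to apply the same workaround without editing the file by hand, here is a minimal sketch using only Python's standard `csv` module. The helper name `rename_value` and the output file name in the usage comment are hypothetical. The column name "PD" and the values "NA" and "NX" come from this report.

```python
import csv


def rename_value(src, dst, column="PD", old="NA", new="NX"):
    """Copy CSV rows from src to dst, replacing `old` with `new` in `column`."""
    reader = csv.DictReader(src)
    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames,
                            lineterminator="\n")
    writer.writeheader()
    for row in reader:
        # Only exact matches in the chosen column are renamed.
        if row[column] == old:
            row[column] = new
        writer.writerow(row)


# Usage (the output file name is hypothetical):
# with open("Test2.csv", newline="") as src, \
#         open("Test2_fixed.csv", "w", newline="") as dst:
#     rename_value(src, dst)
```

This assumes a comma-separated file with a single header row. It does not change how Orange detects missing values; it only sidesteps the "NA" token before import.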
How can we reproduce the problem?
Filebug.ows.zip
Test2.csv