Issue with text_classification_classical_text_explainer.ipynb #182

Open
fatosismali opened this issue Nov 24, 2021 · 3 comments

Comments

@fatosismali

When executing the following cell:
classifier, best_params = explainer.fit(X_train, y_train)

It results in the following error:
ValueError: empty vocabulary; perhaps the documents only contain stop words

I am using the same dataset as in the example notebook and haven't changed anything in the code.
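For context, scikit-learn's text vectorizers raise this message when no tokens survive tokenization and stop-word filtering, so the vocabulary ends up empty. A minimal sketch for checking whether a tokenizer returns anything for a few sample documents is below. The helper name documents_without_tokens and both stand-in tokenizers are hypothetical; they are not part of the notebook or the explainer's API.

```python
from typing import Callable, Iterable, List


def documents_without_tokens(
    tokenizer: Callable[[str], Iterable[str]],
    documents: List[str],
) -> List[int]:
    """Return the indices of documents for which the tokenizer yields no tokens."""
    return [i for i, doc in enumerate(documents) if not list(tokenizer(doc))]


# Stand-in tokenizers for illustration only (assumptions, not the notebook's code).
def whitespace_tokenizer(text: str) -> List[str]:
    return text.split()


def broken_tokenizer(text: str) -> List[str]:
    # Mimics a tokenizer that silently drops every token.
    return []


sample = ["the movie was great", "terrible plot"]
print(documents_without_tokens(whitespace_tokenizer, sample))
print(documents_without_tokens(broken_tokenizer, sample))
```

If every index comes back for the tokenizer actually in use, the vectorizer has nothing to build a vocabulary from, which would be consistent with the error above.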

@imatiach-msft
Collaborator

I had a similar issue. Installing an older version of the spacy package (2.3.7) from PyPI fixed it. It looks like the tokenizer code needs to be updated to work with the latest spacy.
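A small sketch for checking which spacy version is installed before trying the downgrade. It assumes Python 3.8+ (for importlib.metadata) and assumes the breakage starts with spacy 3.x; the thread only confirms that 2.3.7 works. The helper names are hypothetical.

```python
from importlib import metadata
from typing import Optional


def major_version(version: str) -> int:
    """Return the leading integer of a version string such as '2.3.7'."""
    return int(version.split(".")[0])


def installed_spacy_version() -> Optional[str]:
    """Return the installed spacy version string, or None if spacy is absent."""
    try:
        return metadata.version("spacy")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_spacy_version()
    if version is None:
        print("spacy is not installed")
    elif major_version(version) >= 3:
        # Assumption: 3.x is the line affected; 2.3.7 is the version reported working.
        print(f"spacy {version} found; consider: pip install spacy==2.3.7")
    else:
        print(f"spacy {version} found")
```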

@imatiach-msft
Collaborator

See related issue: #176

@Kushalamummigatti

Hi, is this issue solved? I am facing the same error, ValueError: empty vocabulary; perhaps the documents only contain stop words, when trying to use explainer.fit(text_train, y_train_encoded) for classification.


3 participants