Can MGP-STR deal with Chinese text? Can I train MGP-STR with Huggingface version? #117

Open
BrianPYChen opened this issue Mar 6, 2024 · 1 comment

Comments

@BrianPYChen

Hi AlibabaResearch,

I have a few questions:

  1. Can MGP-STR handle Chinese text, using either the GitHub code or the Hugging Face version?
  2. Can I train MGP-STR with the Hugging Face version?

Thanks.

@wdp-007
Collaborator

wdp-007 commented Mar 12, 2024

Hi,
MGP-STR currently cannot process Chinese: the model has not been trained on Chinese data, and we have not found an effective method for segmenting Chinese text into words. If you have found one, we would welcome an exchange of ideas.
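
To illustrate the segmentation point, here is a minimal sketch in plain Python. It is not MGP-STR code; `whitespace_segments` and the sample strings are hypothetical and only show why word boundaries that come for free in English are absent in Chinese, which is one plausible reading of the difficulty mentioned above.

```python
from typing import List


def whitespace_segments(text: str) -> List[str]:
    """Split text on whitespace, the simplest possible word segmentation."""
    return text.split()


# Hypothetical sample strings, chosen only for illustration.
english = "scene text recognition"
chinese = "场景文字识别"  # the same phrase in Chinese, written without spaces

# English yields one segment per word.
print(whitespace_segments(english))

# Chinese has no delimiters, so the whole phrase stays a single segment
# and a separate word segmenter would be needed to split it.
print(whitespace_segments(chinese))
```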

The Hugging Face version supports inference only; for training, please follow the instructions in the GitHub repository.
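
For reference, inference with the Hugging Face version might look like the sketch below. It assumes the `transformers`, `torch`, and `Pillow` packages are installed and that the `alibaba-damo/mgp-str-base` checkpoint is the one you want; the `recognize` helper and its argument are hypothetical names introduced here, not part of either codebase.

```python
from typing import List

# Assumed checkpoint name on the Hugging Face Hub.
MODEL_ID = "alibaba-damo/mgp-str-base"


def recognize(image_paths: List[str]) -> List[str]:
    """Run MGP-STR inference on cropped text images and return the decoded strings."""
    # Imported lazily so the module can be loaded without the heavy dependencies.
    import torch
    from PIL import Image
    from transformers import MgpstrForSceneTextRecognition, MgpstrProcessor

    processor = MgpstrProcessor.from_pretrained(MODEL_ID)
    model = MgpstrForSceneTextRecognition.from_pretrained(MODEL_ID)
    model.eval()

    # The processor expects RGB images and handles resizing and normalization.
    images = [Image.open(path).convert("RGB") for path in image_paths]
    pixel_values = processor(images=images, return_tensors="pt").pixel_values

    # Inference only: no gradients are needed.
    with torch.no_grad():
        outputs = model(pixel_values)

    # batch_decode fuses the model's multi-granularity predictions into text.
    return processor.batch_decode(outputs.logits)["generated_text"]
```

Calling `recognize(["word.png"])` with a cropped word image (the file name is a placeholder) returns a list with one predicted string. Chinese input is not expected to decode correctly, for the reasons given above.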
