[Question]: Data annotation and pre processing for Relation Extraction #8457

Open
piarosebelledelapaz opened this issue May 16, 2024 · 0 comments
Labels
question Further information is requested


Please ask your question

Hello,

I am trying to do relation extraction on a document, and I have a few questions about the annotation format used to fine-tune the model.

  1. Is multiple linking (1-N relations) possible, and does the model accept it? For example:
  • vaccine X links to 1st date of vaccination
  • vaccine X links to 2nd date of vaccination
    image
  2. What is the format of the generated train/dev/test.txt files? I preprocessed my data, but the generated .txt files contain a lot of content I cannot interpret, so I would like to understand what input format the model actually expects. I made my annotations following the Label Studio guide provided by PaddleNLP, but the contents of the training/validation data files are not clear to me. Here is a sample of the train.txt file I got:
    image

  3. Once the model has been fine-tuned, does it also produce detection and recognition results for the document, or only the relation extraction results? I already have fine-tuned detection and recognition weights from PaddleOCR, and I was wondering whether they could be used with PaddleNLP.
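
To make question 1 concrete, here is a minimal sketch of the kind of 1-N linking I mean. It assumes annotations where each relation is stored as a `from_id`/`to_id` pair between labeled regions, loosely modelled on a Label Studio style export. The ids, label names, relation name and texts are all made up for illustration, and this is not necessarily what PaddleNLP expects as input.

```python
from collections import defaultdict

# Hypothetical annotation result: three labeled regions and two relations.
# Every id, label and relation name here is invented for illustration.
result = [
    {"id": "e1", "type": "labels",
     "value": {"text": "vaccine X", "labels": ["vaccine"]}},
    {"id": "e2", "type": "labels",
     "value": {"text": "1st date of vaccination", "labels": ["date"]}},
    {"id": "e3", "type": "labels",
     "value": {"text": "2nd date of vaccination", "labels": ["date"]}},
    # Two relations share the same source region: this is the 1-N case.
    {"from_id": "e1", "to_id": "e2", "type": "relation",
     "labels": ["vaccinated_on"]},
    {"from_id": "e1", "to_id": "e3", "type": "relation",
     "labels": ["vaccinated_on"]},
]


def group_relations(items):
    """Map each source region's text to the texts of all regions it links to."""
    text_by_id = {
        item["id"]: item["value"]["text"]
        for item in items
        if item["type"] != "relation"
    }
    linked = defaultdict(list)
    for item in items:
        if item["type"] == "relation":
            source = text_by_id[item["from_id"]]
            target = text_by_id[item["to_id"]]
            linked[source].append(target)
    return dict(linked)


links = group_relations(result)
print(links)
```

In this sketch, "vaccine X" ends up linked to both dates, which is the situation I am asking about.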

Any clarification on these points would be very helpful. Thanks in advance!

@piarosebelledelapaz added the "question" (Further information is requested) label May 16, 2024
@piarosebelledelapaz changed the title from "[Question]: Data annotation and pre processing" to "[Question]: Data annotation and pre processing for Relation Extraction" May 16, 2024