How to instruct the model to return proper key-value pairs as JSON, without any other text #154

Open
Dineshkumar-Anandan-ZS0367 opened this issue Apr 26, 2024 · 6 comments

Comments

@Dineshkumar-Anandan-ZS0367

I need to get JSON results from a paragraph that contains key-value pairs, but the Llama 3 Instruct model returns the JSON along with some unwanted text. How do I get a clean answer from the Llama 3 model?

Or is there any other option in code, or a parameter, that I can use to get that result?

@aqib-mirza

If you specify the "format" option and set it to "json", you will get your desired results.
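For reference, a minimal sketch of that idea, under the assumption that the "format" option meant here is the one exposed by the Ollama REST API (where /api/generate accepts "format": "json" to constrain the reply to valid JSON); it is not an argument of the transformers pipeline used later in this thread:

import requests

# Assumption: Llama 3 is being served locally through Ollama on the default port.
response = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama3",
        "prompt": "Extract the key-value pairs from this paragraph as JSON: ...",
        "format": "json",   # ask Ollama to constrain the output to valid JSON
        "stream": False,
    },
    timeout=120,
)
print(response.json()["response"])  # the model's reply, a JSON-formatted string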

@Dineshkumar-Anandan-ZS0367
Author

For the Llama 3 8B Instruct model, how do I use this format parameter? Can you share an example or the related prompt documentation?

@aqib-mirza

Here is some example code:

import torch
import transformers

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.float16},
    device="cuda",
    token="HF-Token",  # your Hugging Face access token
)

messages = [
    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak and returns every answer in JSON format."},
    {"role": "user", "content": "Who are you?"},
]

prompt = pipeline.tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    format="JSON",
)

terminators = [
    pipeline.tokenizer.eos_token_id,
    pipeline.tokenizer.convert_tokens_to_ids("<|eot_id|>"),
]

outputs = pipeline(
    prompt,
    max_new_tokens=256,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.6,
    top_p=0.9,
)
print(outputs[0]["generated_text"][len(prompt):])

@Dineshkumar-Anandan-ZS0367
Author

Thanks a ton sir! I will check this.

@Dineshkumar-Anandan-ZS0367
Author

With the same prompt and the same OCR text from an image, the LLM gives different results on each request. How can I keep the results consistent?

Is there any option for this? I understand that this is an LLM.

Can you suggest some ideas for a prompt to extract key-value pairs from a paragraph?
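One way to make runs repeatable (not stated in this thread, but a standard transformers option) is to fix the random seed or disable sampling entirely; a minimal sketch, reusing the pipeline, prompt, and terminators from the example above:

import transformers

# Option 1: keep sampling but fix the seed so repeated runs draw the same tokens.
transformers.set_seed(42)

# Option 2: turn off sampling; greedy decoding is deterministic for a given
# prompt (up to minor GPU nondeterminism), at some cost in output diversity.
outputs = pipeline(
    prompt,
    max_new_tokens=256,
    eos_token_id=terminators,
    do_sample=False,
)
print(outputs[0]["generated_text"][len(prompt):])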

@Dineshkumar-Anandan-ZS0367
Author

I am getting the same result as before, in spite of using:

prompt = pipeline.tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    format="JSON",
)
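Since the chat template only formats the prompt and does not enforce the output format, a common fallback (not from this thread) is to pull the JSON object out of the generated text and validate it with json.loads; a minimal sketch, where raw_output stands in for the pipeline's generated text:

import json
import re

def extract_json(raw_output: str):
    """Pull the first {...} block out of the model output and parse it.

    Heuristic fallback for when the model wraps its JSON in extra prose;
    assumes the output contains one top-level JSON object.
    """
    match = re.search(r"\{.*\}", raw_output, re.DOTALL)
    if match is None:
        raise ValueError("No JSON object found in model output")
    return json.loads(match.group(0))

# Example: strip the pirate-speak preamble around the JSON payload.
raw_output = 'Arr! Here be yer answer: {"name": "Blackbeard", "role": "pirate chatbot"}'
print(extract_json(raw_output))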
