PAD token missing? #150

Closed · omkar-12bits opened this issue May 6, 2024 · 2 comments


omkar-12bits commented May 6, 2024

I tried using eos_token, unk_token, and bos_token as the pad token, with both left and right padding sides, but whenever the number of padding tokens increases, the outputs are pure garbage.
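For reference, a minimal sketch of the batched-generation setup being described, using the Hugging Face transformers API. The checkpoint id is an assumption (the thread only names Mistral and Mixtral); the details that usually matter are reusing EOS as the pad token, padding on the left for a decoder-only model, and passing the attention mask to generate:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"  # assumed checkpoint, for illustration
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Mistral ships no dedicated PAD token; reusing EOS is the common workaround.
tokenizer.pad_token = tokenizer.eos_token
# Decoder-only models should be padded on the left so generation
# continues directly from the last real prompt token.
tokenizer.padding_side = "left"

prompts = ["Short prompt.", "A much longer prompt that forces the short one to be padded."]
inputs = tokenizer(prompts, padding=True, return_tensors="pt").to(model.device)

# The attention_mask returned by the tokenizer tells the model to ignore
# pad positions; forgetting to pass it is a common cause of garbage output.
outputs = model.generate(
    **inputs,
    max_new_tokens=50,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```

If right padding or a missing attention mask is used instead, the degradation described above is the expected symptom.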

omkar-12bits (Author) commented

After trying this on both Mistral and Mixtral, it's clear that padding doesn't work well with these models.
I was just playing with prompts at inference time and saw that padding makes generation worse.
If inference on batches doesn't work well, then how does it perform during training? Shouldn't that also produce garbage?
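On the training question: padding is usually harmless during training because pad positions are excluded twice, from attention via attention_mask and from the loss via labels set to -100 (the ignore_index of PyTorch's CrossEntropyLoss). A minimal sketch, assuming the same EOS-as-pad workaround and the same assumed checkpoint id:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")  # assumed checkpoint
tokenizer.pad_token = tokenizer.eos_token  # EOS-as-pad workaround

batch = tokenizer(
    ["short example", "a much longer training example that forces padding"],
    padding=True,
    return_tensors="pt",
)

labels = batch["input_ids"].clone()
# Mask every pad position so it contributes nothing to the loss;
# -100 is the index CrossEntropyLoss ignores.
labels[batch["attention_mask"] == 0] = -100
batch["labels"] = labels

# model(**batch) would now compute the loss over real tokens only, which
# is why padding that hurts generation can still be fine for training.
```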

geronimi73 commented
Can you post the code you use, please?
