early stop when all sequence reach EOS #57

je1lee · 2024-04-09T05:49:06Z

With model.generate() it takes too long even sequence generation have done earlier with EOS token. Because now, it generates til it reached to output_len

fix the generate method to stop when every sequence has generated EOS token

je1lee · 2024-04-16T03:57:37Z

@pengchongjin any idea for this?

pengchongjin · 2024-05-29T16:08:18Z

Thanks for the change. Could you please paste a few example outputs before and after this change?

Also please make sure to test both run.py and run_xla.py. Thanks!

je1lee · 2024-06-03T06:03:03Z

@pengchongjin
test done with both scripts

BEFORE

model generates token regardless of eos token, so time spent in generation increases quadratically as output_len increases

AFTER

model stop generate when model samples out eos token time spent in generation remain still as output_len increases

je1lee and others added 4 commits April 9, 2024 05:43

fix: early stop when all sequence reach EOS

55a1c73

style: tab in line

488e5f2

fix: (xla) early stop when all sequence reach EOS

815a0c9

Merge branch 'google:main' into fix/earlystop

e1d6092

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

early stop when all sequence reach EOS #57

early stop when all sequence reach EOS #57

je1lee commented Apr 9, 2024 •

edited

je1lee commented Apr 16, 2024

pengchongjin commented May 29, 2024

je1lee commented Jun 3, 2024 •

edited

early stop when all sequence reach EOS #57

Are you sure you want to change the base?

early stop when all sequence reach EOS #57

Conversation

je1lee commented Apr 9, 2024 • edited

je1lee commented Apr 16, 2024

pengchongjin commented May 29, 2024

je1lee commented Jun 3, 2024 • edited

je1lee commented Apr 9, 2024 •

edited

je1lee commented Jun 3, 2024 •

edited