Skip to content

Releases: InternLM/InternEvo

InternEvo-v0.5.2dev20240525

25 May 12:49
41545ce
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.5.1dev20240517...v0.5.2dev20240525

InternEvo-v0.5.1dev20240517

17 May 08:37
b46532f
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.5.0dev20240510...v0.5.1dev20240517

InternEvo-v0.5.0dev20240510

10 May 08:30
6dfdb34
Compare
Choose a tag to compare

What's Changed

  • refactor(model): refactor model architecture by @mwiacx in #126

Full Changelog: v0.4.1dev20240510...v0.5.0dev20240510

InternEvo-v0.4.1dev20240510

10 May 08:26
b024cf4
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.4.0dev20240403...v0.4.1dev20240510

InternEvo-v0.4.0dev20240403

03 Apr 12:29
42e4b84
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.3.3dev20240315...v0.4.0dev20240403

InternEvo-v0.3.3dev20240315

15 Mar 02:10
def75dd
Compare
Choose a tag to compare

What's Changed

  • remove dependency of flash_attn when use_flash_attn is set to false by @sallyjunjun in #20
  • fix(transformers): fix parameter error of safe_open in revert scripts by @x54-729 in #74
  • Update version.txt by @sunpengsdu in #81
  • fix(embedding.py): fix flash attn error of llama and internlm2 by @sallyjunjun in #83
  • fix(ckpt): fix load funcs when loading llama & hf_llama by @gaoyang07 in #79
  • Fix missing requirments for NUMA by @Godricly in #80
  • test(workflow): add workflow for norm_weight_test by @kkscilife in #70
  • feat(moe): impl moe with megablock kernel by @blankde in #76

New Contributors

Full Changelog: v0.3.2dev20240313...v0.3.3dev20240315

InternEvo-v0.3.2dev20240313

13 Mar 07:32
fb6a587
Compare
Choose a tag to compare

What's Changed

  • Delete .github/workflows/stale.yml by @del-zhenwu in #66
  • Fix (unitest, interleaved pp and other bugs): re-adapt unitest for isp and adapt interleaved pp for no flash_attention by @li126com in #52
  • feat(model/linear.py): support norm head for model internlm2 by @huangting4201 in #68
  • feat(modeling_internlm2.py): update model type to INTERNLM2_PUBLIC by @huangting4201 in #69
  • feat(ckpt): optimize model checkpointing in Volc and Ali by @zigzagcai in #65
  • fix(communication/isp.py): fix redundant callback and remove head embed hook by @huangting4201 in #72
  • (feat/demo) add internlm2 1.8b config by @00INDEX in #73
  • Feat(QA): temp no fa by @li126com in #75

New Contributors

Full Changelog: v0.3.1dev20240229...v0.3.2dev20240313

InternEvo-v0.3.1dev20240229

29 Feb 02:58
458f1fd
Compare
Choose a tag to compare

What's Changed

  • Feat(QA): check output for no fa by @li126com in #42
  • feat(model): update modeling_internlm2 with configs by @gaoyang07 in #15
  • feat(tests): update ci e2e tests by @huangting4201 in #45
  • fix(moe): fix bugs for moe sequence parallel and memory pool by @blankde in #50
  • fix(optimizer/hybrid_zero_optim.py): fix layer norm grad allreduce when sp is True by @huangting4201 in #53
  • test(workflow): change env into flash2 and add rerun workflow by @kkscilife in #48
  • feat(code-docs): update doc tensor parallel by @huangting4201 in #43
  • feat(parallel_context.py): add gqa process group to allreduce dkv by @huangting4201 in #54
  • fix(context/process_group_initializer.py): fix gqa process group by @huangting4201 in #58
  • feat(*): remove unnecessary communication by @mwiacx in #60
  • Fix(param overlap): fix overlap of broadcasting and computing by @li126com in #46
  • test(ci): add write permissions for actions by @kkscilife in #56
  • feat(model): update model internlm2 by @huangting4201 in #47
  • Feat(QA norm):check norm weights for different ranks by @li126com in #62
  • feat(switch topology): add control switch by @li126com in #55
  • Fix/fix broadcast overlap with isp by @mwiacx in #64
  • fix(QA): fix test_swap_nb_loss_and_gradnorm by @li126com in #63

New Contributors

Full Changelog: v0.3.0dev20240201...v0.3.1dev20240229

InternEvo-v0.2.4-internlm2

07 Feb 14:58
Compare
Choose a tag to compare

InternEvo-v0.3.0dev20240201

01 Feb 08:09
ae5a7ee
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.2.3dev20240201...v0.3.0dev20240201