weight_diff.py state_dict_recovered[key].add_(state_dict_raw[key]) RuntimeError: The size of tensor a (32001) must match the size of tensor b (32000) at non-singleton dimension 0 #304

gaodexiaozheng · 2023-11-22T08:11:54Z

When running the command below:
python weight_diff.py recover --path_raw /models/Llama-2-7b-hf --path_diff /models/alpaca-7b-wdiff --path_tuned ./llama-alpaca-7b-hf

it fails with this error:
RuntimeError: The size of tensor a (32001) must match the size of tensor b (32000) at non-singleton dimension 0

I understand what the error means, but it should still be fixed.

boyue-jiang · 2023-11-24T20:21:39Z

I encountered the same problem. Looking at the source code, I found that it relates to the tensor shapes in model.state_dict. You should compare the state_dict shapes of the raw model and the model diff.
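
A minimal sketch of that shape check, for anyone who wants to see which keys disagree before add_ is called. The helper name find_shape_mismatches, the FakeTensor stand-in, and the key names and sizes in the demo are all hypothetical and only for illustration. The function is not part of weight_diff.py. It only assumes that each value exposes a .shape attribute, as tensors in a state_dict do.

```python
from collections import namedtuple


def find_shape_mismatches(state_dict_raw, state_dict_diff):
    """Return {key: (raw_shape, diff_shape)} for shared keys whose shapes differ."""
    mismatches = {}
    # Only keys present in both dicts can be added together.
    for key in sorted(set(state_dict_raw) & set(state_dict_diff)):
        raw_shape = tuple(state_dict_raw[key].shape)
        diff_shape = tuple(state_dict_diff[key].shape)
        if raw_shape != diff_shape:
            mismatches[key] = (raw_shape, diff_shape)
    return mismatches


# Hypothetical stand-in for a tensor so the demo runs without loading a model.
FakeTensor = namedtuple("FakeTensor", ["shape"])

# Illustrative key names and sizes (assumptions, not read from real checkpoints).
raw = {
    "model.embed_tokens.weight": FakeTensor((32000, 4096)),
    "model.norm.weight": FakeTensor((4096,)),
}
diff = {
    "model.embed_tokens.weight": FakeTensor((32001, 4096)),
    "model.norm.weight": FakeTensor((4096,)),
}

for key, (raw_shape, diff_shape) in find_shape_mismatches(raw, diff).items():
    print(f"{key}: raw {raw_shape} vs diff {diff_shape}")
```

With real checkpoints you would pass the two state_dict objects instead of the stand-ins. A difference of exactly one row in dimension 0, as in the error above, would be consistent with the diff having been built against a vocabulary with one extra token, but nothing in this thread confirms that.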

Irenehere · 2024-01-11T08:02:55Z

I have the same problem. Any idea how to solve this error?
