do einops' operations account for contiguous memory layout? #88

Open
CDitzel opened this issue Nov 16, 2020 · 5 comments
Labels
question (Further information is requested)

Comments

CDitzel commented Nov 16, 2020

Upon heavy reshaping and dimension manipulation, it is necessary from time to time to call .contiguous() on the resulting tensors to straighten out the memory layout. Does einops account for this automatically? I don't see a call to .contiguous() anywhere in the examples.
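
To illustrate with a minimal sketch (toy shapes, nothing einops-specific yet):

import torch

x = torch.randn(2, 3, 4)
y = x.permute(0, 2, 1)        # permute returns a non-contiguous view
print(y.is_contiguous())      # False
# y.view(2, 12)               # would raise a RuntimeError on the non-contiguous view
y = y.contiguous()            # explicit copy into a contiguous buffer
print(y.view(2, 12).shape)    # torch.Size([2, 12])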

arogozhnikov added the question label Nov 17, 2020
arogozhnikov (Owner) commented

Good question!

Let's assume we are talking in pytorch terms.
It is common to use .contiguous() before calling view in pytorch; however, einops replaces view (e.g. with rearrange).

Einops operations can work with any memory layout and internally flatten the array if that's required.
However, when possible, einops returns a view of the original tensor, not a copy (in agreement with how the built-in reshape/transpose work).

There is, afaik, no need to provide contiguous tensors to other operations (besides view), but if for some reason it is required, you can always use .contiguous().
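
A minimal sketch of what this means in practice (toy shapes; this reflects the behaviour of the torch backend as far as I have checked, not a guarantee of the API):

import torch
from einops import rearrange

x = torch.randn(2, 3, 4)

# a plain transposition: einops can return a view sharing memory with x
y = rearrange(x, 'b h w -> b w h')
print(y.data_ptr() == x.data_ptr())   # True: same underlying storage, no copy
print(y.is_contiguous())              # False: it is a strided view

# if a downstream op needs a contiguous tensor, the copy is made explicitly
z = y.contiguous()
print(z.is_contiguous())              # True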

arogozhnikov (Owner) commented

I'll keep this question open for reference

arogozhnikov reopened this Nov 17, 2020

CDitzel commented Feb 18, 2021

So I recently had the problem that pytorch would throw me a warning like

[W accumulate_grad.h:170] Warning: grad and param do not obey the gradient layout contract. This is not an error, but may impair performance.
grad.sizes() = [1, 64, 1, 1], strides() = [64, 1, 1, 1]
param.sizes() = [1, 64, 1, 1], strides() = [64, 1, 64, 64] (function operator())

and the line which caused this warning to be issued was

constituents = rearrange(constituents, "b (h w) d -> b d h w", h=h, w=w)

it was gone after I changed it to

constituents = rearrange(constituents, "b (h w) d -> b d h w", h=h, w=w).contiguous()

So I thought you'd want to know that. Maybe it is an option to always call contiguous within einops?
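
For reference, a minimal sketch that reproduces the non-contiguous output of this rearrange pattern (the shapes below are made up, not the ones from my warning):

import torch
from einops import rearrange

b, h, w, d = 2, 4, 4, 8                     # made-up shapes
constituents = torch.randn(b, h * w, d)

out = rearrange(constituents, 'b (h w) d -> b d h w', h=h, w=w)
print(out.is_contiguous())                  # False: reshape + permute comes back as a strided view
print(out.contiguous().is_contiguous())     # True: the explicit copy fixes the layout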

arogozhnikov (Owner) commented

Hi @CDitzel,

Maybe it is an option to always call contiguous within einops?

All operations try to incur minimal overhead in terms of memory and time; returning views when possible is the right policy (e.g. transpose in pytorch/numpy works the same way). Always returning a contiguous copy would result in higher memory consumption even when it is not necessary.

and the line which caused this warning to be issued was

The line that actually causes the warning is likely the next operation: that op is not optimized for a non-contiguous layout, but it does not call .contiguous() for the same reason einops does not - to avoid overhead.

Let me state it plainly - everything works as it should:

  1. pytorch reports a potentially slow spot
  2. adding .contiguous(), as you did, is the right approach: downstream code wants a contiguous array, so you make this explicit additional step to provide it.

You may also want to check that speed actually improves after this addition. If not, you can skip .contiguous() and save that memory.
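
A rough sketch of such a check (the shapes and the downstream op are placeholders; a real check should time your actual training step):

import time
import torch
from einops import rearrange

x = torch.randn(64, 32 * 32, 256)                 # placeholder workload

def step(make_contiguous):
    y = rearrange(x, 'b (h w) d -> b d h w', h=32, w=32)
    if make_contiguous:
        y = y.contiguous()
    return (y * 2).sum()                          # stand-in for the downstream op

for flag in (False, True):
    start = time.perf_counter()
    for _ in range(50):
        step(flag)
    print(f'contiguous={flag}: {time.perf_counter() - start:.3f} s')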

The warning you observe is new to me, but it's good news that pytorch has started reporting potentially non-optimal places. How uninformative the warning is, though, is disappointing.


CDitzel commented Feb 23, 2021

Thank you for your reply. I agree that this topic should be kept open for future reference.
