
Optimization Plans for Conv2D CPU Execution #1130

Open
abdussamettrkr opened this issue May 16, 2024 · 3 comments

@abdussamettrkr (Contributor)

Hello, I wonder whether there are any plans to optimize Conv2D CPU execution. I assume MLX currently uses a naive implementation?

@awni (Member) commented May 20, 2024

We don't have an immediate plan for this. I'm sure there is potential to make CPU convolutions faster. I'm curious, though: why not use the GPU instead?

@briancpark

Has there been any attempt or discussion to port the BNNS conv into MLX? It's listed as a TODO.
I've looked into it personally, but I'm noticing some limitations in bridging the BNNS API and MLX. For one, they prefer different data layouts: I believe BNNS prefers NCHW inputs and OIHW weights, whereas MLX uses NHWC and OHWI. I assume the latter were chosen because implementing the Metal variant was the priority, and those layouts have better performance properties on the GPU.
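
For illustration, here is a minimal sketch (the shapes are made up) of the layout round-trip such a bridge would need on the MLX side; the BNNS call itself is elided:

```python
import mlx.core as mx

# MLX convention: inputs are NHWC, conv weights are OHWI.
x_nhwc = mx.random.normal((1, 32, 32, 3))   # (N, H, W, C), hypothetical shape
w_ohwi = mx.random.normal((16, 3, 3, 3))    # (O, H, W, I), hypothetical shape

# BNNS prefers NCHW inputs and OIHW weights, so a bridge would
# transpose on the way in...
x_nchw = mx.transpose(x_nhwc, (0, 3, 1, 2))  # (N, C, H, W)
w_oihw = mx.transpose(w_ohwi, (0, 3, 1, 2))  # (O, I, H, W)

# ... run the BNNS convolution here (elided) ...

# ... and transpose the NCHW result back to MLX's channels-last layout:
# y_nhwc = mx.transpose(y_nchw, (0, 2, 3, 1))
```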

For me, the answer to "why not use the GPU instead" is that the CPU could be more efficient. I've seen cases in CoreML where CNNs compiled for GPU-only perform the same as, or only marginally better than, CNNs compiled for CPU-only. The CPU might also be more energy efficient, though I never confirmed that.

@awni (Member) commented May 22, 2024

I don't think we've benchmarked the convs in BNNS, but it would be interesting to see how they perform. There would have to be a copy in and out, since everything in MLX assumes channels-last, so that might hinder performance.

Regarding efficiency, the CPU will likely be faster (with a good implementation) for smaller models, and the GPU for larger models. As for power efficiency, I don't know how they stack up or how that changes with scale; a very good question.
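
For anyone who wants to probe that small-vs-large crossover on their own machine, here is a rough timing sketch using MLX streams (the problem sizes are arbitrary, and it measures speed only, not power):

```python
import time
import mlx.core as mx

def bench_conv(device, n=1, hw=64, c_in=64, c_out=64, iters=50):
    # NHWC input and OHWI weights, per MLX's conventions.
    x = mx.random.normal((n, hw, hw, c_in))
    w = mx.random.normal((c_out, 3, 3, c_in))
    mx.eval(x, w)  # materialize inputs before timing
    tic = time.perf_counter()
    for _ in range(iters):
        y = mx.conv2d(x, w, stream=device)
        mx.eval(y)  # force MLX's lazy evaluation
    return (time.perf_counter() - tic) / iters

# Try a small and a larger problem on each device.
for hw in (32, 256):
    cpu_t = bench_conv(mx.cpu, hw=hw)
    gpu_t = bench_conv(mx.gpu, hw=hw)
    print(f"hw={hw}: cpu {cpu_t * 1e3:.2f} ms, gpu {gpu_t * 1e3:.2f} ms")
```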
