One possible way to reduce latency is to use a bigger machine, since Concrete makes very good use of parallelism: we measured around 40 seconds of inference time for that model on AWS hpc7a instances.
A second possible approach, depending on the use case, is to optimize your model by making it smaller or pruning it; structured pruning in particular is worth looking into. See this section of the documentation about such techniques: some are already used in the CIFAR example that you link.
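To illustrate what structured pruning looks like in practice, here is a minimal sketch using PyTorch's built-in `torch.nn.utils.prune` module. The tiny convolutional model is purely illustrative, not the actual CIFAR Brevitas network from the linked example, and the 50% pruning amount is an arbitrary choice for demonstration:

```python
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

# Illustrative toy model; the real CIFAR network is larger and quantized.
model = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.Conv2d(16, 32, kernel_size=3, padding=1),
)

# Structured pruning along dim=0 zeroes entire output filters (by L2 norm),
# which shrinks the effective computation rather than just scattering zeros.
for module in model.modules():
    if isinstance(module, nn.Conv2d):
        prune.ln_structured(module, name="weight", amount=0.5, n=2, dim=0)
        prune.remove(module, "weight")  # make the pruning permanent

# Half of the first layer's 16 filters are now entirely zero.
first_conv = model[0]
zero_filters = (first_conv.weight.abs().sum(dim=(1, 2, 3)) == 0).sum().item()
print(zero_filters)  # 8
```

Note that zeroed filters still occupy their tensor slots; to actually speed up inference you would then rebuild the model with the pruned channels physically removed before compiling it.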
Hi andrei, thanks for your quick reply. I am using a high-end workstation with 64 GB of RAM and a 24-core processor.
I will take a look at pruning.
Thanks
Hi, I have found this community very helpful. I have a question about inference time.
I am running the following use-case example:
https://github.com/zama-ai/concrete-ml/blob/main/use_case_examples/cifar/cifar_brevitas_training/evaluate_one_example_fhe.py
The issue is that FHE inference on a single image took more than 28 minutes.
Is there a way to optimize it or reduce the inference time?