Prepare model for deployment to Private Vertex AI endpoint #55

BriianPowell · 2024-04-04T23:00:05Z

Hello, I have a use case where I'd like to deploy this model to a private Vertex AI endpoint. Is there any documentation or literature on how to do that?

pkgoogle · 2024-04-10T18:52:30Z

Hi @BriianPowell, welcome. Does this answer your question? https://cloud.google.com/vertex-ai/generative-ai/docs/open-models/use-gemma Let us know if it does not.

BriianPowell · 2024-04-11T01:06:54Z

Hey there @pkgoogle, thanks for getting back to me. Currently there is no way to deploy the Gemma version from the Model Garden to a private Vertex AI endpoint. I have some constraints on my project where the Vertex AI endpoint needs to be attached to a VPC.

I am thinking about following this guide from one of the links located in that article.

Would this work in conjunction with the image that's being created in this repo or are they totally different things?

pkgoogle · 2024-04-11T17:50:58Z

Hi @BriianPowell, I believe it will work -- can you give it a try and see if you run into any issues? Thanks.

BriianPowell · 2024-04-12T20:30:59Z

@pkgoogle Just thinking here, but my understanding is that the current state of this project doesn't allow hosting the image as an api_server? I think I may have to go the vLLM or Hex-LLM route.

tilakrayal added the type:support Support issues label Apr 24, 2024
