
Preload model for the ollama provider #1190

Open · 1 of 2 tasks
sgwhat opened this issue Apr 26, 2024 · 2 comments
Labels: enhancement (New feature or request)

Comments

sgwhat commented Apr 26, 2024

Validations

  • I believe this is a way to improve. I'll try to join the Continue Discord for questions
  • I'm not able to find an open issue that requests the same enhancement

Problem

I've noticed that when I chat with the ollama provider, the model is loaded during the first round of conversation, which makes the first round much slower than subsequent ones.

I'm exploring ways to have ollama preload the model. Even though I ran ollama run llama2:latest before starting a conversation, the model still loads at the start of the first round.
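For illustration, the cold-start gap described above can be measured by timing two identical requests back to back; a minimal sketch in TypeScript (Node 18+ for the built-in fetch), assuming an Ollama server at the default localhost:11434 with llama2:latest already pulled:

```typescript
// Time two identical non-streaming generate requests: the first includes
// the model load, the second hits a model that is already in memory.
async function timedGenerate(prompt: string): Promise<number> {
  const start = Date.now();
  const res = await fetch("http://localhost:11434/api/generate", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model: "llama2:latest", prompt, stream: false }),
  });
  await res.text(); // wait for the full (non-streamed) response body
  return Date.now() - start;
}

console.log("first round (ms):", await timedGenerate("Hello"));
console.log("second round (ms):", await timedGenerate("Hello"));
```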

Solution

No response

sgwhat added the enhancement (New feature or request) label Apr 26, 2024
sestinj (Contributor) commented Apr 26, 2024

@sgwhat https://github.com/ollama/ollama/blob/main/docs/api.md#load-a-model

This might be a good solution

sgwhat (Author) commented Apr 26, 2024

I have tried that, but it still doesn't work.
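For context, the linked API doc describes loading a model into memory by sending a generate request that names the model but omits the prompt. A minimal sketch of that preload call in TypeScript (Node 18+ for the built-in fetch), assuming an Ollama server at the default localhost:11434; the preloadModel helper name is hypothetical:

```typescript
// Preload a model into memory without generating any tokens, per
// ollama/docs/api.md#load-a-model: a generate request with no prompt
// loads the model and returns without producing a completion.
async function preloadModel(model: string): Promise<void> {
  const res = await fetch("http://localhost:11434/api/generate", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    // keep_alive: -1 asks Ollama to keep the model in memory indefinitely
    // (by default it is unloaded after about five minutes of inactivity).
    body: JSON.stringify({ model, keep_alive: -1 }),
  });
  if (!res.ok) {
    throw new Error(`Preload failed: ${res.status} ${res.statusText}`);
  }
}

// Warm up the model before the first chat round.
await preloadModel("llama2:latest");
```

One possible explanation for a preload appearing not to work: Ollama's default keep_alive unloads a model roughly five minutes after the last request, so a model preloaded too early may already be evicted by the time the first chat round starts.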
