
About OLLAMA_PARALLEL split the max context length #4079

Open
DirtyKnightForVi opened this issue May 1, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@DirtyKnightForVi

What is the issue?

I encountered this while testing SQL QA against an extremely large table, where I put all of the DDL into the system prompt.

With OLLAMA_PARALLEL=4, the model appears to understand only the last 4000 tokens of the DDL. This is quite different from my previous experience. My web UI is Open WebUI; it can set num_ctx to 16000, but that has no effect.

BUT after changing to OLLAMA_PARALLEL=1, the model can understand the whole DDL!

So is max_num_ctx = 16000 / OLLAMA_PARALLEL? (16000 / 4 = 4000, which would match the roughly 4000-token window I observed.) Does this apply even when the machine is otherwise idle?
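For anyone trying to reproduce this, here is a minimal sketch of my test (the endpoint is the Ollama default; the model name and DDL placeholder are assumptions standing in for my real setup). Start the server once with OLLAMA_PARALLEL=4 and once with OLLAMA_PARALLEL=1, send the same oversized prompt with num_ctx set to 16000 each time, and compare how much of the DDL the model can recall:

```python
import requests

OLLAMA_URL = "http://localhost:11434/api/generate"  # default Ollama endpoint

# Placeholder DDL dump, sized well past 4000 tokens so any truncation
# of the earlier tables is easy to spot in the answer.
ddl = "\n".join(f"CREATE TABLE table_{i} (id INT, value TEXT);" for i in range(2000))

resp = requests.post(
    OLLAMA_URL,
    json={
        "model": "llama3",  # assumed model name; substitute your own
        "prompt": ddl + "\n\nWhat is the name of the first table defined above?",
        "stream": False,
        "options": {"num_ctx": 16000},  # request the full 16k context
    },
    timeout=600,
)
print(resp.json()["response"])
```

With OLLAMA_PARALLEL=1 the model answers about table_0; with OLLAMA_PARALLEL=4 it only seems to see the tables near the end, as if each slot got 16000/4 of the context.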

OS

Linux

GPU

Nvidia

CPU

Intel

Ollama version

0.1.33-RC5

@DirtyKnightForVi DirtyKnightForVi added the bug Something isn't working label May 1, 2024