Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bug: I'm trying to get JAN on avx with ERROR AVX2. #552

Open
mike2003 opened this issue May 9, 2024 · 4 comments
Open

bug: I'm trying to get JAN on avx with ERROR AVX2. #552

mike2003 opened this issue May 9, 2024 · 4 comments
Assignees
Labels
type: bug Something isn't working

Comments

@mike2003
Copy link

mike2003 commented May 9, 2024

Describe the bug
I'm trying to get JAN on avx.
I downloaded nitro-0.3.21-win-amd64-avx-cuda-12-0.tar.gz and extracted it into the jan extender folder.

In log:

2024-05-09T09:41:28.532Z [NITRO]::Debug: Nitro is ready
2024-05-09T09:41:28.533Z [NITRO]::Debug: Loading model with params {"cpu_threads":5,"ctx_len":2048,"prompt_template":"<|system|>\n{system_message}<|user|>\n{prompt}<|assistant|>","llama_model_path":"S:\\JAN\\models\\tinyllama-1.1b\\tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf","system_prompt":"<|system|>\n","user_prompt":"<|user|>\n","ai_prompt":"<|assistant|>","ngl":100}
2024-05-09T09:41:28.542Z [NITRO]::Debug: 94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m\�[93m_�[94m_�[93m\�[94m/�[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m\�[93m_�[94m_�[93m\�[94m/�[93m �[94m �[93m �[94m �[0m
�[0m20240509 09:41:28.323000 UTC 2916 INFO  Nitro version:  - main.cc:50
20240509 09:41:28.324000 UTC 2916 INFO  Server started, listening at: 127.0.0.1:3928 - main.cc:54
20240509 09:41:28.324000 UTC 2916 INFO  Please load your model - main.cc:55
20240509 09:41:28.324000 UTC 2916 INFO  Number of thread is:8 - main.cc:62
20240509 09:41:28.540000 UTC 3768 ERROR AVX2 is not supported by your processor - llamaCPP.cc:546

2024-05-09T09:41:28.551Z [NITRO]::Debug: Load model success with response {}
2024-05-09T09:41:28.555Z [NITRO]::Debug: 20240509 09:41:28.554000 UTC 3508 ERROR Model has not been loaded - llamaCPP.cc:52

2024-05-09T09:41:28.556Z [NITRO]::Debug: Validate model state with response 409
2024-05-09T09:41:28.556Z [NITRO]::Error: Validate model status failed
2024-05-09T09:41:28.556Z [NITRO]::Debug: Validate model state failed with response "Conflict"
2024-05-09T09:41:31.653Z [NITRO]::Debug: Request to kill Nitro

Why i have ERROR AVX2 is not supported by your processor - llamaCPP.cc:546
if i use nitro-0.3.21-win-amd64-avx-cuda-12-0.tar.gz

win10x64/1660super/cudatk12.4

@mike2003 mike2003 added the type: bug Something isn't working label May 9, 2024
@mike2003
Copy link
Author

mike2003 commented May 9, 2024

By the way, version nitro-0.3.22-win-amd64-avx-cuda-12-0.tar.gz doesn't launch at all. Even as a command line.

@mike2003
Copy link
Author

mike2003 commented May 9, 2024

I tried running nitro-0.3.21-win-amd64-avx-cuda-12-0.tar.gz via command line and this is what came out! I don't understand why I'm being asked for AVX2!

λ curl http://localhost:5000/inferences/llamacpp/loadmodel   -H 'Content-Type: application/json'   -d '{"cpu_threads":5,"ctx_len":2048,"prompt_template":"<|system|>\n{system_message}<|user|>\n{prompt}<|assistant|>","llama_model_path":"S:\\JAN\\models\\tinyllama-1.1b\\tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf","system_prompt":"<|system|>\n","user_prompt":"<|user|>\n","ai_prompt":"<|assistant|>","ngl":100}'
{"message":"AVX2 is not supported by your processor, please download and replace the correct Nitro asset version"

@mike2003
Copy link
Author

mike2003 commented May 9, 2024

Last version nitro-0.3.18-win-amd64-avx-cuda-12-0.tar.gz without avx2 error
but exit

20240509 10:09:47.865000 UTC 9568 INFO  Nitro version:  - main.cc:50
20240509 10:09:47.867000 UTC 9568 INFO  Server started, listening at: 127.0.0.1:5000 - main.cc:54
20240509 10:09:47.867000 UTC 9568 INFO  Please load your model - main.cc:55
20240509 10:09:47.867000 UTC 9568 INFO  Number of thread is:8 - main.cc:62
{"timestamp":1715249390,"level":"INFO","function":"LoadModelImpl","line":646,"message":"system info","n_threads":5,"total_threads":8,"system_info":"AVX = 1 | AVX_VNNI = 0 | AVX2 = 1 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | SSSE3 = 0 | VSX = 0 | MATMUL_INT8 = 0 | "}

curl: (56) Recv failure: Connection was reset

@louis-jan
Copy link
Contributor

Last version nitro-0.3.18-win-amd64-avx-cuda-12-0.tar.gz without avx2 error but exit

20240509 10:09:47.865000 UTC 9568 INFO  Nitro version:  - main.cc:50
20240509 10:09:47.867000 UTC 9568 INFO  Server started, listening at: 127.0.0.1:5000 - main.cc:54
20240509 10:09:47.867000 UTC 9568 INFO  Please load your model - main.cc:55
20240509 10:09:47.867000 UTC 9568 INFO  Number of thread is:8 - main.cc:62
{"timestamp":1715249390,"level":"INFO","function":"LoadModelImpl","line":646,"message":"system info","n_threads":5,"total_threads":8,"system_info":"AVX = 1 | AVX_VNNI = 0 | AVX2 = 1 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | SSSE3 = 0 | VSX = 0 | MATMUL_INT8 = 0 | "}

curl: (56) Recv failure: Connection was reset

Could I get the request body?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: bug Something isn't working
Projects
Status: No status
Development

No branches or pull requests

3 participants