bug: I'm trying to get JAN on avx with ERROR AVX2. #552

mike2003 · 2024-05-09T09:45:10Z

Describe the bug
I'm trying to get JAN on avx.
I downloaded nitro-0.3.21-win-amd64-avx-cuda-12-0.tar.gz and extracted it into the jan extender folder.

In log:

2024-05-09T09:41:28.532Z [NITRO]::Debug: Nitro is ready
2024-05-09T09:41:28.533Z [NITRO]::Debug: Loading model with params {"cpu_threads":5,"ctx_len":2048,"prompt_template":"<|system|>\n{system_message}<|user|>\n{prompt}<|assistant|>","llama_model_path":"S:\\JAN\\models\\tinyllama-1.1b\\tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf","system_prompt":"<|system|>\n","user_prompt":"<|user|>\n","ai_prompt":"<|assistant|>","ngl":100}
2024-05-09T09:41:28.542Z [NITRO]::Debug: 94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m\�[93m_�[94m_�[93m\�[94m/�[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m �[93m �[94m\�[93m_�[94m_�[93m\�[94m/�[93m �[94m �[93m �[94m �[0m
�[0m20240509 09:41:28.323000 UTC 2916 INFO  Nitro version:  - main.cc:50
20240509 09:41:28.324000 UTC 2916 INFO  Server started, listening at: 127.0.0.1:3928 - main.cc:54
20240509 09:41:28.324000 UTC 2916 INFO  Please load your model - main.cc:55
20240509 09:41:28.324000 UTC 2916 INFO  Number of thread is:8 - main.cc:62
20240509 09:41:28.540000 UTC 3768 ERROR AVX2 is not supported by your processor - llamaCPP.cc:546

2024-05-09T09:41:28.551Z [NITRO]::Debug: Load model success with response {}
2024-05-09T09:41:28.555Z [NITRO]::Debug: 20240509 09:41:28.554000 UTC 3508 ERROR Model has not been loaded - llamaCPP.cc:52

2024-05-09T09:41:28.556Z [NITRO]::Debug: Validate model state with response 409
2024-05-09T09:41:28.556Z [NITRO]::Error: Validate model status failed
2024-05-09T09:41:28.556Z [NITRO]::Debug: Validate model state failed with response "Conflict"
2024-05-09T09:41:31.653Z [NITRO]::Debug: Request to kill Nitro

Why i have ERROR AVX2 is not supported by your processor - llamaCPP.cc:546
if i use nitro-0.3.21-win-amd64-avx-cuda-12-0.tar.gz

win10x64/1660super/cudatk12.4

The text was updated successfully, but these errors were encountered:

mike2003 · 2024-05-09T09:46:15Z

By the way, version nitro-0.3.22-win-amd64-avx-cuda-12-0.tar.gz doesn't launch at all. Even as a command line.

mike2003 · 2024-05-09T09:56:17Z

I tried running nitro-0.3.21-win-amd64-avx-cuda-12-0.tar.gz via command line and this is what came out! I don't understand why I'm being asked for AVX2!

λ curl http://localhost:5000/inferences/llamacpp/loadmodel   -H 'Content-Type: application/json'   -d '{"cpu_threads":5,"ctx_len":2048,"prompt_template":"<|system|>\n{system_message}<|user|>\n{prompt}<|assistant|>","llama_model_path":"S:\\JAN\\models\\tinyllama-1.1b\\tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf","system_prompt":"<|system|>\n","user_prompt":"<|user|>\n","ai_prompt":"<|assistant|>","ngl":100}'
{"message":"AVX2 is not supported by your processor, please download and replace the correct Nitro asset version"

mike2003 · 2024-05-09T10:09:47Z

Last version nitro-0.3.18-win-amd64-avx-cuda-12-0.tar.gz without avx2 error
but exit

20240509 10:09:47.865000 UTC 9568 INFO  Nitro version:  - main.cc:50
20240509 10:09:47.867000 UTC 9568 INFO  Server started, listening at: 127.0.0.1:5000 - main.cc:54
20240509 10:09:47.867000 UTC 9568 INFO  Please load your model - main.cc:55
20240509 10:09:47.867000 UTC 9568 INFO  Number of thread is:8 - main.cc:62
{"timestamp":1715249390,"level":"INFO","function":"LoadModelImpl","line":646,"message":"system info","n_threads":5,"total_threads":8,"system_info":"AVX = 1 | AVX_VNNI = 0 | AVX2 = 1 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | SSSE3 = 0 | VSX = 0 | MATMUL_INT8 = 0 | "}

curl: (56) Recv failure: Connection was reset

louis-jan · 2024-06-05T10:58:03Z

Last version nitro-0.3.18-win-amd64-avx-cuda-12-0.tar.gz without avx2 error but exit

20240509 10:09:47.865000 UTC 9568 INFO  Nitro version:  - main.cc:50
20240509 10:09:47.867000 UTC 9568 INFO  Server started, listening at: 127.0.0.1:5000 - main.cc:54
20240509 10:09:47.867000 UTC 9568 INFO  Please load your model - main.cc:55
20240509 10:09:47.867000 UTC 9568 INFO  Number of thread is:8 - main.cc:62
{"timestamp":1715249390,"level":"INFO","function":"LoadModelImpl","line":646,"message":"system info","n_threads":5,"total_threads":8,"system_info":"AVX = 1 | AVX_VNNI = 0 | AVX2 = 1 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | FMA = 1 | NEON = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 1 | SSSE3 = 0 | VSX = 0 | MATMUL_INT8 = 0 | "}

curl: (56) Recv failure: Connection was reset

Could I get the request body?

mike2003 added the type: bug Something isn't working label May 9, 2024

louis-jan assigned CameronNg Jun 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug: I'm trying to get JAN on avx with ERROR AVX2. #552

bug: I'm trying to get JAN on avx with ERROR AVX2. #552

mike2003 commented May 9, 2024

mike2003 commented May 9, 2024 •

edited

mike2003 commented May 9, 2024

mike2003 commented May 9, 2024

louis-jan commented Jun 5, 2024

bug: I'm trying to get JAN on avx with ERROR AVX2. #552

bug: I'm trying to get JAN on avx with ERROR AVX2. #552

Comments

mike2003 commented May 9, 2024

mike2003 commented May 9, 2024 • edited

mike2003 commented May 9, 2024

mike2003 commented May 9, 2024

louis-jan commented Jun 5, 2024

mike2003 commented May 9, 2024 •

edited