Bug 1227605 - ollama stopped working with the newest update. msg: illegal instruction (core dumped)
Summary: ollama stopped working with the newest update. msg: illegal instruction (core ...
Status: NEW
Alias: None
Product: openSUSE Tumbleweed
Classification: openSUSE
Component: Other
Version: Current
Hardware: x86-64 openSUSE Tumbleweed
Priority: P5 - None  Severity: Normal
Target Milestone: ---
Assignee: Eyad Issa
QA Contact: E-mail List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-07-10 07:02 UTC by l k
Modified: 2024-07-10 11:25 UTC
CC List: 2 users

See Also:
Found By: ---
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---


Description l k 2024-07-10 07:02:54 UTC
I don't know if this only affects the openSUSE packaging of ollama.


Log messages:
dev@localhost:~> ollama start
2024/07/10 08:57:05 routes.go:1064: INFO server config env="map[CUDA_VISIBLE_DEVICES: GPU_DEVICE_ORDINAL: HIP_VISIBLE_DEVICES: HSA_OVERRIDE_GFX_VERSION: OLLAMA_DEBUG:false OLLAMA_FLASH_ATTENTION:false OLLAMA_HOST:http://127.0.0.1:11434 OLLAMA_INTEL_GPU:false OLLAMA_KEEP_ALIVE: OLLAMA_LLM_LIBRARY: OLLAMA_MAX_LOADED_MODELS:1 OLLAMA_MAX_QUEUE:512 OLLAMA_MAX_VRAM:0 OLLAMA_MODELS:/home/dev/.ollama/models OLLAMA_NOHISTORY:false OLLAMA_NOPRUNE:false OLLAMA_NUM_PARALLEL:1 OLLAMA_ORIGINS:[http://localhost https://localhost http://localhost:* https://localhost:* http://127.0.0.1 https://127.0.0.1 http://127.0.0.1:* https://127.0.0.1:* http://0.0.0.0 https://0.0.0.0 http://0.0.0.0:* https://0.0.0.0:* app://* file://* tauri://*] OLLAMA_RUNNERS_DIR: OLLAMA_SCHED_SPREAD:false OLLAMA_TMPDIR: ROCR_VISIBLE_DEVICES:]"
time=2024-07-10T08:57:05.590+02:00 level=INFO source=images.go:730 msg="total blobs: 16"
time=2024-07-10T08:57:05.592+02:00 level=INFO source=images.go:737 msg="total unused blobs removed: 0"
[GIN-debug] [WARNING] Creating an Engine instance with the Logger and Recovery middleware already attached.

[GIN-debug] [WARNING] Running in "debug" mode. Switch to "release" mode in production.
 - using env:   export GIN_MODE=release
 - using code:  gin.SetMode(gin.ReleaseMode)

[GIN-debug] POST   /api/pull                 --> github.com/ollama/ollama/server.(*Server).PullModelHandler-fm (5 handlers)
[GIN-debug] POST   /api/generate             --> github.com/ollama/ollama/server.(*Server).GenerateHandler-fm (5 handlers)
[GIN-debug] POST   /api/chat                 --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (5 handlers)
[GIN-debug] POST   /api/embeddings           --> github.com/ollama/ollama/server.(*Server).EmbeddingsHandler-fm (5 handlers)
[GIN-debug] POST   /api/create               --> github.com/ollama/ollama/server.(*Server).CreateModelHandler-fm (5 handlers)
[GIN-debug] POST   /api/push                 --> github.com/ollama/ollama/server.(*Server).PushModelHandler-fm (5 handlers)
[GIN-debug] POST   /api/copy                 --> github.com/ollama/ollama/server.(*Server).CopyModelHandler-fm (5 handlers)
[GIN-debug] DELETE /api/delete               --> github.com/ollama/ollama/server.(*Server).DeleteModelHandler-fm (5 handlers)
[GIN-debug] POST   /api/show                 --> github.com/ollama/ollama/server.(*Server).ShowModelHandler-fm (5 handlers)
[GIN-debug] POST   /api/blobs/:digest        --> github.com/ollama/ollama/server.(*Server).CreateBlobHandler-fm (5 handlers)
[GIN-debug] HEAD   /api/blobs/:digest        --> github.com/ollama/ollama/server.(*Server).HeadBlobHandler-fm (5 handlers)
[GIN-debug] GET    /api/ps                   --> github.com/ollama/ollama/server.(*Server).ProcessHandler-fm (5 handlers)
[GIN-debug] POST   /v1/chat/completions      --> github.com/ollama/ollama/server.(*Server).ChatHandler-fm (6 handlers)
[GIN-debug] GET    /                         --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func1 (5 handlers)
[GIN-debug] GET    /api/tags                 --> github.com/ollama/ollama/server.(*Server).ListModelsHandler-fm (5 handlers)
[GIN-debug] GET    /api/version              --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func2 (5 handlers)
[GIN-debug] HEAD   /                         --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func1 (5 handlers)
[GIN-debug] HEAD   /api/tags                 --> github.com/ollama/ollama/server.(*Server).ListModelsHandler-fm (5 handlers)
[GIN-debug] HEAD   /api/version              --> github.com/ollama/ollama/server.(*Server).GenerateRoutes.func2 (5 handlers)
time=2024-07-10T08:57:05.592+02:00 level=INFO source=routes.go:1111 msg="Listening on 127.0.0.1:11434 (version 0.0.0)"
time=2024-07-10T08:57:05.593+02:00 level=INFO source=payload.go:30 msg="extracting embedded files" dir=/tmp/ollama2475486328/runners
time=2024-07-10T08:57:05.629+02:00 level=INFO source=payload.go:44 msg="Dynamic LLM libraries [cpu cpu_avx cpu_avx2]"
time=2024-07-10T08:57:05.644+02:00 level=INFO source=types.go:98 msg="inference compute" id=0 library=cpu compute="" driver=0.0 name="" total="19.4 GiB" available="17.6 GiB"
[GIN] 2024/07/10 - 08:57:15 | 200 |      52.753µs |       127.0.0.1 | HEAD     "/"
[GIN] 2024/07/10 - 08:57:15 | 200 |    1.841915ms |       127.0.0.1 | GET      "/api/tags"
[GIN] 2024/07/10 - 08:57:34 | 200 |      28.313µs |       127.0.0.1 | HEAD     "/"
[GIN] 2024/07/10 - 08:57:34 | 200 |   21.780392ms |       127.0.0.1 | POST     "/api/show"
time=2024-07-10T08:57:34.518+02:00 level=INFO source=memory.go:309 msg="offload to cpu" layers.requested=-1 layers.model=28 layers.offload=0 layers.split="" memory.available="[17.7 GiB]" memory.required.full="9.0 GiB" memory.required.partial="0 B" memory.required.kv="540.0 MiB" memory.required.allocations="[9.0 GiB]" memory.weights.total="8.5 GiB" memory.weights.repeating="8.4 GiB" memory.weights.nonrepeating="164.1 MiB" memory.graph.full="212.0 MiB" memory.graph.partial="376.1 MiB"
time=2024-07-10T08:57:34.519+02:00 level=INFO source=server.go:368 msg="starting llama server" cmd="/tmp/ollama2475486328/runners/cpu_avx/ollama_llama_server --model /home/dev/.ollama/models/blobs/sha256-5ff0abeeac1d2dbdd5455c0b49ba3b29a9ce3c1fb181b2eef2e948689d55d046 --ctx-size 2048 --batch-size 512 --embedding --log-disable --parallel 1 --port 42197"
time=2024-07-10T08:57:34.519+02:00 level=INFO source=sched.go:382 msg="loaded runners" count=1
time=2024-07-10T08:57:34.520+02:00 level=INFO source=server.go:556 msg="waiting for llama runner to start responding"
time=2024-07-10T08:57:34.520+02:00 level=INFO source=server.go:594 msg="waiting for server to become available" status="llm server error"
time=2024-07-10T08:57:34.771+02:00 level=ERROR source=sched.go:388 msg="error loading llama server" error="llama runner process has terminated: signal: illegal instruction (core dumped) "
[GIN] 2024/07/10 - 08:57:34 | 500 |  277.570917ms |       127.0.0.1 | POST     "/api/chat"

-----------------------------------------------------------------------------

Command message:
dev@localhost:~> ollama run deepseek-coder-v2:latest  
Error: llama runner process has terminated: signal: illegal instruction (core dumped) 
dev@localhost:~> 


--------------------------------------------------------------------


The same result occurs with llama3 and gemma(2).
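
The log above shows the runner being started from the cpu_avx directory and then dying with an illegal instruction. One thing that may be worth checking is whether the CPU actually advertises the instruction sets that the runner variants are named after. The following is only a minimal sketch, assuming a Linux system with /proc/cpuinfo. The function names and the mapping from variant name to CPU flag are assumptions made for illustration and are not part of ollama.

```python
from pathlib import Path

# Runner variant names as printed in the "Dynamic LLM libraries" log line,
# mapped to the /proc/cpuinfo flag each name suggests. This mapping is an
# assumption; the packaged binaries may need further extensions.
VARIANT_FLAGS = {"cpu": None, "cpu_avx": "avx", "cpu_avx2": "avx2"}


def parse_cpu_flags(cpuinfo_text: str) -> set[str]:
    """Return the flags of the first CPU found in /proc/cpuinfo-style text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()


def supported_variants(flags: set[str]) -> list[str]:
    """List the variants whose assumed required flag is present."""
    return [
        name
        for name, flag in VARIANT_FLAGS.items()
        if flag is None or flag in flags
    ]


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    if cpuinfo.exists():
        flags = parse_cpu_flags(cpuinfo.read_text())
        print("avx:", "avx" in flags, "avx2:", "avx2" in flags)
        print("variants the CPU flags would allow:", supported_variants(flags))
    else:
        print("/proc/cpuinfo not found; this sketch only covers Linux")
```

If the flags turn out to be present, the crash may instead come from other instructions the packaged runner was compiled with, so this check can narrow things down but does not by itself identify the cause.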