When I try to start the llama-gpt API using docker-compose-gguf.yml, I get a bunch of errors (see below) from the api container.
I've also tried different .gguf models, but I still see the same errors.
Any idea what's causing them?
Thanks
```
llama-gpt-api_llama-gpt-api_1 exited with code 1
llama-gpt-ui_1 | [INFO wait] Host [llama-gpt-api:8000] not yet available...
llama-gpt-ui_1 | [INFO wait] Host [llama-gpt-api:8000] not yet available...
llama-gpt-ui_1 | [INFO wait] Host [llama-gpt-api:8000] not yet available...
llama-gpt-ui_1 | [INFO wait] Host [llama-gpt-api:8000] not yet available...
llama-gpt-api_1 | /usr/local/lib/python3.11/site-packages/setuptools/command/develop.py:40: EasyInstallDeprecationWarning: easy_install command is deprecated.
llama-gpt-api_1 | !!
llama-gpt-api_1 |
llama-gpt-api_1 | ********************************************************************************
llama-gpt-api_1 | Please avoid running ``setup.py`` and ``easy_install``.
llama-gpt-api_1 | Instead, use pypa/build, pypa/installer or other
llama-gpt-api_1 | standards-based tools.
llama-gpt-api_1 |
llama-gpt-api_1 | See https://github.com/pypa/setuptools/issues/917 for details.
llama-gpt-api_1 | ********************************************************************************
llama-gpt-api_1 |
llama-gpt-api_1 | !!
llama-gpt-api_1 | easy_install.initialize_options(self)
llama-gpt-ui_1 | [INFO wait] Host [llama-gpt-api:8000] not yet available...
llama-gpt-api_1 | [1/2] Generating /app/vendor/llama.cpp/libllama.so
llama-gpt-api_1 | FAILED: /app/vendor/llama.cpp/libllama.so
llama-gpt-api_1 | cd /app/vendor/llama.cpp && make libllama.so
llama-gpt-api_1 | make[1]: Entering directory '/app/vendor/llama.cpp'
llama-gpt-api_1 | I llama.cpp build info:
llama-gpt-api_1 | I UNAME_S: Linux
llama-gpt-api_1 | I UNAME_P: unknown
llama-gpt-api_1 | I UNAME_M: x86_64
llama-gpt-api_1 | I CFLAGS: -I. -O3 -std=c11 -fPIC -DNDEBUG -Wall -Wextra -Wpedantic -Wcast-qual -Wdouble-promotion -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -pthread -march=native -mtune=native -DGGML_USE_K_QUANTS
llama-gpt-api_1 | I CXXFLAGS: -I. -I./common -O3 -std=c++11 -fPIC -DNDEBUG -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wno-multichar -pthread -march=native -mtune=native -DGGML_USE_K_QUANTS
llama-gpt-api_1 | I LDFLAGS:
llama-gpt-api_1 | I CC: cc (Debian 10.2.1-6) 10.2.1 20210110
llama-gpt-api_1 | I CXX: g++ (Debian 10.2.1-6) 10.2.1 20210110
llama-gpt-api_1 |
llama-gpt-api_1 | cc -I. -O3 -std=c11 -fPIC -DNDEBUG -Wall -Wextra -Wpedantic -Wcast-qual -Wdouble-promotion -Wshadow -Wstrict-prototypes -Wpointer-arith -Wmissing-prototypes -pthread -march=native -mtune=native -DGGML_USE_K_QUANTS -c ggml.c -o ggml.o
llama-gpt-api_1 | In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:111,
llama-gpt-api_1 | from ggml.c:302:
llama-gpt-api_1 | ggml.c: In function ‘ggml_vec_dot_q4_0_q8_0’:
llama-gpt-api_1 | /usr/lib/gcc/x86_64-linux-gnu/10/include/fmaintrin.h:63:1: error: inlining failed in call to ‘always_inline’ ‘_mm256_fmadd_ps’: target specific option mismatch
llama-gpt-api_1 | 63 | _mm256_fmadd_ps (__m256 __A, __m256 __B, __m256 __C)
llama-gpt-api_1 | | ^~~~~~~~~~~~~~~
llama-gpt-api_1 | ggml.c:2527:15: note: called from here
llama-gpt-api_1 | 2527 | acc = _mm256_fmadd_ps( d, q, acc );
llama-gpt-api_1 | | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
llama-gpt-api_1 | In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:111,
llama-gpt-api_1 | from ggml.c:302:
llama-gpt-api_1 | /usr/lib/gcc/x86_64-linux-gnu/10/include/fmaintrin.h:63:1: error: inlining failed in call to ‘always_inline’ ‘_mm256_fmadd_ps’: target specific option mismatch
llama-gpt-api_1 | 63 | _mm256_fmadd_ps (__m256 __A, __m256 __B, __m256 __C)
llama-gpt-api_1 | | ^~~~~~~~~~~~~~~
llama-gpt-api_1 | ggml.c:2527:15: note: called from here
llama-gpt-api_1 | 2527 | acc = _mm256_fmadd_ps( d, q, acc );
llama-gpt-api_1 | | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
llama-gpt-api_1 | In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:111,
llama-gpt-api_1 | from ggml.c:302:
llama-gpt-api_1 | /usr/lib/gcc/x86_64-linux-gnu/10/include/fmaintrin.h:63:1: error: inlining failed in call to ‘always_inline’ ‘_mm256_fmadd_ps’: target specific option mismatch
llama-gpt-api_1 | 63 | _mm256_fmadd_ps (__m256 __A, __m256 __B, __m256 __C)
llama-gpt-api_1 | | ^~~~~~~~~~~~~~~
llama-gpt-api_1 | ggml.c:2527:15: note: called from here
llama-gpt-api_1 | 2527 | acc = _mm256_fmadd_ps( d, q, acc );
llama-gpt-api_1 | | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
llama-gpt-api_1 | In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:111,
llama-gpt-api_1 | from ggml.c:302:
llama-gpt-api_1 | /usr/lib/gcc/x86_64-linux-gnu/10/include/fmaintrin.h:63:1: error: inlining failed in call to ‘always_inline’ ‘_mm256_fmadd_ps’: target specific option mismatch
llama-gpt-api_1 | 63 | _mm256_fmadd_ps (__m256 __A, __m256 __B, __m256 __C)
llama-gpt-api_1 | | ^~~~~~~~~~~~~~~
llama-gpt-api_1 | ggml.c:2527:15: note: called from here
llama-gpt-api_1 | 2527 | acc = _mm256_fmadd_ps( d, q, acc );
llama-gpt-api_1 | | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
llama-gpt-api_1 | make[1]: *** [Makefile:349: ggml.o] Error 1
llama-gpt-api_1 | make[1]: Leaving directory '/app/vendor/llama.cpp'
llama-gpt-api_1 | ninja: build stopped: subcommand failed.
llama-gpt-api_1 | Traceback (most recent call last):
llama-gpt-api_1 | File "/usr/local/lib/python3.11/site-packages/skbuild/setuptools_wrap.py", line 674, in setup
llama-gpt-api_1 | cmkr.make(make_args, install_target=cmake_install_target, env=env)
llama-gpt-api_1 | File "/usr/local/lib/python3.11/site-packages/skbuild/cmaker.py", line 697, in make
llama-gpt-api_1 | self.make_impl(clargs=clargs, config=config, source_dir=source_dir, install_target=install_target, env=env)
llama-gpt-api_1 | File "/usr/local/lib/python3.11/site-packages/skbuild/cmaker.py", line 742, in make_impl
llama-gpt-api_1 | raise SKBuildError(msg)
llama-gpt-api_1 |
llama-gpt-api_1 | An error occurred while building with CMake.
llama-gpt-api_1 | Command:
llama-gpt-api_1 | /usr/local/lib/python3.11/site-packages/cmake/data/bin/cmake --build . --target install --config Release --
llama-gpt-api_1 | Install target:
llama-gpt-api_1 | install
llama-gpt-api_1 | Source directory:
llama-gpt-api_1 | /app
llama-gpt-api_1 | Working directory:
llama-gpt-api_1 | /app/_skbuild/linux-x86_64-3.11/cmake-build
llama-gpt-api_1 | Please check the install target is valid and see CMake's output for more information.
llama-gpt-api_1 |
llama-gpt-api_1 | make: *** [Makefile:9: build] Error 1
llama-gpt-api_1 | Initializing server with:
llama-gpt-api_1 | Batch size: 2096
llama-gpt-api_1 | Number of CPU threads: 4
llama-gpt-api_1 | Number of GPU layers: 0
llama-gpt-api_1 | Context window: 4096
llama-gpt-api_1 | Traceback (most recent call last):
llama-gpt-api_1 | File "<frozen runpy>", line 189, in _run_module_as_main
llama-gpt-api_1 | File "<frozen runpy>", line 112, in _get_module_details
llama-gpt-api_1 | File "/app/llama_cpp/__init__.py", line 1, in <module>
llama-gpt-api_1 | from .llama_cpp import *
llama-gpt-api_1 | File "/app/llama_cpp/llama_cpp.py", line 80, in <module>
llama-gpt-api_1 | _lib = _load_shared_library(_lib_base_name)
llama-gpt-api_1 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
llama-gpt-api_1 | File "/app/llama_cpp/llama_cpp.py", line 71, in _load_shared_library
llama-gpt-api_1 | raise FileNotFoundError(
llama-gpt-api_1 | FileNotFoundError: Shared library with base name 'llama' not found
llama-gpt-api_llama-gpt-api_1 exited with code 1
```
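For what it's worth, every compile error in that log is the same one: `_mm256_fmadd_ps` failing to inline ("target specific option mismatch") while ggml.c is built with `-march=native -mtune=native`. I believe that usually means the CPU the build sees (the host, or a VM's virtual CPU) doesn't advertise FMA/AVX2. This is how I'd check the flags on the Docker host — a plain-Linux sketch, nothing llama-gpt-specific:

```
# Print the instruction-set flags the host CPU exposes; fma/avx2 would need to
# appear here for llama.cpp's default x86 path to compile under -march=native.
grep -m1 '^flags' /proc/cpuinfo | tr ' ' '\n' | grep -E '^(sse4_2|avx|avx2|fma|f16c)$'
```

On the machine where this fails, I'd expect `fma` and/or `avx2` to be missing from that output.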
docker-compose-gguf.yml:
```
version: '3.6'

services:
  llama-gpt-api:
    # Pin to llama-cpp-python 0.1.80 with GGUF support
    image: ghcr.io/abetlen/llama-cpp-python:latest@sha256:de0fd227f348b5e43d4b5b7300f1344e712c14132914d1332182e9ecfde502b2
    restart: on-failure
    volumes:
      - './models:/models'
      - './api:/api'
    ports:
      - 3001:8000
    environment:
      MODEL: '/models/${MODEL_NAME:-code-llama-2-7b-chat.gguf}'
      MODEL_DOWNLOAD_URL: '${MODEL_DOWNLOAD_URL:-https://huggingface.co/TheBloke/CodeLlama-7B-Instruct-GGUF/resolve/main/codellama-7b-instruct.Q4_K_M.gguf}'
      N_GQA: '${N_GQA:-1}'
      USE_MLOCK: 1
    cap_add:
      - IPC_LOCK
    command: '/bin/sh /api/run.sh'

  llama-gpt-ui:
    # TODO: Use this image instead of building from source after the next release
    # image: 'ghcr.io/getumbrel/llama-gpt-ui:latest'
    build:
      context: ./ui
      dockerfile: Dockerfile
    ports:
      - 3002:3000
    restart: on-failure
    environment:
      - 'OPENAI_API_KEY=sk-XXXXXXXXXXXXXXXXXXXX'
      - 'OPENAI_API_HOST=http://llama-gpt-api:8000'
      - 'DEFAULT_MODEL=/models/${MODEL_NAME:-llama-2-7b-chat.bin}'
      - 'NEXT_PUBLIC_DEFAULT_SYSTEM_PROMPT=${DEFAULT_SYSTEM_PROMPT:-"You are a helpful and friendly AI assistant. Respond very concisely."}'
      - 'WAIT_HOSTS=llama-gpt-api:8000'
      - 'WAIT_TIMEOUT=${WAIT_TIMEOUT:-3600}'
```
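To narrow it down further, it might also help to ask the compiler inside the api container which of those instruction sets `-march=native` actually enables there (the image does ship `cc`, per the build log above; the service name comes from the compose file, and the `-f` argument is whichever compose file you start the stack with):

```
# Dump the macros cc predefines under -march=native inside the api container;
# if __AVX2__ / __FMA__ are missing, that would explain the always_inline errors.
docker-compose -f docker-compose-gguf.yml run --rm llama-gpt-api \
  sh -c 'cc -march=native -dM -E - </dev/null | grep -E "__(AVX2|FMA|AVX)__"'
```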