Install / pip
SSL or certificate errors during pip install (corporate proxy / QDC)
SSL or certificate errors during pip install (corporate proxy / QDC)
Windows SmartScreen blocks the CLI installer
Windows SmartScreen blocks the CLI installer
.exe is not yet code-signed. Click More info → Run anyway in the SmartScreen dialog.CLI
`geniex` not found after install
`geniex` not found after install
PATH. Run:`--compute cpu` or `--compute gpu` errors with a Qualcomm AI Hub Model
`--compute cpu` or `--compute gpu` errors with a Qualcomm AI Hub Model
--compute npu (or omit the flag — npu is the default for qairt). To run on CPU/GPU, switch to a GGUF model on the llama.cpp runtime.Server
`geniex serve` returns 'model not found'
`geniex serve` returns 'model not found'
geniex serve.Docker container can't see the NPU
Docker container can't see the NPU
--privileged flag is required for NPU access. Make sure your docker run includes it, plus the volume mounts for /usr/lib. See CLI install (Docker).Linux
`docker pull` fails with permission / unauthorized
`docker pull` fails with permission / unauthorized
`geniex` install.sh reports missing shared libraries
`geniex` install.sh reports missing shared libraries
Container loads model but inference fails with `Failed to create device: 14001`
Container loads model but inference fails with `Failed to create device: 14001`
docker run includes --privileged and the /usr/lib mount, and that the host’s Qualcomm driver packages (qcom-adreno1, qcom-fastrpc1) are installed — see the missing-libraries entry above.Android
Model loads but generation self-repeats or outputs nothing
Model loads but generation self-repeats or outputs nothing
generateStreamFlow instead of the chat-templated prompt. Qualcomm AI Engine Direct pipelines treat their input as already-templated — pass applyChatTemplate().formattedText, not the raw user message.Qualcomm AI Hub pull fails with `INVALID_INPUT`
Qualcomm AI Hub pull fails with `INVALID_INPUT`
chipset for Qualcomm AI Hub pulls — auto-detect only runs on Windows on Snapdragon. Set ModelPullInput.chipset to "SM8750" (Snapdragon 8 Elite) or "SM8850" (Snapdragon 8 Elite Gen 5). See Android API reference → ModelPullInput.Qualcomm AI Engine Direct load fails with 'unknown model name'
Qualcomm AI Engine Direct load fails with 'unknown model name'
qwen3_4b_instruct_2507, qwen2_5_vl_7b_instruct, etc.). To add a new Qualcomm AI Hub Model, register it on the C++ side first — see third-party/geniex-qairt/models/{llm,vlm}_model_registry.h.`nGpuLayers` or `nCtx` rejected on Qualcomm AI Engine Direct
`nGpuLayers` or `nCtx` rejected on Qualcomm AI Engine Direct
nGpuLayers and nCtx at their defaults; tune max_tokens and enable_thinking instead.