ggml-org/llama.cpp

Packages

llama.cpp is a single-repo project, but it groups its code into a few logical packages that compile and ship as separate artifacts. This section documents those packages from the perspective of a downstream consumer.

What ships out of this repo

Artifact	CMake target / file	Purpose
`libllama` (static or shared)	`llama` (`src/`)	C library; main consumer-facing dependency
`libggml` + per-backend libraries	`ggml` (`ggml/`)	Tensor library; can be reused independently
`libcommon` (in-tree only)	`common` (`common/`)	CLI / chat helpers; not exported by `llama-config.cmake`
`llama-*` binaries	`tools/<tool>/`	Command-line programs, one per tool directory (for example `llama-cli` and `llama-server`)
`gguf` Python package	`gguf-py/`	Pip-installable GGUF reader/writer
Conversion scripts	`convert_*.py`	Hugging Face → GGUF model conversion
`find_package(llama)` config	`cmake/llama-config.cmake.in`	Downstream CMake integration

Downstream integration

`find_package(llama)` exposes both the `llama` and `ggml` targets. A minimal example lives at `examples/simple-cmake-pkg/`. The header-only `include/llama-cpp.h` is the recommended C++ entry point.
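As a rough illustration of the consumer side, a downstream `CMakeLists.txt` might look like the sketch below. The project name `my_llama_app` and the source file `main.cpp` are placeholders, and the imported target names (`llama`, `ggml::ggml`) are an assumption based on the in-tree example; check `examples/simple-cmake-pkg/` for the authoritative version.

```cmake
cmake_minimum_required(VERSION 3.14)
project(my_llama_app LANGUAGES CXX)    # hypothetical project name

# Locate an installed llama.cpp. If it was installed to a non-default
# prefix, configure with -DCMAKE_PREFIX_PATH=<install prefix>.
find_package(llama REQUIRED)

add_executable(my_llama_app main.cpp)  # main.cpp is a placeholder source file

# Assumption: the package config provides imported targets named
# `llama` and `ggml::ggml`.
target_link_libraries(my_llama_app PRIVATE llama ggml::ggml)
target_compile_features(my_llama_app PRIVATE cxx_std_17)
```

Linking against imported targets rather than raw library paths lets the package config carry include directories and transitive dependencies to the consumer.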

Packaging shims live under `cmake/` (`llama-config.cmake.in`, `llama.pc.in`). The prebuilt Linux, macOS, and Windows release binaries are produced by `.github/workflows/release.yml`.

Versioning

The project does not follow semantic versioning. Releases are git tags of the form `b<number>`, cut from `master`. Breaking API changes are tracked in issue 9289 (`libllama`) and issue 9291 (server REST API).
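Without semantic versions, one way for a downstream build to stay reproducible is to pin an exact release tag. The sketch below uses CMake's `FetchContent` module for this; it is one possible approach, not something the repository prescribes. `LLAMA_CPP_TAG` and the content name `llama_cpp` are hypothetical, and the tag value is deliberately left for the consumer to supply.

```cmake
include(FetchContent)

# Hypothetical cache variable: set it to a real release tag (b<number>),
# e.g. on the command line with -DLLAMA_CPP_TAG=...
set(LLAMA_CPP_TAG "" CACHE STRING "llama.cpp release tag to build against")
if(NOT LLAMA_CPP_TAG MATCHES "^b[0-9]+$")
  message(FATAL_ERROR "Set LLAMA_CPP_TAG to a release tag of the form b<number>")
endif()

# Fetch exactly that tag and add it to the build as a subproject.
FetchContent_Declare(
  llama_cpp
  GIT_REPOSITORY https://github.com/ggml-org/llama.cpp.git
  GIT_TAG        ${LLAMA_CPP_TAG}
  GIT_SHALLOW    TRUE
)
FetchContent_MakeAvailable(llama_cpp)
```

Failing early on a malformed tag avoids silently building against a moving branch such as `master`.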

License

The project as a whole is MIT-licensed. Vendored third-party code under `vendor/` retains its own licenses, which are recorded in `licenses/`.

Built by Factory AutoWiki from public repository content. It is a generated preview for codebase exploration, not source-maintained documentation.
