llama.cpp/.devops/full-cuda.Dockerfile

ARG UBUNTU_VERSION=22.04

# This needs to generally match the container host's environment.
ARG CUDA_VERSION=11.7.1

# Target the CUDA build image
ARG BASE_CUDA_DEV_CONTAINER=nvidia/cuda:${CUDA_VERSION}-devel-ubuntu${UBUNTU_VERSION}

FROM ${BASE_CUDA_DEV_CONTAINER} AS build

# Unless otherwise specified, we make a fat build.
ARG CUDA_DOCKER_ARCH=all

RUN apt-get update && \
    apt-get install -y build-essential python3 python3-pip git libcurl4-openssl-dev libgomp1

COPY requirements.txt   requirements.txt
COPY requirements       requirements

RUN pip install --upgrade pip setuptools wheel \
    && pip install -r requirements.txt

WORKDIR /app

COPY . .

# Set nvcc architecture
ENV CUDA_DOCKER_ARCH=${CUDA_DOCKER_ARCH}
# Enable CUDA
ENV GGML_CUDA=1
# Enable cURL
ENV LLAMA_CURL=1

RUN make -j$(nproc)

ENTRYPOINT ["/app/.devops/tools.sh"]
docker : add support for CUDA in docker (#1461) Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-07-07 20:25:25 +02:00			`ARG UBUNTU_VERSION=22.04`

			`# This needs to generally match the container host's environment.`
			`ARG CUDA_VERSION=11.7.1`

			`# Target the CUDA build image`
			`ARG BASE_CUDA_DEV_CONTAINER=nvidia/cuda:${CUDA_VERSION}-devel-ubuntu${UBUNTU_VERSION}`

build : Fix docker build warnings (#8535) (#8537) 2024-07-17 20:21:55 +02:00			`FROM ${BASE_CUDA_DEV_CONTAINER} AS build`
docker : add support for CUDA in docker (#1461) Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-07-07 20:25:25 +02:00
			`# Unless otherwise specified, we make a fat build.`
			`ARG CUDA_DOCKER_ARCH=all`

			`RUN apt-get update && \`
docker : add openmp lib (#7780) 2024-06-06 07:17:21 +02:00			`apt-get install -y build-essential python3 python3-pip git libcurl4-openssl-dev libgomp1`
docker : add support for CUDA in docker (#1461) Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-07-07 20:25:25 +02:00
python : add check-requirements.sh and GitHub workflow (#4585) * python: add check-requirements.sh and GitHub workflow This script and workflow forces package versions to remain compatible across all convert.py scripts, while allowing secondary convert scripts to import dependencies not wanted in convert.py. Move requirements into ./requirements * Fail on "==" being used for package requirements (but can be suppressed) * Enforce "compatible release" syntax instead of == * Update workflow * Add upper version bound for transformers and protobuf * improve check-requirements.sh * small syntax change * don't remove venvs if nocleanup is passed * See if this fixes docker workflow * Move check-requirements.sh into ./scripts/ --------- Co-authored-by: Jared Van Bortel <jared@nomic.ai> 2023-12-29 15:50:29 +01:00			`COPY requirements.txt requirements.txt`
			`COPY requirements requirements`
docker : add support for CUDA in docker (#1461) Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-07-07 20:25:25 +02:00
			`RUN pip install --upgrade pip setuptools wheel \`
			`&& pip install -r requirements.txt`

			`WORKDIR /app`

			`COPY . .`

			`# Set nvcc architecture`
			`ENV CUDA_DOCKER_ARCH=${CUDA_DOCKER_ARCH}`
cuda : rename build flag to LLAMA_CUDA (#6299) 2024-03-26 01:16:01 +01:00			`# Enable CUDA`
devops : remove clblast + LLAMA_CUDA -> GGML_CUDA (#8139) ggml-ci 2024-06-26 18:32:07 +02:00			`ENV GGML_CUDA=1`
server: add cURL support to server Dockerfiles (#6474) * server: add cURL support to `full.Dockerfile` * server: add cURL support to `full-cuda.Dockerfile` and `server-cuda.Dockerfile` * server: add cURL support to `full-rocm.Dockerfile` and `server-rocm.Dockerfile` * server: add cURL support to `server-intel.Dockerfile` * server: add cURL support to `server-vulkan.Dockerfile` * fix typo in `server-vulkan.Dockerfile` Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2024-04-04 18:31:22 +02:00			`# Enable cURL`
			`ENV LLAMA_CURL=1`
docker : add support for CUDA in docker (#1461) Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-07-07 20:25:25 +02:00
Fixed painfully slow single process builds. (#7326) * Fixed painfully slow single process builds. * Added nproc for systems that don't default to nproc 2024-05-30 22:32:38 +02:00			`RUN make -j$(nproc)`
docker : add support for CUDA in docker (#1461) Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> 2023-07-07 20:25:25 +02:00
			`ENTRYPOINT ["/app/.devops/tools.sh"]`