llama : add llm_build helper functions (#3848)

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-28 12:57:03 +01:00

* llama : add llm_build_norm helper function

ggml-ci

* llama : add llm_build_ffn helper function (#3849)

ggml-ci

* llama : add llm_build_k_shift helper

ggml-ci

* llama : fix offloading after recent changes

* llama : add llm_build_kv_store helper

ggml-ci

* llama : remove obsolete offload names

* llama : fix llm_build_k_shift to use n_head_kv instead of n_head

* llama : simplify falcon Q, K, V computation

* llama : remove obsolete comments in build graphs

* llama : add llm_build_kqv helper

ggml-ci

* llama : minor

* llama : add LLAMA_OFFLOAD_DEBUG + fix starcoder offloading

* llama : fix input allocation logic

* llama : update offload functions for KQ tensors

* llama : normalize tensor names

ggml-ci

* llama : enable warning about not offloaded tensors

* llama : remove extra ; + deduplicate gate_b logic

* llama : add llm_build_inp_embd helper

This commit is contained in:

Georgi Gerganov

2023-10-31 19:23:12 +02:00

committed by

GitHub

parent 210e6e5d02

commit 5baefef497

No known key found for this signature in database

GPG Key ID: 4AEE18F83AFDEB23

1 changed files with 845 additions and 1410 deletions

2255

llama.cpp

View File

File diff suppressed because it is too large Load Diff

llama : add llm_build helper functions (#3848)

2255 llama.cpp View File

2255

llama.cpp

View File