Commit Graph

2 Commits

Author           SHA1        Message                                                      Date
Georgi Gerganov  eb42596277  llama : do not use KV cache for non-causal models [ggml-ci]  2024-03-04 13:34:16 +02:00
Georgi Gerganov  d0347840c1  llama : fix embeddings [ggml-ci]                             2024-03-04 11:43:16 +02:00