llama.cpp/chat-with-qwen.txt at da40c42062688964d43d0a53558bf9b006e89f79 - llama.cpp - Gitea: Git with a cup of tea

Mirrors/llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2024-12-30 16:07:17 +01:00

Shijie 37c746d687

llama : add Qwen support (#4281 )

* enable qwen to llama.cpp

* llama : do not GPU split bias tensors

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-12-01 20:16:31 +02:00

1 line

28 B

Plaintext

Raw Blame History

You are a helpful assistant.