llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-27 20:43:07 +01:00

Shintarou Okada 753be377b6 llama : add PLaMo model (#3557)	2023-12-24 15:35:49 +02:00
* add plamo mock
* add tensor loading
* plamo convert
* update norm
* able to compile
* fix norm_rms_eps hparam
* runnable
* use inp_pos
* seems ok
* update kqv code
* remove develop code
* update README
* shuffle attn_q.weight and attn_output.weight for broadcasting
* remove plamo_llm_build_kqv and use llm_build_kqv
* fix style
* update
* llama : remove obsolete KQ_scale
* plamo : fix tensor names for correct GPU offload
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
__init__.py	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	2023-11-11 08:04:50 +03:00
constants.py	llama : add PLaMo model (#3557)	2023-12-24 15:35:49 +02:00
gguf_reader.py	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	2023-11-11 08:04:50 +03:00
gguf_writer.py	llama : add Mixtral support (#4406)	2023-12-13 14:04:25 +02:00
gguf.py	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	2023-11-11 08:04:50 +03:00
py.typed	convert : various script cleanups/fixes + merges and special token handling (#2842)	2023-08-30 11:25:50 +03:00
tensor_mapping.py	llama : add PLaMo model (#3557)	2023-12-24 15:35:49 +02:00
vocab.py	py : open merges file as 'utf-8' (#4566)	2023-12-21 19:07:34 +02:00