llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-26 12:21:40 +01:00

History

kallewoof 484d2f31ae bug-fix: snprintf prints NULL in place of the last character (#10419 ) * bug-fix: snprintf prints NULL in place of the last character We need to give snprintf enough space to print the last character and the null character, thus we allocate one extra byte and then ignore it when converting to std::string. * add comment about extra null-term byte requirement		2024-12-11 14:48:04 +01:00
..
batched	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
batched-bench	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
batched.swift	llama : llama_perf + option to disable timings during decode (#9355 )	2024-09-13 09:53:38 +03:00
convert-llama2c-to-ggml	make : deprecate (#10514 )	2024-12-02 21:22:53 +02:00
cvector-generator	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
deprecation-warning	Update deprecation-warning.cpp (#10619 )	2024-12-04 23:19:20 +01:00
embedding	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
eval-callback	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
export-lora	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
gbnf-validator	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
gen-docs	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
gguf	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
gguf-hash	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
gguf-split	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
gritlm	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
imatrix	make : deprecate (#10514 )	2024-12-02 21:22:53 +02:00
infill	readme : add option, update default value, fix formatting (#10271 )	2024-12-03 12:50:08 +02:00
jeopardy	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
llama-bench	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
llama.android	llama : remove all_pos_0, all_pos_1, all_seq_id from llama_batch (#9745 )	2024-10-18 23:18:01 +02:00
llama.swiftui	llama : use cmake for swift build (#10525 )	2024-12-08 13:14:54 +02:00
llava	clip : add sycl support (#10574 )	2024-12-04 01:26:37 +01:00
lookahead	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
lookup	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
main	readme : add option, update default value, fix formatting (#10271 )	2024-12-03 12:50:08 +02:00
main-cmake-pkg	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
parallel	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
passkey	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
perplexity	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
quantize	ggml : refactor online repacking (#10446 )	2024-12-07 14:37:50 +02:00
quantize-stats	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
retrieval	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
rpc	ggml : move CPU backend to a separate file (#10144 )	2024-11-03 19:34:08 +01:00
run	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
save-load-state	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
server	bug-fix: snprintf prints NULL in place of the last character (#10419 )	2024-12-11 14:48:04 +01:00
simple	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
simple-chat	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
speculative	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
speculative-simple	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
sycl	[SYCL]set context default value to avoid memory issue, update guide (#9476 )	2024-09-18 08:30:31 +08:00
tokenize	ggml : move AMX to the CPU backend (#10570 )	2024-11-29 21:54:58 +01:00
chat-13B.bat	Create chat-13B.bat (#592 )	2023-03-29 20:21:09 +03:00
chat-13B.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
chat-persistent.sh	scripts : fix pattern and get n_tokens in one go (#10221 )	2024-11-09 09:06:54 +02:00
chat-vicuna.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
chat.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
CMakeLists.txt	cmake : enable warnings in llama (#10474 )	2024-11-26 14:18:08 +02:00
convert_legacy_llama.py	metadata: Detailed Dataset Authorship Metadata (#8875 )	2024-11-13 21:10:38 +11:00
json_schema_pydantic_example.py	py : type-check all Python scripts with Pyright (#8341 )	2024-07-07 15:04:39 -04:00
json_schema_to_grammar.py	grammar : fix JSON Schema for string regex with top-level alt. (#9903 )	2024-10-16 19:03:24 +03:00
llama.vim	llama.vim : bump generation time limit to 3s [no ci]	2024-10-23 17:16:56 +03:00
llm.vim	llm.vim : stop generation at multiple linebreaks, bind to <F2> (#2879 )	2023-08-30 09:50:55 +03:00
Miku.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
pydantic_models_to_grammar_examples.py	examples : Rewrite pydantic_models_to_grammar_examples.py (#8493 )	2024-07-20 22:09:17 -04:00
pydantic_models_to_grammar.py	pydantic : replace uses of __annotations__ with get_type_hints (#8474 )	2024-07-14 19:51:21 -04:00
reason-act.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
regex_to_grammar.py	py : switch to snake_case (#8305 )	2024-07-05 07:53:33 +03:00
server_embd.py	py : type-check all Python scripts with Pyright (#8341 )	2024-07-07 15:04:39 -04:00
server-llama2-13B.sh	`build`: rename main → llama-cli, server → llama-server, llava-cli → llama-llava-cli, etc... (#7809 )	2024-06-13 00:41:52 +01:00
ts-type-to-grammar.sh	JSON schema conversion: ⚡️ faster repetitions, min/maxLength for strings, cap number length (#6555 )	2024-04-12 19:43:38 +01:00