llama.cpp/examples
..
baby-llama ggml : change ggml_scale to take a float instead of tensor (#4573) 2023-12-21 23:20:49 +02:00
batched examples : add passkey test (#3856) 2024-01-08 11:14:04 +02:00
batched-bench
batched.swift
beam-search
benchmark
convert-llama2c-to-ggml
embedding build : link against build info instead of compiling against it (#3879) 2023-11-02 08:50:16 +02:00
export-lora export-lora : use LLAMA_FILE_MAGIC_GGLA (#4894) 2024-01-12 19:54:53 +02:00
finetune finetune : print sample-start/include-sample-start (#5072) 2024-01-22 13:11:01 +02:00
gguf gguf : simplify example dependencies 2023-12-21 23:08:14 +02:00
imatrix Adding some imatrix tools (#5302) 2024-02-04 10:39:58 +02:00
infill Remove unused data and add fixes (#5154) 2024-01-27 15:25:55 +01:00
jeopardy parallel : add option to load external prompt file (#3416) 2023-10-06 16:16:38 +03:00
llama-bench refactor : switch to emplace_back to avoid extra object (#5291) 2024-02-03 13:23:37 +02:00
llama.android android : use release cmake build type by default (#5123) 2024-01-25 19:05:51 +02:00
llama.swiftui
llava llava : add missing .py, and fix paths in README.md (#5414) 2024-02-08 16:20:03 +02:00
lookahead english : use typos to fix comments and logs (#4354) 2023-12-12 11:53:36 +02:00
lookup lookup : add prompt lookup decoding example (#4484) 2023-12-22 18:05:56 +02:00
main refactor : switch to emplace_back to avoid extra object (#5291) 2024-02-03 13:23:37 +02:00
main-cmake-pkg main-cmake-pkg : fix build issue (#4665) 2023-12-29 16:18:20 +02:00
parallel llama : KV cache view API + better KV cache management (#4170) 2023-11-23 19:07:56 +02:00
passkey examples : add passkey test (#3856) 2024-01-08 11:14:04 +02:00
perplexity refactor : switch to emplace_back to avoid extra object (#5291) 2024-02-03 13:23:37 +02:00
quantize
quantize-stats refactor : switch to emplace_back to avoid extra object (#5291) 2024-02-03 13:23:37 +02:00
save-load-state
server server : update /props with "total_slots" value (#5373) 2024-02-07 08:15:19 +02:00
simple simple : update error message for KV cache check (#4324) 2023-12-04 18:04:21 +02:00
speculative
sycl [SYCL] update guide of SYCL backend (#5254) 2024-02-02 15:53:27 +08:00
tokenize
train-text-from-scratch finetune : fix ggml_allocr lifetimes (tmp workaround) (#5033) 2024-01-19 20:20:50 +02:00
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt
gpt4all.sh
json-schema-to-grammar.py
llama2-13b.sh
llama2.sh
llama.vim
llm.vim
make-ggml.py
Miku.sh
pydantic_models_to_grammar.py
pydantic-models-to-grammar-examples.py
reason-act.sh
server-llama2-13B.sh