llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-25 10:58:56 +01:00

History

…
..
baby-llama	ggml : change ggml_scale to take a float instead of tensor (#4573 )	2023-12-21 23:20:49 +02:00
batched	examples : add passkey test (#3856 )	2024-01-08 11:14:04 +02:00
batched-bench
batched.swift
beam-search
benchmark
convert-llama2c-to-ggml
embedding	build : link against build info instead of compiling against it (#3879 )	2023-11-02 08:50:16 +02:00
export-lora	export-lora : use LLAMA_FILE_MAGIC_GGLA (#4894 )	2024-01-12 19:54:53 +02:00
finetune	finetune : print sample-start/include-sample-start (#5072 )	2024-01-22 13:11:01 +02:00
gguf	gguf : simplify example dependencies	2023-12-21 23:08:14 +02:00
imatrix	Adding some imatrix tools (#5302 )	2024-02-04 10:39:58 +02:00
infill	Remove unused data and add fixes (#5154 )	2024-01-27 15:25:55 +01:00
jeopardy	parallel : add option to load external prompt file (#3416 )	2023-10-06 16:16:38 +03:00
llama-bench	refactor : switch to emplace_back to avoid extra object (#5291 )	2024-02-03 13:23:37 +02:00
llama.android	android : use release cmake build type by default (#5123 )	2024-01-25 19:05:51 +02:00
llama.swiftui
llava	llava : add missing .py, and fix paths in README.md (#5414 )	2024-02-08 16:20:03 +02:00
lookahead	english : use `typos` to fix comments and logs (#4354 )	2023-12-12 11:53:36 +02:00
lookup	lookup : add prompt lookup decoding example (#4484 )	2023-12-22 18:05:56 +02:00
main	refactor : switch to emplace_back to avoid extra object (#5291 )	2024-02-03 13:23:37 +02:00
main-cmake-pkg	main-cmake-pkg : fix build issue (#4665 )	2023-12-29 16:18:20 +02:00
parallel	llama : KV cache view API + better KV cache management (#4170 )	2023-11-23 19:07:56 +02:00
passkey	examples : add passkey test (#3856 )	2024-01-08 11:14:04 +02:00
perplexity	refactor : switch to emplace_back to avoid extra object (#5291 )	2024-02-03 13:23:37 +02:00
quantize
quantize-stats	refactor : switch to emplace_back to avoid extra object (#5291 )	2024-02-03 13:23:37 +02:00
save-load-state
server	server : update `/props` with "total_slots" value (#5373 )	2024-02-07 08:15:19 +02:00
simple	simple : update error message for KV cache check (#4324 )	2023-12-04 18:04:21 +02:00
speculative
sycl	[SYCL] update guide of SYCL backend (#5254 )	2024-02-02 15:53:27 +08:00
tokenize
train-text-from-scratch	finetune : fix ggml_allocr lifetimes (tmp workaround) (#5033 )	2024-01-19 20:20:50 +02:00
alpaca.sh
base-translate.sh
chat-13B.bat
chat-13B.sh
chat-persistent.sh
chat-vicuna.sh
chat.sh
CMakeLists.txt
gpt4all.sh
json-schema-to-grammar.py
llama2-13b.sh
llama2.sh
llama.vim
llm.vim
make-ggml.py
Miku.sh
pydantic_models_to_grammar.py
pydantic-models-to-grammar-examples.py
reason-act.sh
server-llama2-13B.sh