llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-02-05 16:10:42 +01:00

History

Xuan Son Nguyen 6c5bc0625f server : (refactoring) do not rely on JSON internally (#10643 ) * server : (refactoring) reduce usage of json internally * move all response types to struct * wip [no ci] * many fixes * add virtual function * fix index * minor style fix * add std::move * refactor handle_completions_generic * add virtual functions * remove server.hpp * clarify server_sent_event RFC specs * apply review comments * fix model_alias and completion_probabilities * small clean up * remove virtual for to_json_oai_compat() * naming oai_compat --> oaicompat * fix unwanted recursive call * update docs		2024-12-06 11:14:32 +01:00
..
test_basic.py	server : add more test cases (#10569 )	2024-11-29 21:48:56 +01:00
test_chat_completion.py	server : (refactoring) do not rely on JSON internally (#10643 )	2024-12-06 11:14:32 +01:00
test_completion.py	server : (refactoring) do not rely on JSON internally (#10643 )	2024-12-06 11:14:32 +01:00
test_ctx_shift.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_embedding.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_infill.py	server : add more test cases (#10569 )	2024-11-29 21:48:56 +01:00
test_lora.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_rerank.py	server : add more test cases (#10569 )	2024-11-29 21:48:56 +01:00
test_security.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_slot_save.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_speculative.py	server : fix speculative decoding with context shift (#10641 )	2024-12-04 22:38:20 +02:00
test_tokenize.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00