llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-28 21:07:06 +01:00

History

Xuan Son Nguyen ce8784bdb1 server : fix format_infill (#10724 ) * server : fix format_infill * fix * rename * update test * use another model * update test * update test * test_invalid_input_extra_req		2024-12-08 23:04:29 +01:00
..
test_basic.py	server : (refactor) no more json in server_task input (#10691 )	2024-12-07 20:21:09 +01:00
test_chat_completion.py	server : (refactor) no more json in server_task input (#10691 )	2024-12-07 20:21:09 +01:00
test_completion.py	server : bring back info of final chunk in stream mode (#10722 )	2024-12-08 20:38:51 +01:00
test_ctx_shift.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_embedding.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_infill.py	server : fix format_infill (#10724 )	2024-12-08 23:04:29 +01:00
test_lora.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_rerank.py	server : add more test cases (#10569 )	2024-11-29 21:48:56 +01:00
test_security.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_slot_save.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00
test_speculative.py	server : fix speculative decoding with context shift (#10641 )	2024-12-04 22:38:20 +02:00
test_tokenize.py	server : replace behave with pytest (#10416 )	2024-11-26 16:20:18 +01:00