Xuan Son Nguyen
958367bf53
server : refactor slot input data, move tokenizer to HTTP thread (#10023)
* server : refactor slot input data, move tokenizer to HTTP thread
* move prompt_tokens.empty() check
* fix incorrect if branch
* fix infinite generation loop
* bring back infill validation
* add infill test
* try fixing format_infill
* fix test
* remove redundant code
* rename completion to inference
* update docs
* use llama_tokens everywhere
2024-10-24 21:51:22 +02:00