Default Branch

9ba399dfa7 · server : add support for "encoding_format": "base64" to the */embeddings endpoints (#10967) · Updated 2024-12-24 21:33:04 +01:00

Branches

124e4dced2 · Update · Updated 2024-04-22 11:42:32 +02:00

1732
2

3750706962 · llama : add llama_token_is_eog() · Updated 2024-04-20 15:52:03 +02:00

1697
4

f02ea667c1 · ggml : temporary disable llamafile sgemm until fixed · Updated 2024-04-16 21:45:56 +02:00

1706
1

eedd42e376 · KV Cache defrag hash overflow - TMP Fix by @slaren · Updated 2024-04-16 10:24:34 +02:00

1709
1

8b495540fa · imatrix : remove invalid assert · Updated 2024-04-12 10:45:12 +02:00

1734
1

072e0a4d3b · scipts : add LICENSE and gen-authors.sh to sync · Updated 2024-04-09 08:19:33 +02:00

1810
3

a37696d4f1 · speculative : more robust tokenizer comparison · Updated 2024-04-05 00:28:13 +02:00

1780
9

4c190ba676 · cuda : reduce registers · Updated 2024-03-28 20:17:08 +01:00

1822
77

64b7d85891 · llama : fix command-r inference · Updated 2024-03-28 11:22:24 +01:00

1827
1

6be02b5969 · cuda : fix build · Updated 2024-03-27 09:31:52 +01:00

1844
72

87a6088ffe · rename unicodedata.{cpp,h} to unicode-data.{cpp,h} · Updated 2024-03-26 15:52:33 +01:00

1859
7

9c5fd6be14 · minor : spacing · Updated 2024-03-26 13:09:02 +01:00

1857
2

6f20e2672f · Include IQ2_XXS and IQ2_XS in teet-quantize-fns · Updated 2024-03-25 18:01:20 +01:00

1861
1

210e469114 · cuda : fix LLAMA_CUDA_F16 build · Updated 2024-03-25 15:31:10 +01:00

1863
1

d05c13b3b9 · llama : fix BPE LF token on MSVC · Updated 2024-03-23 19:03:16 +01:00

1883
3

3a468e6f9f · llama : fix type of KQ_mask and KQ_pos · Updated 2024-03-22 16:12:17 +01:00

1888
68

0e826d12a5 · quantize: be able to specify the token embedding tensor type · Updated 2024-03-22 15:27:34 +01:00

1897
2

8c3d5b5a79 · common : remove defaults · Updated 2024-03-22 14:33:24 +01:00

1893
2

12aa74ba7d · minor : spacing · Updated 2024-03-22 14:24:57 +01:00

2118
6

31f2d03f1b · server : enable continuous batching by default · Updated 2024-03-22 10:16:43 +01:00

1897
1