Default Branch

f35726c2fb · build: apply MSVC /bigobj option to c/cpp files only (#11423) · Updated 2025-01-26 03:10:03 +01:00

Branches

4dad9fa50e · metal : use residency sets · Updated 2025-01-26 11:39:18 +01:00

1 behind · 2 ahead

6da9021ab4 · examples : add idle tool for investigating GPU idle overhead · Updated 2025-01-26 11:31:12 +01:00

1 behind · 1 ahead

172fe7f347 · docker : add GGML_CPU_ARM_ARCH arg to select ARM architecture to build for · Updated 2025-01-25 16:25:02 +01:00

6 behind · 1 ahead

de9d2c6f09 · test [pack] · Updated 2025-01-24 23:24:31 +01:00

10 behind · 3 ahead

f07c2ec505 · llama : add option to override tensor buffers · Updated 2025-01-24 20:56:09 +01:00

11 behind · 1 ahead

969b264657 · Revert "TMP : push artifacts" · Updated 2025-01-24 16:58:09 +01:00

16 behind · 15 ahead

ff4cb6ef4c · release : pack /lib and /include in the packages · Updated 2025-01-24 12:28:37 +01:00

16 behind · 1 ahead

f203a1ac25 · Update documentation · Updated 2025-01-23 17:20:23 +01:00

20 behind · 1 ahead

510b626c03 · export-lora : fix tok_embd tensor · Updated 2025-01-21 12:29:13 +01:00

38 behind · 1 ahead

c9e7cbb08b · safer jinja llama_chat_templates struct · Updated 2025-01-20 16:58:29 +01:00

55 behind · 34 ahead

a47d389c27 · context : prepare for abstraction · Updated 2025-01-20 08:30:25 +01:00

44 behind · 18 ahead

90a0349349 · recommended way to check if the version is 0.3, as requested by ngxson · Updated 2025-01-19 14:43:59 +01:00

53 behind · 2 ahead

ba421dd04e · gguf-test: tensor data comparison · Updated 2025-01-18 09:49:47 +01:00

55 behind · 7 ahead

492eaad571 · ci : change python3 -> python · Updated 2025-01-15 15:18:56 +01:00

69 behind · 1 ahead

0cf9a06799 · vocab : minor [no ci] · Updated 2025-01-14 09:36:28 +01:00

78 behind · 2 ahead

a97b3621cf · ggml : ggml_backend_graph_copy -> ggml_backend_graph_copy_state · Updated 2025-01-12 16:57:51 +01:00

93 behind · 15 ahead

9af90481d0 · Vulkan: Add renderdoc tracing support · Updated 2025-01-12 14:47:36 +01:00

95 behind · 1 ahead

fbddb26250 · ggml-cuda : use i and j instead of i0 and i in vec_dot_tq2_0_q8_1 · Updated 2025-01-12 03:06:49 +01:00

102 behind · 7 ahead

15fbcb5df7 · wip: add cencellable request · Updated 2025-01-10 15:23:13 +01:00

97 behind · 1 ahead

9605c5fb28 · cmake : remove explicit _XOPEN_SOURCE · Updated 2025-01-06 12:02:48 +01:00

147 behind · 2 ahead