Georgi Gerganov
|
8bad78b8e2
|
sync : ggml (part 1)
|
2023-12-07 13:18:36 +02:00 |
|
Georgi Gerganov
|
3d68f364f1
|
ggml : sync (im2col, GPU conv, 32-bit arm compat) (#4060)
ggml-ci
|
2023-11-13 16:55:52 +02:00 |
|
Georgi Gerganov
|
4760e7cc0b
|
sync : ggml (backend v2) (#3912)
* sync : ggml (backend v2) (wip)
* sync : migrate examples and llama.cpp to dynamic graphs (wip)
* sync : update tests + fix max op params to 64
ggml-ci
* sync : ggml-cuda
ggml-ci
* llama : fix save/load state context size
ggml-ci
* sync : try to fix build on tvOS
* sync : pass custom graph sizes in training examples
* sync : update graph copies to new ggml API
* sync : update sync-ggml.sh with new files
* scripts : fix header in sync script
* train : fix context size calculations
* llama : increase inference graph size up to 4096 nodes
* train : allocate grads for backward graphs
* train : allocate grads for gb_tmp
|
2023-11-13 14:16:23 +02:00 |
|
Georgi Gerganov
|
207b51900e
|
ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861)
* ggml : move FP16 <-> FP32 stuff to ggml-impl.h
ggml-ci
* tests : fix ARM build
* ggml : explicitly initialize deprecated type traits
* ggml : add math.h to ggml-impl.h
* ggml : remove duplicate static assert macros
* ggml : prefix lookup tables with ggml_
ggml-ci
* ggml-impl : move extern "C" to start of file
|
2023-10-30 19:19:15 +02:00 |
|