llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-02-06 00:20:34 +01:00

Michał Moskal ff227703d6 sampling : support for llguidance grammars (#10224)	2025-02-02 09:55:32 +02:00

* initial porting of previous LLG patch
* update for new APIs
* build: integrate llguidance as an external project
* use '%llguidance' as marker to enable llg lark syntax
* add some docs
* clarify docs
* code style fixes
* remove llguidance.h from .gitignore
* fix tests when llg is enabled
* pass vocab not model to llama_sampler_init_llg()
* copy test-grammar-integration.cpp to test-llguidance.cpp
* clang fmt
* fix ref-count bug
* build and run test
* gbnf -> lark syntax
* conditionally include llguidance test based on LLAMA_LLGUIDANCE flag
* rename llguidance test file to test-grammar-llguidance.cpp
* add gh action for llg test
* align tests with LLG grammar syntax and JSON Schema spec
* llama_tokenizer() in fact requires valid utf8
* update llg
* format file
* add $LLGUIDANCE_LOG_LEVEL support
* fix whitespace
* fix warning
* include <cmath> for INFINITY
* add final newline
* fail llama_sampler_init_llg() at runtime
* Link gbnf_to_lark.py script; fix links; refer to llg docs for lexemes
* simplify #includes
* improve doc string for LLAMA_LLGUIDANCE
* typo in merge
* bump llguidance to 0.6.12
bench.yml.disabled	ggml-backend : add device and backend reg interfaces (#9707)	2024-10-03 01:49:47 +02:00
build.yml	sampling : support for llguidance grammars (#10224)	2025-02-02 09:55:32 +02:00
close-issue.yml	ci : fine-grant permission (#9710)	2024-10-04 11:47:19 +02:00
docker.yml	ci : fix build CPU arm64 (#11472)	2025-01-29 00:02:56 +01:00
editorconfig.yml	ci : pin dependency to specific version (#11137)	2025-01-08 12:07:20 +01:00
gguf-publish.yml	ci : update checkout, setup-python and upload-artifact to latest (#6456)	2024-04-03 21:01:13 +03:00
labeler.yml	labeler.yml: Use settings from ggerganov/llama.cpp [no ci] (#7363)	2024-05-19 20:51:03 +10:00
python-check-requirements.yml	py : fix requirements check '==' -> '~=' (#8982)	2024-08-12 11:02:01 +03:00
python-lint.yml	ci : add ubuntu cuda build, build with one arch on windows (#10456)	2024-11-26 13:05:07 +01:00
python-type-check.yml	ci : reduce severity of unused Pyright ignore comments (#9697)	2024-09-30 14:13:16 -04:00
server.yml	Tool call support (generic + native for Llama, Functionary, Hermes, Mistral, Firefunction, DeepSeek) w/ lazy grammars (#9639)	2025-01-30 19:13:58 +00:00