Commit Graph

  • 113e685d18 inline -> static inline for "bytesFromNibbles" (#161) hoangmit 2023-03-15 15:05:14 -0400
  • 289637a6a3 streaming conversion without pytorch Dmitry Wolf 2023-03-15 21:25:01 +0300
  • b1a4a2a02a Update main.cpp Anujsharma002 2023-03-15 23:10:29 +0530
  • 44f74671cb add docker action pipeline Bernat Vadell 2023-03-15 17:45:33 +0100
  • 279400845b CMake: properly find and link to threading library Gleb Popov 2023-03-15 19:36:16 +0300
  • 405d94e61f inline -> static inline for "bytesFromNibbles" hoangmit 2023-03-15 11:08:54 -0400
  • 2788f373be Get the build working Justine Tunney 2023-03-15 02:52:31 -0700
  • 1b73521840 Use F16 for memory_k and memory_v Ty Everett 2023-03-14 23:10:12 -0700
  • a4fb5a9719 fixed warning with std::ignore about unused function result Justin 2023-03-15 02:37:01 +0000
  • 9eb4598fa3 added it in more places Justin 2023-03-15 02:33:56 +0000
  • 1ddaee93fb added sigint handler for ansi_color_reset Justin 2023-03-15 01:56:14 +0000
  • e3d780d833 added ctx_size parameter Justin 2023-03-15 01:47:51 +0000
  • cb5234aeea Added magnet link to download model files Jonathan Mackenzie 2023-03-15 12:25:51 +1100
  • 116f94bfa2 fixed color reset on exit Justin 2023-03-15 01:23:55 +0000
  • c3c89050c3 Merge branch 'ggerganov:master' into master apaz 2023-03-15 00:34:40 +0000
  • 3878230201 Resolved merge conflicts. apaz-cli 2023-03-14 18:42:12 -0500
  • 6894b63556 Added mmap() iostream implementation. apaz-cli 2023-03-14 18:32:51 -0500
  • f89ca6fd36 Set executable bit Danil Yarantev (Yardanico) 2023-03-15 00:11:07 +0300
  • f781b1904d Fix line endings in quantize.sh from CRLF to LF Danil Yarantev (Yardanico) 2023-03-15 00:10:02 +0300
  • 9da4e66710 Use tokenizer.vocab_size() instead of hardcoding 32000 in convert-pth-to-ggml.py Ronsor 2023-03-14 13:33:41 -0700
  • 84fb7f516c Nix: fix ARM neon dotproduct on macOS Niklas Korz 2023-03-14 21:14:29 +0100
  • a296589180 Remove direnv Niklas Korz 2023-03-14 18:34:25 +0100
  • 0759896135 Nix: remove channel compatibility Niklas Korz 2023-03-14 18:33:53 +0100
  • 1171c0e663 Nix: use python packages supplied by withPackages Niklas Korz 2023-03-14 13:29:03 +0100
  • 3ee54ef522 direnv: use flake instead of use_flake Niklas Korz 2023-03-13 12:01:57 +0100
  • a9230b679b Nix: development shel, direnv and compatibility Niklas Korz 2023-03-13 10:48:39 +0100
  • 034b589102 Nix: only add Accelerate framework on macOS Niklas Korz 2023-03-13 10:23:53 +0100
  • 8defc0a54e Nix flake Niklas Korz 2023-03-12 10:15:46 +0100
  • e1b1e12a41 modularize qunash 2023-03-14 22:52:22 +0300
  • 901c34d1b1 Update .devops/tools.sh Bernat Vadell 2023-03-14 20:46:41 +0100
  • 47857e564c Don't use vdotq_s32 if it's not available (#139) Ronsor 2023-03-14 12:34:37 -0700
  • fcbadc6f59 Update ggml.c Georgi Gerganov 2023-03-14 21:34:12 +0200
  • 777aa4e53d Don't use vdotq_s32 if it's not available Ronsor 2023-03-14 10:53:41 -0700
  • 297863982c Change subtext according to ggerganov recommendations Leszek Hanusz 2023-03-14 18:19:45 +0100
  • a4590d3be9 add quantize into tool docker image Bernat Vadell 2023-03-14 18:04:21 +0100
  • ae380dddbf Forgot to regenerate the pdf with the lowercase a in llama Leszek Hanusz 2023-03-14 16:35:07 +0100
  • 5fc3112110 Adding llama banner in README.md Leszek Hanusz 2023-03-14 16:14:25 +0100
  • 9959b1f374 split dockerfile into main & tools Bernat Vadell 2023-03-14 15:46:03 +0100
  • 6d9ad1058b feat: split build & runtime stages Bernat Vadell 2023-03-14 15:03:25 +0100
  • 60f819a2b1 Add section to README on how to run the project on Android (#130) Radoslav Gerganov 2023-03-14 15:30:08 +0200
  • 23fe89d2e9 Add section to README on how to run the project on Android Radoslav Gerganov 2023-03-14 14:43:32 +0200
  • ce509c79c5 feat: dockerize llamacpp Bernat Vadell 2023-03-14 14:16:52 +0100
  • 311fc3fca2 Functionality addition CMakeLists.txt nusu-github 2023-03-14 19:51:08 +0900
  • 8e1210f993 Update README.md Musab Gultekin 2023-03-14 13:19:33 +0300
  • c81a58190c Update README.md Musab Gultekin 2023-03-14 13:10:39 +0300
  • 5e539c06a8 Fix build for Android Radoslav Gerganov 2023-03-14 10:59:06 +0200
  • 97ab2b2578 Add Misc section + update hot topics + minor fixes Georgi Gerganov 2023-03-14 09:43:52 +0200
  • 585431924a perhaps more consistent Rickey Bowers Jr 2023-03-14 00:43:11 -0600
  • 570e4ff810 add SIGINT support for _WIN32 environments Rickey Bowers Jr 2023-03-14 00:39:22 -0600
  • 68897eb13a Use STL in input processing and fix off-by-one bug Thomas Antony 2023-03-13 21:07:49 -0700
  • fdb1987f6f add ptread link to fix cmake build under linux mmyjona 2023-03-14 12:19:45 +0800
  • c2af31149f Refactoring: more concise and readable qunash 2023-03-14 03:57:35 +0300
  • d8aba05a62 Use f-strings instead of concatenation qunash 2023-03-14 01:55:51 +0300
  • 94f368fd53 Refactor get_n_parts function to simplify code and improve readability qunash 2023-03-14 01:50:50 +0300
  • 2f700a2738 Add windows to the CI (#98) Sebastián A 2023-03-13 17:29:10 -0300
  • c09a9cfb06 CMake build in Release by default (#75) Georgi Gerganov 2023-03-13 21:22:15 +0200
  • b5ae4eb92d Add windows to the CI Sebastián Aedo 2023-03-13 15:31:15 -0300
  • 7ec903d3c1 Update contribution section, hot topics, limitations, etc. Georgi Gerganov 2023-03-13 19:21:51 +0200
  • 4497ad819c Print system information Georgi Gerganov 2023-03-13 19:15:08 +0200
  • ed6849cc07 Initial support for CMake (#75) Sebastián A 2023-03-13 14:12:33 -0300
  • 41be0a3b3d Add NetBSD support. (#90) Thomas Klausner 2023-03-13 17:40:54 +0100
  • 671d5cac15 Use fprintf for diagnostic output (#48) Pavol Rusnak 2023-03-13 17:39:56 +0100
  • 84d9015c4a Use vdotq_s32 to improve performance (#67) Georgi Gerganov 2023-03-13 18:36:44 +0200
  • 0fa481b092 Back to original change Georgi Gerganov 2023-03-13 18:35:42 +0200
  • 63fd76fbb0 Reduce model loading time (#43) uint256_t 2023-03-14 01:33:43 +0900
  • 3419f88f11 Minor Georgi Gerganov 2023-03-13 18:33:23 +0200
  • 2a20f48efa Fix UTF-8 handling (including colors) (#79) Val Kharitonov 2023-03-13 12:24:18 -0400
  • d1f224712d Add quantize script for batch quantization (#92) Pavol Rusnak 2023-03-13 17:15:20 +0100
  • acf35ec45a Fix file list on Mac OS Georgi Gerganov 2023-03-13 18:14:22 +0200
  • 23d334b660 Fix script name Georgi Gerganov 2023-03-13 18:09:18 +0200
  • 5f5332b4ef README for new quantize.sh Georgi Gerganov 2023-03-13 18:08:14 +0200
  • d328973aa1 Indentation Georgi Gerganov 2023-03-13 18:05:18 +0200
  • bc2b35ec69 Updating README after running 60B of llama.cpp Joshua Herman 2023-03-13 09:43:34 -0500
  • a1eff53587 remove unused header wizard 2023-03-13 22:42:12 +0800
  • daedf7ce97 Add quantize script for batch quantization Pavol Rusnak 2023-03-13 13:49:46 +0100
  • 6d9c32e94a Initial support for CMake Sebastián Aedo 2023-03-12 23:17:55 -0300
  • fa8ce2a207 Add NetBSD support. Thomas Klausner 2023-03-13 12:25:28 +0100
  • 6b9e424671 add support to load tokenizer.model from command line argument wizard 2023-03-13 18:01:05 +0800
  • 7438b83939 Merge branch 'ggerganov:master' into master wizd 2023-03-13 17:03:31 +0800
  • ed10def70e clean code wizard 2023-03-13 17:02:06 +0800
  • 15f06f6b4f buffering utf-8 output to make it complete for spliting output. wizard 2023-03-13 16:27:11 +0800
  • 1808ee0500 Add initial contribution guidelines Georgi Gerganov 2023-03-13 09:42:26 +0200
  • 6c8258665b use fprintf for diagnostic output Pavol Rusnak 2023-03-13 08:25:02 +0100
  • 9ec1dfdbe2 Strip trailing whitespace from prompt file Matthew McAllister 2023-03-12 22:13:23 -0700
  • 86e967c54b buffering output for UTF-8 encoded token wizard 2023-03-13 13:06:01 +0800
  • cb8c464a60 Fix UTF-8 handling (including colors) Valery Kharitonov 2023-03-13 01:01:34 -0400
  • 1133eea479 Truncate prompt if longer than context + n_predict Matthew McAllister 2023-03-12 21:49:17 -0700
  • b8f20713b9 Add __pycache__ and *.bin to gitignore Jed Fox 2023-03-12 22:21:22 -0400
  • 34af8a97e8 FIx parsing single-byte UTF-8 tokens by manually parsing the protobuf Jed Fox 2023-03-12 20:16:05 -0400
  • a169bb889c Gate signal support on being on a unixoid system. (#74) dev Matvey Soloviev 2023-03-13 04:08:01 +0100
  • 9b4077184a introduces justfile Barton Rhodes 2023-03-13 02:37:04 +0000
  • 1b87fe1e90 call a standalone function to untokenize output wizard 2023-03-13 10:00:34 +0800
  • 307dba3dd2 first try to intergrate sentencepiece wizard 2023-03-13 10:00:01 +0800
  • ce7ebb3319 Another antiprompt fix beiller 2023-03-08 16:44:50 -0500
  • 9425a21db5 Bugfix and back to printing as normal Fix antiprompt beiller 2023-03-08 16:44:50 -0500
  • efaa30ef11 Use vector maekawatoshiki 2023-03-13 10:38:41 +0900
  • 703571861f undo complicated printing until its fixed sadly beiller 2023-03-08 16:44:50 -0500
  • 5ed9219221 Update convert-pth-to-ggml.py Dietrich Daroch 2023-03-12 17:45:55 -0700
  • bb40590714 Misc: Use argparse Dietrich Daroch 2023-03-12 17:18:59 -0700
  • 81c9c9e8a6 Misc: Make the conversion script executable Dietrich Daroch 2023-03-12 17:05:13 -0700