3234 Commits

Author SHA1 Message Date
Georgi Gerganov
2d555e5b42
Add CI (#60) 2023-03-12 22:08:24 +02:00
Georgi Gerganov
7c9e54e55e
Revert "weights_only" arg - this causing more trouble than help 2023-03-12 20:59:01 +02:00
Oleksandr Nikitin
b9bd1d0141
python/pytorch compat notes (#44) 2023-03-12 14:16:33 +02:00
beiller
129c7d1ea8
Add repetition penalty (#20)
* Adding repeat penalization

* Update utils.h

* Update utils.cpp

* Numeric fix

Should probably still scale by temp even if penalized

* Update comments, more proper application

I see that numbers can go negative so a fix from a referenced commit

* Minor formatting

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-03-12 11:27:42 +02:00
Georgi Gerganov
702fddf5c5
Clarify meaning of hacking 2023-03-12 09:03:25 +02:00
Georgi Gerganov
7d86e25bf6
README: add "Supported platforms" + update hot topics 2023-03-12 08:41:54 +02:00
deepdiffuser
a93120236f
use weights_only in conversion script (#32)
this restricts malicious weights from executing arbitrary code by restricting the unpickler to only loading tensors, primitive types, and dictionaries
2023-03-12 08:36:35 +02:00
Pavol Rusnak
6a9a67f0be
Add LICENSE (#21) 2023-03-12 08:36:03 +02:00
Georgi Gerganov
da1a4ff01f
Update README.md 2023-03-12 01:26:32 +02:00
Juraj Bednar
6b2cb6302f
Fix a typo in model name (#16) 2023-03-11 19:32:20 +02:00
Georgi Gerganov
4235e3d5b3
Update README.md 2023-03-11 18:10:18 +02:00
Georgi Gerganov
f1eaff4721
Add AVX2 support for x86 architectures thanks to @Const-me ! 2023-03-11 18:04:25 +02:00
Georgi Gerganov
a9e58529ea
Fix un-initialized FP16 tables on x86 (#15, #2) 2023-03-11 17:40:14 +02:00
Georgi Gerganov
7d9ed7b25f
Bump memory buffer 2023-03-11 12:45:01 +02:00
Georgi Gerganov
0c6803321c
Update README.md 2023-03-11 12:31:21 +02:00
Georgi Gerganov
f60fa9e50a
.gitignore models/ 2023-03-11 12:27:02 +02:00
Georgi Gerganov
7211862c94
Update Makefile var + add comment 2023-03-11 12:27:02 +02:00
Georgi Gerganov
a5c5ae2f54
Update README.md 2023-03-11 11:34:25 +02:00
Georgi Gerganov
ea977e85ec
Update README.md 2023-03-11 11:34:11 +02:00
Georgi Gerganov
007a8f6f45
Support all LLaMA models + change Q4_0 quantization storage 2023-03-11 11:28:30 +02:00
Simon Willison
5f2f970d51
Include Python dependencies in README (#6) 2023-03-11 07:47:26 +02:00
Georgi Gerganov
73c6ed5e87
Update README.md 2023-03-11 01:30:47 +02:00
Georgi Gerganov
01eeed8fb1
Update README.md 2023-03-11 01:22:58 +02:00
Georgi Gerganov
6da2df34ee
Update README.md 2023-03-11 01:18:10 +02:00
Jean-Michaël Celerier
9dcf4dba45
Add missing headers for memcpy and assert (#3) 2023-03-11 01:04:06 +02:00
Georgi Gerganov
920a7fe2d9
Update README.md 2023-03-11 00:55:22 +02:00
Georgi Gerganov
3a57ee59de
Update README.md 2023-03-11 00:51:46 +02:00
Georgi Gerganov
b85028522d
Update README.md 2023-03-11 00:09:19 +02:00
Georgi Gerganov
8a01f565ff
Update README.md 2023-03-10 23:53:11 +02:00
Georgi Gerganov
70bc0b8b15
Fix a bug in the rope calculation 2023-03-10 23:46:57 +02:00
Georgi Gerganov
18ebda34d6
Update README.md 2023-03-10 21:52:27 +02:00
Georgi Gerganov
319cdb3e1f
Final touches 2023-03-10 21:50:46 +02:00
Georgi Gerganov
775328064e
Create README.md 2023-03-10 21:47:46 +02:00
Georgi Gerganov
26c0846629
Initial release 2023-03-10 20:56:40 +02:00
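
Note on commit 129c7d1ea8 (Add repetition penalty): the message says that logits should still be scaled by temperature when penalized, and that the numbers can go negative. The Python sketch below illustrates one way those two points can fit together. It is not the project's C++ code. The function name, the parameter names, and the choice to multiply negative logits by the penalty (so they move further from zero instead of closer to it) are assumptions made for illustration.

```python
def penalized_scaled_logits(logits, recent_tokens, repeat_penalty, temp):
    """Scale every logit by 1/temp and push down recently seen tokens.

    Hypothetical helper for illustration only; the names and signature
    are assumptions and do not come from the repository.
    """
    scale = 1.0 / temp
    recent = set(recent_tokens)
    out = []
    for token_id, logit in enumerate(logits):
        # Temperature scaling applies to every token, penalized or not.
        scaled = logit * scale
        if token_id in recent:
            if scaled < 0.0:
                # Dividing a negative value by a penalty > 1 would raise it
                # toward zero, so multiply to make the token less likely.
                scaled *= repeat_penalty
            else:
                scaled /= repeat_penalty
        out.append(scaled)
    return out
```

With a penalty above 1, a recently seen token with a positive logit is divided down and one with a negative logit is multiplied further below zero, so both become less likely; tokens not seen recently are only temperature-scaled.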