tdrussell
|
cb26163a20
|
Fix off-by-one error in exllama_hf caching logic (#4145)
|
2023-10-05 12:20:56 -03:00 |
|
oobabooga
|
ae4ba3007f
|
Add grammar to transformers and _HF loaders (#4091)
|
2023-10-05 10:01:36 -03:00 |
|
oobabooga
|
b6fe6acf88
|
Add threads_batch parameter
|
2023-10-01 21:28:00 -07:00 |
|
jllllll
|
41a2de96e5
|
Bump llama-cpp-python to 0.2.11
|
2023-10-01 18:08:10 -05:00 |
|
oobabooga
|
f2d82f731a
|
Add recommended NTKv1 alpha values
|
2023-09-29 13:48:38 -07:00 |
|
oobabooga
|
abe99cddeb
|
Extend evaluation slider bounds
|
2023-09-29 13:06:26 -07:00 |
|
oobabooga
|
96da2e1c0d
|
Read more metadata (config.json & quantize_config.json)
|
2023-09-29 06:14:16 -07:00 |
|
oobabooga
|
56b5a4af74
|
exllamav2 typical_p
|
2023-09-28 20:10:12 -07:00 |
|
oobabooga
|
f8e9733412
|
Minor syntax change
|
2023-09-28 19:32:35 -07:00 |
|
oobabooga
|
f931184b53
|
Increase truncation limits to 32768
|
2023-09-28 19:28:22 -07:00 |
|
oobabooga
|
1dd13e4643
|
Read Transformers config.json metadata
|
2023-09-28 19:19:47 -07:00 |
|
StoyanStAtanasov
|
7e6ff8d1f0
|
Enable NUMA feature for llama_cpp_python (#4040)
|
2023-09-26 22:05:00 -03:00 |
|
oobabooga
|
87ea2d96fd
|
Add a note about RWKV loader
|
2023-09-26 17:43:39 -07:00 |
|
oobabooga
|
0c89180966
|
Another minor fix
|
2023-09-26 06:54:21 -07:00 |
|
oobabooga
|
365335e1ae
|
Minor fix
|
2023-09-26 06:47:19 -07:00 |
|
oobabooga
|
1ca54faaf0
|
Improve --multi-user mode
|
2023-09-26 06:42:33 -07:00 |
|
oobabooga
|
019371c0b6
|
Lint
|
2023-09-25 20:31:11 -07:00 |
|
oobabooga
|
814520fed1
|
Extension install improvements
|
2023-09-25 20:27:06 -07:00 |
|
oobabooga
|
7f1460af29
|
Change a warning
|
2023-09-25 20:22:27 -07:00 |
|
oobabooga
|
862b45b1c7
|
Extension install improvements
|
2023-09-25 19:48:30 -07:00 |
|
oobabooga
|
c8952cce55
|
Move documentation from UI to docs/
|
2023-09-25 12:28:28 -07:00 |
|
oobabooga
|
d0d221df49
|
Add --use_fast option (closes #3741)
|
2023-09-25 12:19:43 -07:00 |
|
oobabooga
|
b973b91d73
|
Automatically filter by loader (closes #4072)
|
2023-09-25 10:28:35 -07:00 |
|
oobabooga
|
63de9eb24f
|
Clean up the transformers loader
|
2023-09-24 20:26:26 -07:00 |
|
oobabooga
|
36c38d7561
|
Add disable_exllama to Transformers loader (for GPTQ LoRA training)
|
2023-09-24 20:03:11 -07:00 |
|
oobabooga
|
55a685d999
|
Minor fixes
|
2023-09-24 14:15:10 -07:00 |
|
oobabooga
|
08cf150c0c
|
Add a grammar editor to the UI (#4061)
|
2023-09-24 18:05:24 -03:00 |
|
oobabooga
|
eb0b7c1053
|
Fix a minor UI bug
|
2023-09-24 07:17:33 -07:00 |
|
oobabooga
|
3edac43426
|
Remove print statement
|
2023-09-24 07:13:00 -07:00 |
|
oobabooga
|
b227e65d86
|
Add grammar to llama.cpp loader (closes #4019)
|
2023-09-24 07:10:45 -07:00 |
|
oobabooga
|
2e7b6b0014
|
Create alternative requirements.txt with AMD and Metal wheels (#4052)
|
2023-09-24 09:58:29 -03:00 |
|
oobabooga
|
7a3ca2c68f
|
Better detect EXL2 models
|
2023-09-23 13:05:55 -07:00 |
|
oobabooga
|
b1467bd064
|
Move one-click-installers into the repository (#4028 from oobabooga/one-click)
|
2023-09-22 17:43:07 -03:00 |
|
oobabooga
|
c075969875
|
Add instructions
|
2023-09-22 13:10:03 -07:00 |
|
oobabooga
|
8ab3eca9ec
|
Add a warning for outdated installations
|
2023-09-22 09:35:19 -07:00 |
|
oobabooga
|
95976a9d4f
|
Fix a bug while deleting characters
|
2023-09-22 06:02:34 -07:00 |
|
oobabooga
|
d5330406fa
|
Add a rename menu for chat histories
|
2023-09-21 19:16:51 -07:00 |
|
oobabooga
|
00ab450c13
|
Multiple histories for each character (#4022)
|
2023-09-21 17:19:32 -03:00 |
|
oobabooga
|
029da9563f
|
Avoid redundant function call in llamacpp_hf
|
2023-09-19 14:14:40 -07:00 |
|
oobabooga
|
869f47fff9
|
Lint
|
2023-09-19 13:51:57 -07:00 |
|
oobabooga
|
13ac55fa18
|
Reorder some functions
|
2023-09-19 13:51:57 -07:00 |
|
oobabooga
|
03dc69edc5
|
ExLlama_HF (v1 and v2) prefix matching
|
2023-09-19 13:12:19 -07:00 |
|
oobabooga
|
5075087461
|
Fix command-line arguments being ignored
|
2023-09-19 13:11:46 -07:00 |
|
oobabooga
|
ff5d3d2d09
|
Add missing import
|
2023-09-18 16:26:54 -07:00 |
|
oobabooga
|
605ec3c9f2
|
Add a warning about ExLlamaV2 without flash-attn
|
2023-09-18 12:26:35 -07:00 |
|
oobabooga
|
f0ef971edb
|
Remove obsolete warning
|
2023-09-18 12:25:10 -07:00 |
|
oobabooga
|
745807dc03
|
Faster llamacpp_HF prefix matching
|
2023-09-18 11:02:45 -07:00 |
|
BadisG
|
893a72a1c5
|
Stop generation immediately when using "Maximum tokens/second" (#3952)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
|
2023-09-18 14:27:06 -03:00 |
|
Cebtenzzre
|
8466cf229a
|
llama.cpp: fix ban_eos_token (#3987)
|
2023-09-18 12:15:02 -03:00 |
|
oobabooga
|
0ede2965d5
|
Remove an error message
|
2023-09-17 18:46:08 -07:00 |
|