Johan
1d5a015ce7
Enable special token support for exllamav2 (#4314)
2023-10-21 01:54:06 -03:00
turboderp
ae8cd449ae
ExLlamav2_HF: Convert logits to FP32 (#4310)
2023-10-18 23:16:05 -03:00
oobabooga
f17f7a6913
Increase the evaluation table height
2023-10-16 12:55:35 -07:00
oobabooga
8ea554bc19
Check for torch.xpu.is_available()
2023-10-16 12:53:40 -07:00
oobabooga
188d20e9e5
Reduce the evaluation table height
2023-10-16 10:53:42 -07:00
oobabooga
2d44adbb76
Clear the torch cache while evaluating
2023-10-16 10:52:50 -07:00
oobabooga
71cac7a1b2
Increase the height of the evaluation table
2023-10-15 21:56:40 -07:00
oobabooga
e14bde4946
Minor improvements to evaluation logs
2023-10-15 20:51:43 -07:00
oobabooga
b88b2b74a6
Experimental Intel Arc transformers support (untested)
2023-10-15 20:51:11 -07:00
Forkoz
8cce1f1126
Exllamav2 lora support (#4229)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-10-14 16:12:41 -03:00
oobabooga
773c17faec
Fix a warning
2023-10-10 20:53:38 -07:00
oobabooga
f63361568c
Fix safetensors kwarg usage in AutoAWQ
2023-10-10 19:03:09 -07:00
oobabooga
39f16ff83d
Fix default/notebook tabs css
2023-10-10 18:45:12 -07:00
oobabooga
fae8062d39
Bump to latest gradio (3.47) (#4258)
2023-10-10 22:20:49 -03:00
oobabooga
9fab9a1ca6
Minor fix
2023-10-10 14:08:11 -07:00
oobabooga
a49cc69a4a
Ignore rope_freq_base if value is 10000
2023-10-10 13:57:40 -07:00
oobabooga
3a9d90c3a1
Download models with 4 threads by default
2023-10-10 13:52:10 -07:00
Forkoz
35695e18c7
Remove import. (#4247)
For real this time.
2023-10-09 18:06:11 -03:00
Forkoz
2e471071af
Update llama_attn_hijack.py (#4231)
2023-10-08 15:16:48 -03:00
Brian Dashore
98fa73a974
Text Generation: stop if EOS token is reached (#4213)
2023-10-07 19:46:42 -03:00
Brian Dashore
7743b5e9de
Llamacpp_HF: Fix CFG cache init (#4219)
Documentation says that model.context_params should be sent when
a new context is created. The current code uses model.params which
doesn't exist.
Signed-off-by: kingbri <bdashore3@proton.me>
2023-10-07 19:38:29 -03:00
turboderp
8a98646a21
Bump ExLlamaV2 to 0.0.5 (#4186)
2023-10-05 19:12:22 -03:00
oobabooga
7ffb424c7b
Add AutoAWQ to README
2023-10-05 09:22:37 -07:00
cal066
cc632c3f33
AutoAWQ: initial support (#3999)
2023-10-05 13:19:18 -03:00
tdrussell
cb26163a20
Fix off-by-one error in exllama_hf caching logic (#4145)
2023-10-05 12:20:56 -03:00
oobabooga
ae4ba3007f
Add grammar to transformers and _HF loaders (#4091)
2023-10-05 10:01:36 -03:00
oobabooga
b6fe6acf88
Add threads_batch parameter
2023-10-01 21:28:00 -07:00
jllllll
41a2de96e5
Bump llama-cpp-python to 0.2.11
2023-10-01 18:08:10 -05:00
oobabooga
f2d82f731a
Add recommended NTKv1 alpha values
2023-09-29 13:48:38 -07:00
oobabooga
abe99cddeb
Extend evaluation slider bounds
2023-09-29 13:06:26 -07:00
oobabooga
96da2e1c0d
Read more metadata (config.json & quantize_config.json)
2023-09-29 06:14:16 -07:00
oobabooga
56b5a4af74
exllamav2 typical_p
2023-09-28 20:10:12 -07:00
oobabooga
f8e9733412
Minor syntax change
2023-09-28 19:32:35 -07:00
oobabooga
f931184b53
Increase truncation limits to 32768
2023-09-28 19:28:22 -07:00
oobabooga
1dd13e4643
Read Transformers config.json metadata
2023-09-28 19:19:47 -07:00
StoyanStAtanasov
7e6ff8d1f0
Enable NUMA feature for llama_cpp_python (#4040)
2023-09-26 22:05:00 -03:00
oobabooga
87ea2d96fd
Add a note about RWKV loader
2023-09-26 17:43:39 -07:00
oobabooga
0c89180966
Another minor fix
2023-09-26 06:54:21 -07:00
oobabooga
365335e1ae
Minor fix
2023-09-26 06:47:19 -07:00
oobabooga
1ca54faaf0
Improve --multi-user mode
2023-09-26 06:42:33 -07:00
oobabooga
019371c0b6
Lint
2023-09-25 20:31:11 -07:00
oobabooga
814520fed1
Extension install improvements
2023-09-25 20:27:06 -07:00
oobabooga
7f1460af29
Change a warning
2023-09-25 20:22:27 -07:00
oobabooga
862b45b1c7
Extension install improvements
2023-09-25 19:48:30 -07:00
oobabooga
c8952cce55
Move documentation from UI to docs/
2023-09-25 12:28:28 -07:00
oobabooga
d0d221df49
Add --use_fast option (closes #3741)
2023-09-25 12:19:43 -07:00
oobabooga
b973b91d73
Automatically filter by loader (closes #4072)
2023-09-25 10:28:35 -07:00
oobabooga
63de9eb24f
Clean up the transformers loader
2023-09-24 20:26:26 -07:00
oobabooga
36c38d7561
Add disable_exllama to Transformers loader (for GPTQ LoRA training)
2023-09-24 20:03:11 -07:00
oobabooga
55a685d999
Minor fixes
2023-09-24 14:15:10 -07:00
oobabooga
08cf150c0c
Add a grammar editor to the UI (#4061)
2023-09-24 18:05:24 -03:00
oobabooga
eb0b7c1053
Fix a minor UI bug
2023-09-24 07:17:33 -07:00
oobabooga
3edac43426
Remove print statement
2023-09-24 07:13:00 -07:00
oobabooga
b227e65d86
Add grammar to llama.cpp loader (closes #4019)
2023-09-24 07:10:45 -07:00
oobabooga
2e7b6b0014
Create alternative requirements.txt with AMD and Metal wheels (#4052)
2023-09-24 09:58:29 -03:00
oobabooga
7a3ca2c68f
Better detect EXL2 models
2023-09-23 13:05:55 -07:00
oobabooga
b1467bd064
Move one-click-installers into the repository (#4028 from oobabooga/one-click)
2023-09-22 17:43:07 -03:00
oobabooga
c075969875
Add instructions
2023-09-22 13:10:03 -07:00
oobabooga
8ab3eca9ec
Add a warning for outdated installations
2023-09-22 09:35:19 -07:00
oobabooga
95976a9d4f
Fix a bug while deleting characters
2023-09-22 06:02:34 -07:00
oobabooga
d5330406fa
Add a rename menu for chat histories
2023-09-21 19:16:51 -07:00
oobabooga
00ab450c13
Multiple histories for each character (#4022)
2023-09-21 17:19:32 -03:00
oobabooga
029da9563f
Avoid redundant function call in llamacpp_hf
2023-09-19 14:14:40 -07:00
oobabooga
869f47fff9
Lint
2023-09-19 13:51:57 -07:00
oobabooga
13ac55fa18
Reorder some functions
2023-09-19 13:51:57 -07:00
oobabooga
03dc69edc5
ExLlama_HF (v1 and v2) prefix matching
2023-09-19 13:12:19 -07:00
oobabooga
5075087461
Fix command-line arguments being ignored
2023-09-19 13:11:46 -07:00
oobabooga
ff5d3d2d09
Add missing import
2023-09-18 16:26:54 -07:00
oobabooga
605ec3c9f2
Add a warning about ExLlamaV2 without flash-attn
2023-09-18 12:26:35 -07:00
oobabooga
f0ef971edb
Remove obsolete warning
2023-09-18 12:25:10 -07:00
oobabooga
745807dc03
Faster llamacpp_HF prefix matching
2023-09-18 11:02:45 -07:00
BadisG
893a72a1c5
Stop generation immediately when using "Maximum tokens/second" (#3952)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-09-18 14:27:06 -03:00
Cebtenzzre
8466cf229a
llama.cpp: fix ban_eos_token (#3987)
2023-09-18 12:15:02 -03:00
oobabooga
0ede2965d5
Remove an error message
2023-09-17 18:46:08 -07:00
missionfloyd
cc8eda298a
Move hover menu shortcuts to right side (#3951)
2023-09-17 22:33:00 -03:00
oobabooga
280cca9f66
Merge remote-tracking branch 'refs/remotes/origin/main'
2023-09-17 18:01:27 -07:00
oobabooga
b062d50c45
Remove exllama import that causes problems
2023-09-17 18:00:32 -07:00
James Braza
fee38e0601
Simplified ExLlama cloning instructions and failure message (#3972)
2023-09-17 19:26:05 -03:00
Lu Guanghua
9858acee7b
Fix unexpected extensions load after gradio restart (#3965)
2023-09-17 17:35:43 -03:00
oobabooga
d9b0f2c9c3
Fix llama.cpp double decoding
2023-09-17 13:07:48 -07:00
oobabooga
d71465708c
llamacpp_HF prefix matching
2023-09-17 11:51:01 -07:00
oobabooga
37e2980e05
Recommend mul_mat_q for llama.cpp
2023-09-17 08:27:11 -07:00
oobabooga
a069f3904c
Undo part of ad8ac545a5
2023-09-17 08:12:23 -07:00
oobabooga
ad8ac545a5
Tokenization improvements
2023-09-17 07:02:00 -07:00
saltacc
cd08eb0753
token probs for non HF loaders (#3957)
2023-09-17 10:42:32 -03:00
kalomaze
7c9664ed35
Allow full model URL to be used for download (#3919)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-09-16 10:06:13 -03:00
saltacc
ed6b6411fb
Fix exllama tokenizers (#3954)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-09-16 09:42:38 -03:00
missionfloyd
2ad6ca8874
Add back chat buttons with --chat-buttons (#3947)
2023-09-16 00:39:37 -03:00
oobabooga
ef04138bc0
Improve the UI tokenizer
2023-09-15 19:30:44 -07:00
oobabooga
c3e4c9fdc2
Add a simple tokenizer to the UI
2023-09-15 19:09:03 -07:00
saltacc
f01b9aa71f
Add customizable ban tokens (#3899)
2023-09-15 18:27:27 -03:00
oobabooga
5b117590ad
Add some scrollbars to Parameters tab
2023-09-15 09:17:37 -07:00
Johan
fdcee0c215
Allow custom tokenizer for llamacpp_HF loader (#3941)
2023-09-15 12:38:38 -03:00
oobabooga
fd7257c7f8
Prevent code blocks from flickering while streaming
2023-09-15 07:46:26 -07:00
oobabooga
a3ecf3bb65
Add cai-chat-square chat style
2023-09-14 16:15:08 -07:00
oobabooga
3d1c0f173d
User config precedence over GGUF metadata
2023-09-14 12:15:52 -07:00
oobabooga
94dc64f870
Add a border
2023-09-14 07:20:36 -07:00
oobabooga
70aafa34dc
Fix blockquote markdown rendering
2023-09-14 05:57:04 -07:00
oobabooga
644a9b8765
Change the chat generate button
2023-09-14 05:16:44 -07:00
oobabooga
ecc90f9f62
Continue on Alt + Enter
2023-09-14 03:59:12 -07:00