oobabooga
|
c06f630bcc
|
Increase max_updates_second maximum value
|
2023-12-24 13:29:47 -08:00 |
|
Casper
|
92d5e64a82
|
Bump AutoAWQ to 0.1.8 (#5061)
|
2023-12-24 14:27:34 -03:00 |
|
oobabooga
|
d76b00c211
|
Pin lm_eval package version
|
2023-12-24 09:22:31 -08:00 |
|
oobabooga
|
8c60495878
|
UI: add "Maximum UI updates/second" parameter
|
2023-12-24 09:17:40 -08:00 |
|
zhangningboo
|
1b8b61b928
|
Fix output_ids decoding for Qwen/Qwen-7B-Chat (#5045)
|
2023-12-22 23:11:02 -03:00 |
|
kabachuha
|
dbe438564e
|
Support for sending images into OpenAI chat API (#4827)
|
2023-12-22 22:45:53 -03:00 |
|
Stefan Daniel Schwarz
|
8956f3ebe2
|
Synthia instruction templates (#5041)
|
2023-12-22 22:19:43 -03:00 |
|
Yiximail
|
afc91edcb2
|
Reset the model_name after unloading the model (#5051)
|
2023-12-22 22:18:24 -03:00 |
|
oobabooga
|
c1b99f45cb
|
Make --help output instant
|
2023-12-21 09:32:20 -08:00 |
|
oobabooga
|
2706149c65
|
Organize the CMD arguments by group (#5027)
|
2023-12-21 00:33:55 -03:00 |
|
oobabooga
|
c727a70572
|
Remove redundancy from modules/loaders.py
|
2023-12-20 19:18:07 -08:00 |
|
luna
|
6efbe3009f
|
let exllama v1 models load safetensor loras (#4854)
|
2023-12-20 13:29:19 -03:00 |
|
oobabooga
|
bcba200790
|
Fix EOS being ignored in ExLlamav2 after previous commit
|
2023-12-20 07:54:06 -08:00 |
|
oobabooga
|
f0f6d9bdf9
|
Add HQQ back & update version
This reverts commit 2289e9031e .
|
2023-12-20 07:46:09 -08:00 |
|
oobabooga
|
b15f510154
|
Optimize ExLlamav2 (non-HF) loader
|
2023-12-20 07:31:42 -08:00 |
|
oobabooga
|
258c695ead
|
Add rich requirement
|
2023-12-19 21:58:36 -08:00 |
|
oobabooga
|
fadb295d4d
|
Lint
|
2023-12-19 21:36:57 -08:00 |
|
oobabooga
|
2289e9031e
|
Remove HQQ from requirements (after https://github.com/oobabooga/text-generation-webui/issues/4993)
|
2023-12-19 21:33:49 -08:00 |
|
oobabooga
|
fb8ee9f7ff
|
Add a specific error if HQQ is missing
|
2023-12-19 21:32:58 -08:00 |
|
oobabooga
|
366c93a008
|
Hide a warning
|
2023-12-19 21:03:20 -08:00 |
|
oobabooga
|
9992f7d8c0
|
Improve several log messages
|
2023-12-19 20:54:32 -08:00 |
|
oobabooga
|
23818dc098
|
Better logger
Credits: vladmandic/automatic
|
2023-12-19 20:38:33 -08:00 |
|
oobabooga
|
95600073bc
|
Add an informative error when extension requirements are missing
|
2023-12-19 20:20:45 -08:00 |
|
oobabooga
|
d8279dc710
|
Replace character name placeholders in chat context (closes #5007)
|
2023-12-19 17:31:46 -08:00 |
|
oobabooga
|
e83e6cedbe
|
Organize the model menu
|
2023-12-19 13:18:26 -08:00 |
|
oobabooga
|
f4ae0075e8
|
Fix conversion from old template format to jinja2
|
2023-12-19 13:16:52 -08:00 |
|
oobabooga
|
de138b8ba6
|
Add llama-cpp-python wheels with tensor cores support (#5003)
|
2023-12-19 17:30:53 -03:00 |
|
oobabooga
|
0a299d5959
|
Bump llama-cpp-python to 0.2.24 (#5001)
|
2023-12-19 15:22:21 -03:00 |
|
oobabooga
|
83cf1a6b67
|
Fix Yi space issue (closes #4996)
|
2023-12-19 07:54:19 -08:00 |
|
oobabooga
|
9847809a7a
|
Add a warning about ppl evaluation without --no_use_fast
|
2023-12-18 18:09:24 -08:00 |
|
oobabooga
|
f6d701624c
|
UI: mention that QuIP# does not work on Windows
|
2023-12-18 18:05:02 -08:00 |
|
oobabooga
|
a23a004434
|
Update the example template
|
2023-12-18 17:47:35 -08:00 |
|
oobabooga
|
3d10c574e7
|
Fix custom system messages in instruction templates
|
2023-12-18 17:45:06 -08:00 |
|
dependabot[bot]
|
9e48e50428
|
Update optimum requirement from ==1.15.* to ==1.16.* (#4986)
|
2023-12-18 21:43:29 -03:00 |
|
俞航
|
9fa3883630
|
Add ROCm wheels for exllamav2 (#4973)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
|
2023-12-18 21:40:38 -03:00 |
|
Water
|
674be9a09a
|
Add HQQ quant loader (#4888)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
|
2023-12-18 21:23:16 -03:00 |
|
oobabooga
|
64a57d9dc2
|
Remove duplicate instruction templates
|
2023-12-17 21:39:47 -08:00 |
|
oobabooga
|
1f9e25e76a
|
UI: update "Saved instruction templates" dropdown after loading template
|
2023-12-17 21:19:06 -08:00 |
|
oobabooga
|
da1c8d77ea
|
Merge remote-tracking branch 'refs/remotes/origin/dev' into dev
|
2023-12-17 21:05:10 -08:00 |
|
oobabooga
|
cac89df97b
|
Instruction templates: better handle unwanted bos tokens
|
2023-12-17 21:04:30 -08:00 |
|
oobabooga
|
f0d6ead877
|
llama.cpp: read instruction template from GGUF metadata (#4975)
|
2023-12-18 01:51:58 -03:00 |
|
oobabooga
|
3f3cd4fbe4
|
UI: improve list style in chat modes
|
2023-12-17 20:26:57 -08:00 |
|
oobabooga
|
306c479d3a
|
Minor fix to Vigogne-Chat template
|
2023-12-17 19:15:54 -08:00 |
|
Hirose
|
3f973e1fbf
|
Add detection for Eric Hartford's Dolphin models in models/config.yaml (#4966)
|
2023-12-17 23:56:34 -03:00 |
|
Eve
|
7c6f39382b
|
Add Orca-Vicuna instruction template (#4971)
|
2023-12-17 23:55:23 -03:00 |
|
FartyPants (FP HAM)
|
59da429cbd
|
Update Training PRO (#4972)
- rolling back safetensors to bi, until it is fixed correctly
- removing the ugly checkpoint detour
|
2023-12-17 23:54:06 -03:00 |
|
oobabooga
|
f1f2c4c3f4
|
Add --num_experts_per_token parameter (ExLlamav2) (#4955)
|
2023-12-17 12:08:33 -03:00 |
|
oobabooga
|
12690d3ffc
|
Better HF grammar implementation (#4953)
|
2023-12-17 02:01:23 -03:00 |
|
oobabooga
|
aa200f8723
|
UI: remove no longer necessary js in Default/Notebook tabs
|
2023-12-16 19:39:00 -08:00 |
|
oobabooga
|
7a84d7b2da
|
Instruct style improvements (#4951)
|
2023-12-16 22:16:26 -03:00 |
|