Wojtab
e9e75a9ec7
Generalize multimodality (llava/minigpt4 7b and 13b now supported) (#1741)
2023-05-09 20:18:02 -03:00
oobabooga
13e7ebfc77
Change a comment
2023-05-09 15:56:32 -03:00
LaaZa
218bd64bd1
Add the option to not automatically load the selected model (#1762)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-05-09 15:52:35 -03:00
Kamil Szurant
641500dcb9
Use current input for Impersonate (continue impersonate feature) (#1147)
2023-05-09 02:37:42 -03:00
oobabooga
b5260b24f1
Add support for custom chat styles (#1917)
2023-05-08 12:35:03 -03:00
Matthew McAllister
0c048252b5
Fix character menu when default chat mode is 'instruct' (#1873)
2023-05-07 23:50:38 -03:00
oobabooga
56a5969658
Improve the separation between instruct/chat modes (#1896)
2023-05-07 23:47:02 -03:00
oobabooga
56f6b7052a
Sort dropdowns numerically
2023-05-05 23:14:56 -03:00
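Note on "Sort dropdowns numerically": the log does not show how the sort was implemented. As a rough sketch only, a numeric-aware ("natural") sort key could look like the following. The helper name natural_key and the sample model names are assumptions made for illustration, not taken from the repository.

```python
import re

def natural_key(name):
    # Hypothetical helper: split into text and digit runs so that digit runs
    # compare as integers ("7" < "13") instead of as strings ("13" < "7").
    return [int(part) if part.isdigit() else part.lower()
            for part in re.split(r"(\d+)", name)]

# Illustrative names only; a plain sorted() would put "llama-13b" first.
names = ["llama-13b", "llama-7b", "llama-30b"]
ordered = sorted(names, key=natural_key)
print(ordered)
```

Because re.split with a capturing group always alternates text and digit runs, the keys line up position by position and never compare an int with a str.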
oobabooga
8aafb1f796
Refactor text_generation.py, add support for custom generation functions (#1817)
2023-05-05 18:53:03 -03:00
Tom Jobbins
876fbb97c0
Allow downloading model from HF branch via UI (#1662)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-05-05 13:59:01 -03:00
oobabooga
95d04d6a8d
Better warning messages
2023-05-03 21:43:17 -03:00
Tom Jobbins
3c67fc0362
Allow groupsize 1024, needed for larger models eg 30B to lower VRAM usage (#1660)
2023-05-02 00:46:26 -03:00
oobabooga
a777c058af
Precise prompts for instruct mode
2023-04-26 03:21:53 -03:00
oobabooga
f39c99fa14
Load more than one LoRA with --lora, fix a bug
2023-04-25 22:58:48 -03:00
oobabooga
b6af2e56a2
Add --character flag, add character to settings.json
2023-04-24 13:19:42 -03:00
oobabooga
caaa556159
Move extensions block definition to the bottom
2023-04-24 03:30:35 -03:00
oobabooga
b1ee674d75
Make interface state (mostly) persistent on page reload
2023-04-24 03:05:47 -03:00
oobabooga
47809e28aa
Minor changes
2023-04-24 01:04:48 -03:00
Andy Salerno
654933c634
New universal API with streaming/blocking endpoints (#990)
Previous title: Add api_streaming extension and update api-example-stream to use it
* Merge with latest main
* Add parameter capturing encoder_repetition_penalty
* Change some defaults, minor fixes
* Add --api, --public-api flags
* Remove unneeded/broken comment from blocking API startup. The comment is already correctly emitted in try_start_cloudflared by calling the lambda we pass in.
* Update on_start message for blocking_api, it should say 'non-streaming' and not 'streaming'
* Update the API examples
* Change a comment
* Update README
* Remove the gradio API
* Remove unused import
* Minor change
* Remove unused import
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-23 15:52:43 -03:00
oobabooga
2dca8bb25e
Sort imports
2023-04-21 17:20:59 -03:00
oobabooga
c238ba9532
Add a 'Count tokens' button
2023-04-21 17:18:34 -03:00
oobabooga
2d766d2e19
Improve notebook mode button sizes
2023-04-21 02:37:58 -03:00
oobabooga
b4af319fa2
Add a workaround for GALACTICA on some systems
2023-04-19 01:43:10 -03:00
oobabooga
61126f4674
Change the button styles
2023-04-19 00:56:24 -03:00
oobabooga
649e4017a5
Style improvements
2023-04-19 00:36:28 -03:00
oobabooga
c58c1d89bd
Clean method to prevent gradio from phoning home
2023-04-18 03:56:20 -03:00
oobabooga
e1b80e6fe6
Comment the gradio patch
2023-04-18 01:57:59 -03:00
oobabooga
36f7c022f2
Rename a file
2023-04-18 01:38:33 -03:00
oobabooga
00186f76f4
Monkey patch gradio to prevent it from calling home
2023-04-18 01:13:16 -03:00
oobabooga
c3dc348d1c
Don't show 'None' in the LoRA list
2023-04-17 13:52:23 -03:00
oobabooga
209fcd21d5
Reorganize Parameters tab
2023-04-17 00:33:22 -03:00
oobabooga
b937c9d8c2
Add skip_special_tokens checkbox for Dolly model (#1218)
2023-04-16 14:24:49 -03:00
oobabooga
a9c7ef4159
Exclude yaml files from model list
2023-04-16 12:47:30 -03:00
Mikel Bober-Irizar
16a3a5b039
Merge pull request from GHSA-hv5m-3rp9-xcpf
* Remove eval of API input
* Remove unnecessary eval/exec for security
* Use ast.literal_eval
* Use ast.literal_eval
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-16 01:36:50 -03:00
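Note on the GHSA-hv5m-3rp9-xcpf fix: the bullets above say eval of API input was replaced with ast.literal_eval. The actual patch is not shown in this log, so the following is only a minimal sketch of the general technique. The function name parse_literal and the sample inputs are assumptions for illustration.

```python
import ast

def parse_literal(text):
    """Parse a string holding a Python literal without executing any code.

    Hypothetical helper: returns None when the input is not a plain literal.
    """
    try:
        # literal_eval accepts only literals (dicts, lists, strings, numbers,
        # booleans, None) and rejects function calls, attribute access, etc.
        return ast.literal_eval(text)
    except (ValueError, SyntaxError):
        return None

# A plain dict literal is parsed into a dict.
params = parse_literal("{'max_new_tokens': 200, 'temperature': 0.7}")

# An expression that eval() would execute is rejected instead of being run.
blocked = parse_literal("__import__('os').system('echo unsafe')")

print(params, blocked)
```

Unlike eval, ast.literal_eval never calls functions or looks up names, which is why it is the usual choice for parsing untrusted literal input.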
oobabooga
ac189011cb
Add "Save current settings for this model" button
2023-04-15 12:54:02 -03:00
oobabooga
b9dcba7762
Don't overwrite --gpu_memory on boot (#1237/#1235)
2023-04-15 11:59:31 -03:00
oobabooga
628f8e6168
Reorganize chat buttons
2023-04-14 23:17:15 -03:00
oobabooga
c4aa1a42b1
Fix chat history downloading
2023-04-14 19:38:30 -03:00
oobabooga
3a337cfded
Use argparse defaults
2023-04-14 15:35:06 -03:00
oobabooga
43e01282b3
Don't override user initial wbits/groupsize
2023-04-14 15:24:03 -03:00
Alex "mcmonkey" Goodwin
64e3b44e0f
initial multi-lora support (#1103)
---------
Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-14 14:52:06 -03:00
oobabooga
ebb81eb176
Add Undo/Regenerate buttons to notebook mode
2023-04-14 14:34:56 -03:00
oobabooga
581f739b2f
Reorganize
2023-04-14 11:38:26 -03:00
oobabooga
8e31f2bad4
Automatically set wbits/groupsize/instruct based on model name (#1167)
2023-04-14 11:07:28 -03:00
v0xie
9d66957207
Add --listen-host launch option (#1122)
2023-04-13 21:35:08 -03:00
oobabooga
c13e8651ad
Suppress "TypedStorage is deprecated" warnings
2023-04-13 12:09:42 -03:00
oobabooga
17ce7c8671
Suppress annoying bitsandbytes welcome message
2023-04-13 12:04:39 -03:00
oobabooga
fbb448ce4f
If only 1 model is available, load that model
2023-04-13 11:44:10 -03:00
oobabooga
5744b31593
Reorganize some buttons
2023-04-13 11:05:47 -03:00
Xan
6e19ae4b2f
Fix gpt-j model type in UI (#1129)
2023-04-13 10:17:20 -03:00