text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-12-27 14:49:32 +01:00

Author	SHA1	Message	Date
oobabooga	9331ab4798	Read GGUF metadata (#3873 )	2023-09-11 18:49:30 -03:00
oobabooga	df52dab67b	Lint	2023-09-11 07:57:38 -07:00
oobabooga	ed86878f02	Remove GGML support	2023-09-11 07:44:00 -07:00
John Smith	cc7b7ba153	fix lora training with alpaca_lora_4bit (#3853 )	2023-09-11 01:22:20 -03:00
Forkoz	15e9b8c915	Exllama new rope settings (#3852 )	2023-09-11 01:14:36 -03:00
oobabooga	4affa08821	Do not impose instruct mode while loading models	2023-09-02 11:31:33 -07:00
oobabooga	47e490c7b4	Set use_cache=True by default for all models	2023-08-30 13:26:27 -07:00
missionfloyd	787219267c	Allow downloading single file from UI (#3737 )	2023-08-29 23:32:36 -03:00
oobabooga	cec8db52e5	Add max_tokens_second param (#3533 )	2023-08-29 17:44:31 -03:00
oobabooga	2b58a89f6a	Clear instruction template before loading new one	2023-08-29 13:11:32 -07:00
oobabooga	36864cb3e8	Use Alpaca as the default instruction template	2023-08-29 13:06:25 -07:00
oobabooga	9a202f7fb2	Prevent <ul> lists from flickering during streaming	2023-08-28 20:45:07 -07:00
oobabooga	439dd0faab	Fix stopping strings in the chat API	2023-08-28 19:40:11 -07:00
oobabooga	c75f98a6d6	Autoscroll Notebook/Default textareas during streaming	2023-08-28 18:22:03 -07:00
oobabooga	558e918fd6	Add a typing dots (...) animation to chat tab	2023-08-28 13:50:36 -07:00
oobabooga	57e9ded00c	Make it possible to scroll during streaming (#3721 )	2023-08-28 16:03:20 -03:00
Cebtenzzre	2f5d769a8d	accept floating-point alpha value on the command line (#3712 )	2023-08-27 18:54:43 -03:00
oobabooga	b2296dcda0	Ctrl+S to show/hide chat controls	2023-08-27 13:14:33 -07:00
Ravindra Marella	e4c3e1bdd2	Fix ctransformers model unload (#3711 ) Add missing comma in model types list Fixes marella/ctransformers#111	2023-08-27 10:53:48 -03:00
oobabooga	0c9e818bb8	Update truncation length based on max_seq_len/n_ctx	2023-08-26 23:10:45 -07:00
oobabooga	3361728da1	Change some comments	2023-08-26 22:24:44 -07:00
oobabooga	8aeae3b3f4	Fix llamacpp_HF loading	2023-08-26 22:15:06 -07:00
oobabooga	7f5370a272	Minor fixes/cosmetics	2023-08-26 22:11:07 -07:00
jllllll	4d61a7d9da	Account for deprecated GGML parameters	2023-08-26 14:07:46 -05:00
jllllll	4a999e3bcd	Use separate llama-cpp-python packages for GGML support	2023-08-26 10:40:08 -05:00
oobabooga	83640d6f43	Replace ggml occurences with gguf	2023-08-26 01:06:59 -07:00
jllllll	db42b365c9	Fix ctransformers threads auto-detection (#3688 )	2023-08-25 14:37:02 -03:00
cal066	960980247f	ctransformers: gguf support (#3685 )	2023-08-25 11:33:04 -03:00
oobabooga	21058c37f7	Add missing file	2023-08-25 07:10:26 -07:00
oobabooga	f4f04c8c32	Fix a typo	2023-08-25 07:08:38 -07:00
oobabooga	5c7d8bfdfd	Detect CodeLlama settings	2023-08-25 07:06:57 -07:00
oobabooga	52ab2a6b9e	Add rope_freq_base parameter for CodeLlama	2023-08-25 06:55:15 -07:00
oobabooga	feecd8190f	Unescape inline code blocks	2023-08-24 21:01:09 -07:00
oobabooga	3320accfdc	Add CFG to llamacpp_HF (second attempt) (#3678 )	2023-08-24 20:32:21 -03:00
oobabooga	d6934bc7bc	Implement CFG for ExLlama_HF (#3666 )	2023-08-24 16:27:36 -03:00
oobabooga	87442c6d18	Fix Notebook Logits tab	2023-08-22 21:00:12 -07:00
oobabooga	c0b119c3a3	Improve logit viewer format	2023-08-22 20:35:12 -07:00
oobabooga	8545052c9d	Add the option to use samplers in the logit viewer	2023-08-22 20:18:16 -07:00
oobabooga	25e5eaa6a6	Remove outdated training warning	2023-08-22 13:16:44 -07:00
oobabooga	335c49cc7e	Bump peft and transformers	2023-08-22 13:14:59 -07:00
cal066	e042bf8624	ctransformers: add mlock and no-mmap options (#3649 )	2023-08-22 16:51:34 -03:00
oobabooga	6cca8b8028	Only update notebook token counter on input For performance during streaming	2023-08-21 05:39:55 -07:00
oobabooga	2cb07065ec	Fix an escaping bug	2023-08-20 21:50:42 -07:00
oobabooga	a74dd9003f	Fix HTML escaping for perplexity_colors extension	2023-08-20 21:40:22 -07:00
oobabooga	57036abc76	Add "send to default/notebook" buttons to chat tab	2023-08-20 19:54:59 -07:00
oobabooga	429cacd715	Add a token counter similar to automatic1111 It can now be found in the Default and Notebook tabs	2023-08-20 19:37:33 -07:00
oobabooga	120fb86c6a	Add a simple logit viewer (#3636 )	2023-08-20 20:49:21 -03:00
oobabooga	ef17da70af	Fix ExLlama truncation	2023-08-20 08:53:26 -07:00
oobabooga	ee964bcce9	Update a comment about RoPE scaling	2023-08-20 07:01:43 -07:00
missionfloyd	1cae784761	Unescape last message (#3623 )	2023-08-19 09:29:08 -03:00
Cebtenzzre	942ad6067d	llama.cpp: make Stop button work with streaming disabled (#3620 )	2023-08-19 00:17:27 -03:00
oobabooga	f6724a1a01	Return the visible history with "Copy last reply"	2023-08-18 13:04:45 -07:00
oobabooga	b96fd22a81	Refactor the training tab (#3619 )	2023-08-18 16:58:38 -03:00
oobabooga	c4733000d7	Return the visible history with "Remove last"	2023-08-18 09:25:51 -07:00
oobabooga	7cba000421	Bump llama-cpp-python, +tensor_split by @shouyiwang, +mul_mat_q (#3610 )	2023-08-18 12:03:34 -03:00
oobabooga	bdb6eb5734	Restyle the chat input box + several CSS improvements - Remove extra spacing below the last chat message - Change the background color of code blocks in dark mode - Remove border radius from selected header bar elements - Make the chat scrollbar more discrete	2023-08-17 11:10:38 -07:00
oobabooga	cebe07f29c	Unescape HTML inside code blocks	2023-08-16 21:08:26 -07:00
oobabooga	a4e903e932	Escape HTML in chat messages	2023-08-16 09:25:52 -07:00
oobabooga	73d9befb65	Make "Show controls" customizable through settings.yaml	2023-08-16 07:04:18 -07:00
oobabooga	2a29208224	Add a "Show controls" button to chat UI (#3590 )	2023-08-16 02:39:58 -03:00
cal066	991bb57e43	ctransformers: Fix up model_type name consistency (#3567 )	2023-08-14 15:17:24 -03:00
oobabooga	ccfc02a28d	Add the --disable_exllama option for AutoGPTQ (#3545 from clefever/disable-exllama)	2023-08-14 15:15:55 -03:00
oobabooga	7e57b35b5e	Clean up old code	2023-08-14 10:10:39 -07:00
oobabooga	4d067e9b52	Add back a variable to keep old extensions working	2023-08-14 09:39:06 -07:00
oobabooga	d8a82d34ed	Improve a warning	2023-08-14 08:46:05 -07:00
oobabooga	3e0a9f9cdb	Refresh the character dropdown when saving/deleting a character	2023-08-14 08:23:41 -07:00
oobabooga	890b4abdad	Fix session saving	2023-08-14 07:55:52 -07:00
oobabooga	619cb4e78b	Add "save defaults to settings.yaml" button (#3574 )	2023-08-14 11:46:07 -03:00
oobabooga	a95e6f02cb	Add a placeholder for custom stopping strings	2023-08-13 21:17:20 -07:00
oobabooga	ff9b5861c8	Fix impersonate when some text is present (closes #3564 )	2023-08-13 21:10:47 -07:00
oobabooga	cc7e6ef645	Fix a CSS conflict	2023-08-13 19:24:09 -07:00
Eve	66c04c304d	Various ctransformers fixes (#3556 ) --------- Co-authored-by: cal066 <cal066@users.noreply.github.com>	2023-08-13 23:09:03 -03:00
oobabooga	4a05aa92cb	Add "send to" buttons for instruction templates - Remove instruction templates from prompt dropdowns (default/notebook) - Add 3 buttons to Parameters > Instruction template as a replacement - Increase the number of lines of 'negative prompt' field to 3, and add a scrollbar - When uploading a character, switch to the Character tab - When uploading chat history, switch to the Chat tab	2023-08-13 18:35:45 -07:00
oobabooga	f6db2c78d1	Fix ctransformers seed	2023-08-13 05:48:53 -07:00
oobabooga	a1a9ec895d	Unify the 3 interface modes (#3554 )	2023-08-13 01:12:15 -03:00
cal066	bf70c19603	ctransformers: move thread and seed parameters (#3543 )	2023-08-13 00:04:03 -03:00
Chris Lefever	0230fa4e9c	Add the --disable_exllama option for AutoGPTQ	2023-08-12 02:26:58 -04:00
oobabooga	0e05818266	Style changes	2023-08-11 16:35:57 -07:00
oobabooga	2f918ccf7c	Remove unused parameter	2023-08-11 11:15:22 -07:00
oobabooga	28c8df337b	Add repetition_penalty_range to ctransformers	2023-08-11 11:04:19 -07:00
cal066	7a4fcee069	Add ctransformers support (#3313 ) --------- Co-authored-by: cal066 <cal066@users.noreply.github.com> Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com> Co-authored-by: randoentity <137087500+randoentity@users.noreply.github.com>	2023-08-11 14:41:33 -03:00
oobabooga	8dbaa20ca8	Don't replace last reply with an empty message	2023-08-10 13:14:48 -07:00
oobabooga	0789554f65	Allow --lora to use an absolute path	2023-08-10 10:03:12 -07:00
oobabooga	3929971b66	Don't show oobabooga_llama-tokenizer in the model dropdown	2023-08-10 10:02:48 -07:00
oobabooga	c7f52bbdc1	Revert "Remove GPTQ-for-LLaMa monkey patch support" This reverts commit `e3d3565b2a`.	2023-08-10 08:39:41 -07:00
jllllll	d6765bebc4	Update installation documentation	2023-08-10 00:53:48 -05:00
jllllll	d7ee4c2386	Remove unused import	2023-08-10 00:10:14 -05:00
jllllll	e3d3565b2a	Remove GPTQ-for-LLaMa monkey patch support AutoGPTQ will be the preferred GPTQ LoRa loader in the future.	2023-08-09 23:59:04 -05:00
jllllll	bee73cedbd	Streamline GPTQ-for-LLaMa support	2023-08-09 23:42:34 -05:00
oobabooga	6c6a52aaad	Change the filenames for caches and histories	2023-08-09 07:47:19 -07:00
oobabooga	d8fb506aff	Add RoPE scaling support for transformers (including dynamic NTK) https://github.com/huggingface/transformers/pull/24653	2023-08-08 21:25:48 -07:00
Friedemann Lipphardt	901b028d55	Add option for named cloudflare tunnels (#3364 )	2023-08-08 22:20:27 -03:00
oobabooga	bf08b16b32	Fix disappearing profile picture bug	2023-08-08 14:09:01 -07:00
Gennadij	0e78f3b4d4	Fixed a typo in "rms_norm_eps", incorrectly set as n_gqa (#3494 )	2023-08-08 00:31:11 -03:00
oobabooga	37fb719452	Increase the Context/Greeting boxes sizes	2023-08-08 00:09:00 -03:00
oobabooga	584dd33424	Fix missing example_dialogue when uploading characters	2023-08-07 23:44:59 -03:00
oobabooga	412f6ff9d3	Change alpha_value maximum and step	2023-08-07 06:08:51 -07:00
oobabooga	a373c96d59	Fix a bug in modules/shared.py	2023-08-06 20:36:35 -07:00
oobabooga	3d48933f27	Remove ancient deprecation warnings	2023-08-06 18:58:59 -07:00
oobabooga	c237ce607e	Move characters/instruction-following to instruction-templates	2023-08-06 17:50:32 -07:00
oobabooga	65aa11890f	Refactor everything (#3481 )	2023-08-06 21:49:27 -03:00
oobabooga	d4b851bdc8	Credit turboderp	2023-08-06 13:43:15 -07:00
oobabooga	0af10ab49b	Add Classifier Free Guidance (CFG) for Transformers/ExLlama (#3325 )	2023-08-06 17:22:48 -03:00
missionfloyd	5134878344	Fix chat message order (#3461 )	2023-08-05 13:53:54 -03:00
jllllll	44f31731af	Create logs dir if missing when saving history (#3462 )	2023-08-05 13:47:16 -03:00
Forkoz	9dcb37e8d4	Fix: Mirostat fails on models split across multiple GPUs	2023-08-05 13:45:47 -03:00
oobabooga	8df3cdfd51	Add SSL certificate support (#3453 )	2023-08-04 13:57:31 -03:00
missionfloyd	2336b75d92	Remove unnecessary chat.js (#3445 )	2023-08-04 01:58:37 -03:00
oobabooga	4b3384e353	Handle unfinished lists during markdown streaming	2023-08-03 17:15:18 -07:00
Pete	f4005164f4	Fix llama.cpp truncation (#3400 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2023-08-03 20:01:15 -03:00
oobabooga	87dab03dc0	Add the --cpu option for llama.cpp to prevent CUDA from being used (#3432 )	2023-08-03 11:00:36 -03:00
oobabooga	3e70bce576	Properly format exceptions in the UI	2023-08-03 06:57:21 -07:00
oobabooga	32c564509e	Fix loading session in chat mode	2023-08-02 21:13:16 -07:00
oobabooga	0e8f9354b5	Add direct download for session/chat history JSONs	2023-08-02 19:43:39 -07:00
oobabooga	32a2bbee4a	Implement auto_max_new_tokens for ExLlama	2023-08-02 11:03:56 -07:00
oobabooga	e931844fe2	Add auto_max_new_tokens parameter (#3419 )	2023-08-02 14:52:20 -03:00
Pete	6afc1a193b	Add a scrollbar to notebook/default, improve chat scrollbar style (#3403 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2023-08-02 12:02:36 -03:00
oobabooga	b53ed70a70	Make llamacpp_HF 6x faster	2023-08-01 13:18:20 -07:00
oobabooga	8d46a8c50a	Change the default chat style and the default preset	2023-08-01 09:35:17 -07:00
oobabooga	959feba602	When saving model settings, only save the settings for the current loader	2023-08-01 06:10:09 -07:00
oobabooga	f094330df0	When saving a preset, only save params that differ from the defaults	2023-07-31 19:13:29 -07:00
oobabooga	84297d05c4	Add a "Filter by loader" menu to the Parameters tab	2023-07-31 19:09:02 -07:00
oobabooga	7de7b3d495	Fix newlines in exported character yamls	2023-07-31 10:46:02 -07:00
oobabooga	5ca37765d3	Only replace {{user}} and {{char}} at generation time	2023-07-30 11:42:30 -07:00
oobabooga	6e16af34fd	Save uploaded characters as yaml Also allow yaml characters to be uploaded directly	2023-07-30 11:25:38 -07:00
oobabooga	b31321c779	Define visible_text before applying chat_input extensions	2023-07-26 07:27:14 -07:00
oobabooga	b17893a58f	Revert "Add tensor split support for llama.cpp (#3171 )" This reverts commit `031fe7225e`.	2023-07-26 07:06:01 -07:00
oobabooga	28779cd959	Use dark theme by default	2023-07-25 20:11:57 -07:00
oobabooga	c2e0d46616	Add credits	2023-07-25 15:49:04 -07:00
oobabooga	77d2e9f060	Remove flexgen 2	2023-07-25 15:18:25 -07:00
oobabooga	75c2dd38cf	Remove flexgen support	2023-07-25 15:15:29 -07:00
Foxtr0t1337	85b3a26e25	Ignore values which are not string in training.py (#3287 )	2023-07-25 19:00:25 -03:00
Shouyi	031fe7225e	Add tensor split support for llama.cpp (#3171 )	2023-07-25 18:59:26 -03:00
Eve	f653546484	README updates and improvements (#3198 )	2023-07-25 18:58:13 -03:00
oobabooga	ef8637e32d	Add extension example, replace input_hijack with chat_input_modifier (#3307 )	2023-07-25 18:49:56 -03:00
oobabooga	a07d070b6c	Add llama-2-70b GGML support (#3285 )	2023-07-24 16:37:03 -03:00
jllllll	1141987a0d	Add checks for ROCm and unsupported architectures to llama_cpp_cuda loading (#3225 )	2023-07-24 11:25:36 -03:00
Ikko Eltociear Ashimine	b2d5433409	Fix typo in deepspeed_parameters.py (#3222 ) configration -> configuration	2023-07-24 11:17:28 -03:00
oobabooga	4b19b74e6c	Add CUDA wheels for llama-cpp-python by jllllll	2023-07-19 19:33:43 -07:00
oobabooga	913e060348	Change the default preset to Divine Intellect It seems to reduce hallucination while using instruction-tuned models.	2023-07-19 08:24:37 -07:00
randoentity	a69955377a	[GGML] Support for customizable RoPE (#3083 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2023-07-17 22:32:37 -03:00
appe233	89e0d15cf5	Use 'torch.backends.mps.is_available' to check if mps is supported (#3164 )	2023-07-17 21:27:18 -03:00
oobabooga	8c1c2e0fae	Increase max_new_tokens upper limit	2023-07-17 17:08:22 -07:00
oobabooga	b1a6ea68dd	Disable "autoload the model" by default	2023-07-17 07:40:56 -07:00
oobabooga	a199f21799	Optimize llamacpp_hf a bit	2023-07-16 20:49:48 -07:00
oobabooga	6a3edb0542	Clean up llamacpp_hf.py	2023-07-15 22:40:55 -07:00
oobabooga	27a84b4e04	Make AutoGPTQ the default again Purely for compatibility with more models. You should still use ExLlama_HF for LLaMA models.	2023-07-15 22:29:23 -07:00
oobabooga	5e3f7e00a9	Create llamacpp_HF loader (#3062 )	2023-07-16 02:21:13 -03:00
oobabooga	94dfcec237	Make it possible to evaluate exllama perplexity (#3138 )	2023-07-16 01:52:55 -03:00
oobabooga	b284f2407d	Make ExLlama_HF the new default for GPTQ	2023-07-14 14:03:56 -07:00

1 2 3 4 5 ...

991 Commits