text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2025-01-28 04:47:18 +01:00

Author	SHA1	Message	Date
oobabooga	0af10ab49b	Add Classifier Free Guidance (CFG) for Transformers/ExLlama (#3325 )	2023-08-06 17:22:48 -03:00
missionfloyd	5134878344	Fix chat message order (#3461 )	2023-08-05 13:53:54 -03:00
jllllll	44f31731af	Create logs dir if missing when saving history (#3462 )	2023-08-05 13:47:16 -03:00
Forkoz	9dcb37e8d4	Fix: Mirostat fails on models split across multiple GPUs	2023-08-05 13:45:47 -03:00
oobabooga	8df3cdfd51	Add SSL certificate support (#3453 )	2023-08-04 13:57:31 -03:00
missionfloyd	2336b75d92	Remove unnecessary chat.js (#3445 )	2023-08-04 01:58:37 -03:00
oobabooga	4b3384e353	Handle unfinished lists during markdown streaming	2023-08-03 17:15:18 -07:00
Pete	f4005164f4	Fix llama.cpp truncation (#3400 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2023-08-03 20:01:15 -03:00
oobabooga	87dab03dc0	Add the --cpu option for llama.cpp to prevent CUDA from being used (#3432 )	2023-08-03 11:00:36 -03:00
oobabooga	3e70bce576	Properly format exceptions in the UI	2023-08-03 06:57:21 -07:00
oobabooga	32c564509e	Fix loading session in chat mode	2023-08-02 21:13:16 -07:00
oobabooga	0e8f9354b5	Add direct download for session/chat history JSONs	2023-08-02 19:43:39 -07:00
oobabooga	32a2bbee4a	Implement auto_max_new_tokens for ExLlama	2023-08-02 11:03:56 -07:00
oobabooga	e931844fe2	Add auto_max_new_tokens parameter (#3419 )	2023-08-02 14:52:20 -03:00
Pete	6afc1a193b	Add a scrollbar to notebook/default, improve chat scrollbar style (#3403 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2023-08-02 12:02:36 -03:00
oobabooga	b53ed70a70	Make llamacpp_HF 6x faster	2023-08-01 13:18:20 -07:00
oobabooga	8d46a8c50a	Change the default chat style and the default preset	2023-08-01 09:35:17 -07:00
oobabooga	959feba602	When saving model settings, only save the settings for the current loader	2023-08-01 06:10:09 -07:00
oobabooga	f094330df0	When saving a preset, only save params that differ from the defaults	2023-07-31 19:13:29 -07:00
oobabooga	84297d05c4	Add a "Filter by loader" menu to the Parameters tab	2023-07-31 19:09:02 -07:00
oobabooga	7de7b3d495	Fix newlines in exported character yamls	2023-07-31 10:46:02 -07:00
oobabooga	5ca37765d3	Only replace {{user}} and {{char}} at generation time	2023-07-30 11:42:30 -07:00
oobabooga	6e16af34fd	Save uploaded characters as yaml Also allow yaml characters to be uploaded directly	2023-07-30 11:25:38 -07:00
oobabooga	b31321c779	Define visible_text before applying chat_input extensions	2023-07-26 07:27:14 -07:00
oobabooga	b17893a58f	Revert "Add tensor split support for llama.cpp (#3171 )" This reverts commit `031fe7225e`.	2023-07-26 07:06:01 -07:00
oobabooga	28779cd959	Use dark theme by default	2023-07-25 20:11:57 -07:00
oobabooga	c2e0d46616	Add credits	2023-07-25 15:49:04 -07:00
oobabooga	77d2e9f060	Remove flexgen 2	2023-07-25 15:18:25 -07:00
oobabooga	75c2dd38cf	Remove flexgen support	2023-07-25 15:15:29 -07:00
Foxtr0t1337	85b3a26e25	Ignore values which are not string in training.py (#3287 )	2023-07-25 19:00:25 -03:00
Shouyi	031fe7225e	Add tensor split support for llama.cpp (#3171 )	2023-07-25 18:59:26 -03:00
Eve	f653546484	README updates and improvements (#3198 )	2023-07-25 18:58:13 -03:00
oobabooga	ef8637e32d	Add extension example, replace input_hijack with chat_input_modifier (#3307 )	2023-07-25 18:49:56 -03:00
oobabooga	a07d070b6c	Add llama-2-70b GGML support (#3285 )	2023-07-24 16:37:03 -03:00
jllllll	1141987a0d	Add checks for ROCm and unsupported architectures to llama_cpp_cuda loading (#3225 )	2023-07-24 11:25:36 -03:00
Ikko Eltociear Ashimine	b2d5433409	Fix typo in deepspeed_parameters.py (#3222 ) configration -> configuration	2023-07-24 11:17:28 -03:00
oobabooga	4b19b74e6c	Add CUDA wheels for llama-cpp-python by jllllll	2023-07-19 19:33:43 -07:00
oobabooga	913e060348	Change the default preset to Divine Intellect It seems to reduce hallucination while using instruction-tuned models.	2023-07-19 08:24:37 -07:00
randoentity	a69955377a	[GGML] Support for customizable RoPE (#3083 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2023-07-17 22:32:37 -03:00
appe233	89e0d15cf5	Use 'torch.backends.mps.is_available' to check if mps is supported (#3164 )	2023-07-17 21:27:18 -03:00
oobabooga	8c1c2e0fae	Increase max_new_tokens upper limit	2023-07-17 17:08:22 -07:00
oobabooga	b1a6ea68dd	Disable "autoload the model" by default	2023-07-17 07:40:56 -07:00
oobabooga	a199f21799	Optimize llamacpp_hf a bit	2023-07-16 20:49:48 -07:00
oobabooga	6a3edb0542	Clean up llamacpp_hf.py	2023-07-15 22:40:55 -07:00
oobabooga	27a84b4e04	Make AutoGPTQ the default again Purely for compatibility with more models. You should still use ExLlama_HF for LLaMA models.	2023-07-15 22:29:23 -07:00
oobabooga	5e3f7e00a9	Create llamacpp_HF loader (#3062 )	2023-07-16 02:21:13 -03:00
oobabooga	94dfcec237	Make it possible to evaluate exllama perplexity (#3138 )	2023-07-16 01:52:55 -03:00
oobabooga	b284f2407d	Make ExLlama_HF the new default for GPTQ	2023-07-14 14:03:56 -07:00
Morgan Schweers	6d1e911577	Add support for logits processors in extensions (#3029 )	2023-07-13 17:22:41 -03:00
oobabooga	e202190c4f	lint	2023-07-12 11:33:25 -07:00

1 2 3 4 5 ...

789 Commits