text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2025-01-12 13:27:38 +01:00

Author	SHA1	Message	Date
oobabooga	b92d7fd43e	Add warnings for when AutoGPTQ, TensorRT-LLM, or HQQ are missing	2024-09-28 20:30:24 -07:00
oobabooga	7276dca933	Fix a typo	2024-09-27 20:28:17 -07:00
RandoInternetPreson	46996f6519	ExllamaV2 tensor parallelism to increase multi gpu inference speeds (#6356 )	2024-09-28 00:26:03 -03:00
Philipp Emanuel Weidmann	301375834e	Exclude Top Choices (XTC): A sampler that boosts creativity, breaks writing clichés, and inhibits non-verbatim repetition (#6335 )	2024-09-27 22:50:12 -03:00
oobabooga	5c918c5b2d	Make it possible to sort DRY	2024-09-27 15:40:48 -07:00
oobabooga	7424f789bf	Fix the sampling monkey patch (and add more options to sampler_priority) (#6411 )	2024-09-27 19:03:25 -03:00
oobabooga	bba5b36d33	Don't import PEFT unless necessary	2024-09-03 19:40:53 -07:00
oobabooga	c5b40eb555	llama.cpp: prevent prompt evaluation progress bar with just 1 step	2024-09-03 17:37:06 -07:00
GralchemOz	4c74c7a116	Fix UnicodeDecodeError for BPE-based Models (especially GLM-4) (#6357 )	2024-09-02 23:00:59 -03:00
oobabooga	fd9cb26619	UI: update the DRY parameters descriptions/order	2024-08-19 19:40:17 -07:00
oobabooga	e926c03b3d	Add a --tokenizer-dir command-line flag for llamacpp_HF	2024-08-06 19:41:18 -07:00
oobabooga	30b4d8c8b2	Fix Llama 3.1 template including lengthy "tools" headers	2024-07-29 11:52:17 -07:00
oobabooga	9dcff21da9	Remove unnecessary shared.previous_model_name variable	2024-07-28 18:35:11 -07:00
oobabooga	514fb2e451	Fix UI error caused by --idle-timeout	2024-07-28 18:30:06 -07:00
oobabooga	5223c009fe	Minor change after previous commit	2024-07-27 23:13:34 -07:00
oobabooga	7050bb880e	UI: make n_ctx/max_seq_len/truncation_length numbers rather than sliders	2024-07-27 23:11:53 -07:00
Harry	078e8c8969	Make compress_pos_emb float (#6276 )	2024-07-28 03:03:19 -03:00
oobabooga	ffc713f72b	UI: fix multiline LaTeX equations	2024-07-27 15:36:10 -07:00
oobabooga	493f8c3242	UI: remove animation after clicking on "Stop" in the Chat tab	2024-07-27 15:22:34 -07:00
oobabooga	e4d411b841	UI: fix rendering LaTeX enclosed between \[ and \]	2024-07-27 15:21:44 -07:00
oobabooga	f32d26240d	UI: Fix the chat "stop" event	2024-07-26 23:03:05 -07:00
oobabooga	b80d5906c2	UI: fix saving characters	2024-07-25 15:09:31 -07:00
oobabooga	42e80108f5	UI: clear the markdown LRU cache when using the default/notebook tabs	2024-07-25 08:01:42 -07:00
oobabooga	7e2851e505	UI: fix "Command for chat-instruct mode" not appearing by default	2024-07-24 15:04:12 -07:00
oobabooga	947016d010	UI: make the markdown LRU cache infinite (for really long conversations)	2024-07-24 11:54:26 -07:00
oobabooga	e637b702ff	UI: make text between quotes colored in chat mode	2024-07-23 21:30:32 -07:00
oobabooga	1815877061	UI: fix the default character not loading correctly on startup	2024-07-23 18:48:10 -07:00
oobabooga	e6181e834a	Remove AutoAWQ as a standalone loader (it works better through transformers)	2024-07-23 15:31:17 -07:00
oobabooga	f18c947a86	Update the tensorcores description	2024-07-22 18:06:41 -07:00
oobabooga	aa809e420e	Bump llama-cpp-python to 0.2.83, add back tensorcore wheels Also add back the progress bar patch	2024-07-22 18:05:11 -07:00
oobabooga	11bbf71aa5	Bump back llama-cpp-python (#6257 )	2024-07-22 16:19:41 -03:00
oobabooga	0f53a736c1	Revert the llama-cpp-python update	2024-07-22 12:02:25 -07:00
oobabooga	a687f950ba	Remove the tensorcores llama.cpp wheels They are not faster than the default wheels anymore and they use a lot of space.	2024-07-22 11:54:35 -07:00
oobabooga	017d2332ea	Remove no longer necessary llama-cpp-python patch	2024-07-22 11:50:36 -07:00
oobabooga	f2d802e707	UI: make Default/Notebook contents persist on page reload	2024-07-22 11:07:10 -07:00
oobabooga	8768b69a2d	Lint	2024-07-21 22:08:14 -07:00
oobabooga	79e8dbe45f	UI: minor optimization	2024-07-21 22:06:49 -07:00
oobabooga	7ef2414357	UI: Make the file saving dialogs more robust	2024-07-21 15:38:20 -07:00
oobabooga	423372d6e7	Organize ui_file_saving.py	2024-07-21 13:23:18 -07:00
oobabooga	17df2d7bdf	UI: don't export the instruction template on "Save UI defaults to settings.yaml"	2024-07-21 10:45:01 -07:00
oobabooga	d05846eae5	UI: refresh the pfp cache on handle_your_picture_change	2024-07-21 10:17:22 -07:00
oobabooga	e9d4bff7d0	Update the --tensor_split description	2024-07-20 22:04:48 -07:00
oobabooga	916d1d8283	UI: improve the style of code blocks in light theme	2024-07-20 20:32:57 -07:00
oobabooga	564d8c8c0d	Make alpha_value a float number	2024-07-20 20:02:54 -07:00
oobabooga	79c4d3da3d	Optimize the UI (#6251 )	2024-07-21 00:01:42 -03:00
Alberto Cano	a14c510afb	Customize the subpath for gradio, use with reverse proxy (#5106 )	2024-07-20 19:10:39 -03:00
Vhallo	a9a6d72d8c	Use gr.Number for RoPE scaling parameters (#6233 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2024-07-20 18:57:09 -03:00
oobabooga	aa7c14a463	Use chat-instruct mode by default	2024-07-19 21:43:52 -07:00
InvectorGator	4148a9201f	Fix for MacOS users encountering model load errors (#6227 ) --------- Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com> Co-authored-by: Invectorgator <Kudzu12gaming@outlook.com>	2024-07-13 00:04:19 -03:00
oobabooga	e436d69e2b	Add --no_xformers and --no_sdpa flags for ExllamaV2	2024-07-11 15:47:37 -07:00

1 2 3 4 5 ...

1432 Commits