text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-11-25 17:29:22 +01:00

Author	SHA1	Message	Date
zappityzap	317e6f9b6c	exclude .modelfile extension	2024-11-02 11:14:10 -07:00
zappityzap	b6c4e35fd8	exclude tokenizer from list	2024-11-02 11:02:49 -07:00
zappityzap	eaf9980c9a	lint whitespace	2024-11-02 10:49:39 -07:00
zappityzap	32c2bb3cb8	switch to os.walk, allow symlinks	2024-11-01 20:19:12 -07:00
PIRI	e1061ba7e3	Make token bans work again on HF loaders (#6488 )	2024-10-24 15:24:02 -03:00
oobabooga	2468cfd8bb	Merge remote-tracking branch 'refs/remotes/origin/dev' into dev	2024-10-14 13:25:27 -07:00
oobabooga	bb62e796eb	Fix locally compiled llama-cpp-python failing to import	2024-10-14 13:24:13 -07:00
oobabooga	c9a9f63d1b	Fix llama.cpp loader not being random (thanks @reydeljuego12345)	2024-10-14 13:07:07 -07:00
PIRI	03a2e70054	Fix temperature_last when temperature not in sampler priority (#6439 )	2024-10-09 11:25:14 -03:00
oobabooga	49dfa0adaf	Fix the "save preset" event	2024-10-01 11:20:48 -07:00
oobabooga	93c250b9b6	Add a UI element for enable_tp	2024-10-01 11:16:15 -07:00
oobabooga	cca9d6e22d	Lint	2024-10-01 10:21:06 -07:00
oobabooga	4d9ce586d3	Update llama_cpp_python_hijack.py, fix llamacpp_hf	2024-09-30 14:49:21 -07:00
oobabooga	bbdeed3cf4	Make sampler priority high if unspecified	2024-09-29 20:45:27 -07:00
Manuel Schmid	0f90a1b50f	Do not set value for histories in chat when --multi-user is used (#6317 )	2024-09-29 01:08:55 -03:00
oobabooga	c61b29b9ce	Simplify the warning when flash-attn fails to import	2024-09-28 20:33:17 -07:00
oobabooga	b92d7fd43e	Add warnings for when AutoGPTQ, TensorRT-LLM, or HQQ are missing	2024-09-28 20:30:24 -07:00
oobabooga	7276dca933	Fix a typo	2024-09-27 20:28:17 -07:00
RandoInternetPreson	46996f6519	ExllamaV2 tensor parallelism to increase multi gpu inference speeds (#6356 )	2024-09-28 00:26:03 -03:00
Philipp Emanuel Weidmann	301375834e	Exclude Top Choices (XTC): A sampler that boosts creativity, breaks writing clichés, and inhibits non-verbatim repetition (#6335 )	2024-09-27 22:50:12 -03:00
oobabooga	5c918c5b2d	Make it possible to sort DRY	2024-09-27 15:40:48 -07:00
oobabooga	7424f789bf	Fix the sampling monkey patch (and add more options to sampler_priority) (#6411 )	2024-09-27 19:03:25 -03:00
oobabooga	bba5b36d33	Don't import PEFT unless necessary	2024-09-03 19:40:53 -07:00
oobabooga	c5b40eb555	llama.cpp: prevent prompt evaluation progress bar with just 1 step	2024-09-03 17:37:06 -07:00
GralchemOz	4c74c7a116	Fix UnicodeDecodeError for BPE-based Models (especially GLM-4) (#6357 )	2024-09-02 23:00:59 -03:00
oobabooga	fd9cb26619	UI: update the DRY parameters descriptions/order	2024-08-19 19:40:17 -07:00
oobabooga	e926c03b3d	Add a --tokenizer-dir command-line flag for llamacpp_HF	2024-08-06 19:41:18 -07:00
oobabooga	30b4d8c8b2	Fix Llama 3.1 template including lengthy "tools" headers	2024-07-29 11:52:17 -07:00
oobabooga	9dcff21da9	Remove unnecessary shared.previous_model_name variable	2024-07-28 18:35:11 -07:00
oobabooga	514fb2e451	Fix UI error caused by --idle-timeout	2024-07-28 18:30:06 -07:00
oobabooga	5223c009fe	Minor change after previous commit	2024-07-27 23:13:34 -07:00
oobabooga	7050bb880e	UI: make n_ctx/max_seq_len/truncation_length numbers rather than sliders	2024-07-27 23:11:53 -07:00
Harry	078e8c8969	Make compress_pos_emb float (#6276 )	2024-07-28 03:03:19 -03:00
oobabooga	ffc713f72b	UI: fix multiline LaTeX equations	2024-07-27 15:36:10 -07:00
oobabooga	493f8c3242	UI: remove animation after clicking on "Stop" in the Chat tab	2024-07-27 15:22:34 -07:00
oobabooga	e4d411b841	UI: fix rendering LaTeX enclosed between \[ and \]	2024-07-27 15:21:44 -07:00
oobabooga	f32d26240d	UI: Fix the chat "stop" event	2024-07-26 23:03:05 -07:00
oobabooga	b80d5906c2	UI: fix saving characters	2024-07-25 15:09:31 -07:00
oobabooga	42e80108f5	UI: clear the markdown LRU cache when using the default/notebook tabs	2024-07-25 08:01:42 -07:00
oobabooga	7e2851e505	UI: fix "Command for chat-instruct mode" not appearing by default	2024-07-24 15:04:12 -07:00
oobabooga	947016d010	UI: make the markdown LRU cache infinite (for really long conversations)	2024-07-24 11:54:26 -07:00
oobabooga	e637b702ff	UI: make text between quotes colored in chat mode	2024-07-23 21:30:32 -07:00
oobabooga	1815877061	UI: fix the default character not loading correctly on startup	2024-07-23 18:48:10 -07:00
oobabooga	e6181e834a	Remove AutoAWQ as a standalone loader (it works better through transformers)	2024-07-23 15:31:17 -07:00
oobabooga	f18c947a86	Update the tensorcores description	2024-07-22 18:06:41 -07:00
oobabooga	aa809e420e	Bump llama-cpp-python to 0.2.83, add back tensorcore wheels Also add back the progress bar patch	2024-07-22 18:05:11 -07:00
oobabooga	11bbf71aa5	Bump back llama-cpp-python (#6257 )	2024-07-22 16:19:41 -03:00
oobabooga	0f53a736c1	Revert the llama-cpp-python update	2024-07-22 12:02:25 -07:00
oobabooga	a687f950ba	Remove the tensorcores llama.cpp wheels They are not faster than the default wheels anymore and they use a lot of space.	2024-07-22 11:54:35 -07:00
oobabooga	017d2332ea	Remove no longer necessary llama-cpp-python patch	2024-07-22 11:50:36 -07:00

1 2 3 4 5 ...

1448 Commits