text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-11-22 08:07:56 +01:00

Author	SHA1	Message	Date
oobabooga	c5b40eb555	llama.cpp: prevent prompt evaluation progress bar with just 1 step	2024-09-03 17:37:06 -07:00
oobabooga	aa809e420e	Bump llama-cpp-python to 0.2.83, add back tensorcore wheels. Also add back the progress bar patch.	2024-07-22 18:05:11 -07:00
oobabooga	11bbf71aa5	Bump back llama-cpp-python (#6257)	2024-07-22 16:19:41 -03:00
oobabooga	0f53a736c1	Revert the llama-cpp-python update	2024-07-22 12:02:25 -07:00
oobabooga	a687f950ba	Remove the tensorcores llama.cpp wheels. They are not faster than the default wheels anymore and they use a lot of space.	2024-07-22 11:54:35 -07:00
oobabooga	017d2332ea	Remove no longer necessary llama-cpp-python patch	2024-07-22 11:50:36 -07:00
InvectorGator	4148a9201f	Fix for MacOS users encountering model load errors (#6227). Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>; Co-authored-by: Invectorgator <Kudzu12gaming@outlook.com>	2024-07-13 00:04:19 -03:00
oobabooga	512b311137	Improve the llama-cpp-python exception messages	2024-07-11 13:00:29 -07:00
oobabooga	aa653e3b5a	Prevent llama.cpp from being monkey patched more than once (closes #6201)	2024-07-05 03:34:15 -07:00
oobabooga	a47de06088	Force only 1 llama-cpp-python version at a time for now	2024-07-04 19:43:34 -07:00
oobabooga	f243b4ca9c	Make llama-cpp-python not crash immediately	2024-07-04 19:16:00 -07:00
oobabooga	51fb766bea	Add back my llama-cpp-python wheels, bump to 0.2.65 (#5964)	2024-04-30 09:11:31 -03:00
oobabooga	9b623b8a78	Bump llama-cpp-python to 0.2.64, use official wheels (#5921)	2024-04-23 23:17:05 -03:00
oobabooga	3e3a7c4250	Bump llama-cpp-python to 0.2.61 & fix the crash	2024-04-11 14:15:34 -07:00
oobabooga	afb51bd5d6	Add StreamingLLM for llamacpp & llamacpp_HF (2nd attempt) (#5669)	2024-03-09 00:25:33 -03:00
oobabooga	069ed7c6ef	Lint	2024-02-13 16:05:41 -08:00
oobabooga	86c320ab5a	llama.cpp: add a progress bar for prompt evaluation	2024-02-07 21:56:10 -08:00

17 Commits