text-generation-webui

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-11-26 17:50:22 +01:00

Author	SHA1	Message	Date
oobabooga	419c34eca4	Update GPTQ-models-(4-bit-mode).md	2023-05-31 23:49:00 -03:00
oobabooga	486ddd62df	Add tfs and top_a to the API examples	2023-05-31 23:44:38 -03:00
oobabooga	b6c407f51d	Don't stream at more than 24 fps. This is a performance optimization.	2023-05-31 23:41:42 -03:00
oobabooga	a160230893	Update GPTQ-models-(4-bit-mode).md	2023-05-31 23:38:15 -03:00
oobabooga	2cdf525d3b	Bump llama-cpp-python version	2023-05-31 23:29:02 -03:00
jllllll	412e7a6a96	Update README.md to include missing flags (#2449)	2023-05-31 11:07:56 -03:00
AlpinDale	6627f7feb9	Add notice about downgrading `gcc` and `g++` (#2446)	2023-05-30 22:28:53 -03:00
Atinoda	bfbd13ae89	Update docker repo link (#2340)	2023-05-30 22:14:49 -03:00
matatonic	a6d3f010a5	extensions/openai: include all available models in Model.list (#2368) Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>	2023-05-30 22:13:37 -03:00
matatonic	e5b756ecfe	Fixes #2331, IndexError: string index out of range (#2383)	2023-05-30 22:07:40 -03:00
Juan M Uys	b984a44f47	fix error when downloading a model for the first time (#2404)	2023-05-30 22:07:12 -03:00
Yiximail	4715123f55	Add a `/api/v1/stop-stream` API that allows the user to interrupt the generation (#2392)	2023-05-30 22:03:40 -03:00
matatonic	ebcadc0042	extensions/openai: cross_origin + chunked_response (updated fix) (#2423)	2023-05-30 21:54:24 -03:00
matatonic	df50f077db	fixup missing tfs top_a params, defaults reorg (#2443)	2023-05-30 21:52:33 -03:00
Forkoz	9ab90d8b60	Fix warning for qlora (#2438)	2023-05-30 11:09:18 -03:00
oobabooga	0db4e191bd	Improve chat buttons on mobile devices	2023-05-30 00:30:15 -03:00
oobabooga	3209440b7c	Rearrange chat buttons	2023-05-30 00:17:31 -03:00
oobabooga	3578dd3611	Change a warning message	2023-05-29 22:40:54 -03:00
oobabooga	3a6e194bc7	Change a warning message	2023-05-29 22:39:23 -03:00
oobabooga	e763ace593	Update GPTQ-models-(4-bit-mode).md	2023-05-29 22:35:49 -03:00
oobabooga	86ef695d37	Update GPTQ-models-(4-bit-mode).md	2023-05-29 22:20:55 -03:00
oobabooga	8e0a997c60	Add new parameters to API extension	2023-05-29 22:03:08 -03:00
Luis Lopez	9e7204bef4	Add tail-free and top-a sampling (#2357)	2023-05-29 21:40:01 -03:00
oobabooga	b4662bf4af	Download gptq_model*.py using download-model.py	2023-05-29 16:12:54 -03:00
oobabooga	540a161a08	Update GPTQ-models-(4-bit-mode).md	2023-05-29 15:45:40 -03:00
oobabooga	b8d2f6d876	Merge remote-tracking branch 'refs/remotes/origin/main'	2023-05-29 15:33:05 -03:00
oobabooga	1394f44e14	Add triton checkbox for AutoGPTQ	2023-05-29 15:32:45 -03:00
oobabooga	166a0d9893	Update GPTQ-models-(4-bit-mode).md	2023-05-29 15:07:59 -03:00
oobabooga	962d05ca7e	Update README.md	2023-05-29 14:56:55 -03:00
oobabooga	4a190a98fd	Update GPTQ-models-(4-bit-mode).md	2023-05-29 14:56:05 -03:00
matatonic	2b7ba9586f	Fixes #2326, KeyError: 'assistant' (#2382)	2023-05-29 14:19:57 -03:00
oobabooga	6de727c524	Improve Eta Sampling preset	2023-05-29 13:56:15 -03:00
oobabooga	f34d20922c	Minor fix	2023-05-29 13:31:17 -03:00
oobabooga	983eef1e29	Attempt at evaluating falcon perplexity (failed)	2023-05-29 13:28:25 -03:00
Honkware	204731952a	Falcon support (trust-remote-code and autogptq checkboxes) (#2367) Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>	2023-05-29 10:20:18 -03:00
Forkoz	60ae80cf28	Fix hang in tokenizer for AutoGPTQ llama models. (#2399)	2023-05-28 23:10:10 -03:00
oobabooga	2f811b1bdf	Change a warning message	2023-05-28 22:48:20 -03:00
oobabooga	9ee1e37121	Fix return message when no model is loaded	2023-05-28 22:46:32 -03:00
oobabooga	f27135bdd3	Add Eta Sampling preset. Also remove some presets that I do not consider relevant.	2023-05-28 22:44:35 -03:00
oobabooga	00ebea0b2a	Use YAML for presets and settings	2023-05-28 22:34:12 -03:00
Elias Vincent Simon	2cf711f35e	update SpeechRecognition dependency (#2345)	2023-05-26 00:34:57 -03:00
jllllll	78dbec4c4e	Add 'scipy' to requirements.txt #2335 (#2343). Unlisted dependency of bitsandbytes.	2023-05-25 23:26:25 -03:00
Luis Lopez	0dbc3d9b2c	Fix get_documents_ids_distances return error when n_results = 0 (#2347)	2023-05-25 23:25:36 -03:00
jllllll	07a4f0569f	Update README.md to account for BnB Windows wheel (#2341)	2023-05-25 18:44:26 -03:00
oobabooga	acfd876f29	Some qol changes to "Perplexity evaluation"	2023-05-25 15:06:22 -03:00
oobabooga	8efdc01ffb	Better default for compute_dtype	2023-05-25 15:05:53 -03:00
oobabooga	fc33216477	Small fix for n_ctx in llama.cpp	2023-05-25 13:55:51 -03:00
oobabooga	35009c32f0	Beautify all CSS	2023-05-25 13:12:34 -03:00
oobabooga	231305d0f5	Update README.md	2023-05-25 12:05:08 -03:00
oobabooga	37d4ad012b	Add a button for rendering markdown for any model	2023-05-25 11:59:27 -03:00


1918 Commits